A new research paper reveals that base AI models, unlike their instruction-tuned counterparts, are often misclassified as human by popular AI text detectors like GPTZero and Pangram. The study proposes a method called Humanization by Iterative Paraphrasing (HIP) to fine-tune base models into paraphrasers, which can then iteratively refine generated text to evade detection. This technique, tested on Llama-3 and Qwen-3 models across various sizes, demonstrates improved detector evasion while preserving semantic meaning, suggesting current detectors may be tracking instruction-tuning artifacts rather than inherent machine-generated text qualities. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT New methods for evading AI text detection could impact academic integrity and content authenticity verification.
RANK_REASON Academic paper detailing a new method for evading AI text detectors. [lever_c_demoted from research: ic=1 ai=1.0]