Base AI models evade detection, new research shows

By PulseAugur Editorial · [1 sources] · 2026-05-19 08:13

A new research paper reveals that base AI models, unlike their instruction-tuned counterparts, are often misclassified as human by popular AI text detectors like GPTZero and Pangram. The study proposes a method called Humanization by Iterative Paraphrasing (HIP) to fine-tune base models into paraphrasers, which can then iteratively refine generated text to evade detection. This technique, tested on Llama-3 and Qwen-3 models across various sizes, demonstrates improved detector evasion while preserving semantic meaning, suggesting current detectors may be tracking instruction-tuning artifacts rather than inherent machine-generated text qualities. AI

IMPACT New methods for evading AI text detection could impact academic integrity and content authenticity verification.

RANK_REASON Academic paper detailing a new method for evading AI text detectors. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Base AI models evade detection, new research shows

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · J. Zico Kolter · 2026-05-19 08:13

Base Models Look Human To AI Detectors

As AI-generated text enters the real-world at scale, institutions increasingly use commercial AI-text detectors, especially in education and academic-integrity workflows. We report a surprising empirical finding about such systems: when evaluated by GPTZero and Pangram, generated…

COVERAGE [1]

Base Models Look Human To AI Detectors

RELATED ENTITIES

RELATED TOPICS