PulseAugur
LIVE 21:31:09
tool · [1 source] ·
1
tool

Base AI models evade detection, new research shows

A new research paper reveals that base AI models, unlike their instruction-tuned counterparts, are often misclassified as human by popular AI text detectors like GPTZero and Pangram. The study proposes a method called Humanization by Iterative Paraphrasing (HIP) to fine-tune base models into paraphrasers, which can then iteratively refine generated text to evade detection. This technique, tested on Llama-3 and Qwen-3 models across various sizes, demonstrates improved detector evasion while preserving semantic meaning, suggesting current detectors may be tracking instruction-tuning artifacts rather than inherent machine-generated text qualities. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT New methods for evading AI text detection could impact academic integrity and content authenticity verification.

RANK_REASON Academic paper detailing a new method for evading AI text detectors. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

COVERAGE [1]

  1. arXiv cs.CL TIER_1 · J. Zico Kolter ·

    Base Models Look Human To AI Detectors

    As AI-generated text enters the real-world at scale, institutions increasingly use commercial AI-text detectors, especially in education and academic-integrity workflows. We report a surprising empirical finding about such systems: when evaluated by GPTZero and Pangram, generated…