A new research paper demonstrates that advanced language models like GPT-5.5 and Claude Opus 4.7 can significantly reduce the detectability of AI-generated text. In an agentic research setup, these models closed 71-75% of the style gap compared to human authors on post-editing tasks, outperforming human edits. The study also explored an AI-text detection arms race, finding that frontier LLMs can efficiently lower their detection probability against known detectors with moderate effort.
AI IMPACT Frontier LLMs can already evade AI detection, potentially impacting content authenticity and the effectiveness of detection tools.
AI-generated summary · Google Gemini · from 2 sources.