Researchers have identified a significant issue in detecting machine-generated text, stemming from a phenomenon akin to Simpson's Paradox. Current methods average token scores, which masks a non-uniform signal across the detector model's hidden space. A new approach introduces a learned local calibration step, improving detection accuracy by aggregating calibrated log-likelihood ratios instead of raw scores. This method dramatically enhances performance, with one variant improving AUROC from 0.63 to 0.85 on GPT-5.4 text. AI
影响 Improves the reliability of distinguishing AI-generated text, crucial for combating misinformation and ensuring authenticity.
排序理由 Academic paper proposing a novel methodology for detecting machine-generated text.
AI 生成摘要 · Google Gemini · 来自 3 个来源。 我们如何撰写摘要 →