New TRACE method detects LLM ghostwriters in long texts

By PulseAugur Editorial · [1 sources] · 2026-06-10 04:00

Researchers have developed a new method called TRACE to detect ghostwriters generated by large language models in long-form texts. This technique creates a unique fingerprint by analyzing token-level transition patterns, such as word rank, using a separate lightweight language model. TRACE has demonstrated state-of-the-art performance on a new dataset called GhostWriteBench, which includes texts over 50,000 words generated by frontier LLMs, and shows robustness in out-of-distribution scenarios and with limited training data. AI

IMPACT Provides a new tool for identifying AI-generated content in long-form writing, impacting content authenticity and copyright.

RANK_REASON The cluster contains an academic paper detailing a new method and dataset for LLM authorship attribution. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Anudeex Shetty, Qiongkai Xu, Olga Ohrimenko, Jey Han Lau · 2026-06-10 04:00

Who Wrote the Book? Detecting and Attributing LLM Ghostwriters

arXiv:2603.28054v2 Announce Type: replace Abstract: In this paper, we introduce GhostWriteBench, a dataset for LLM authorship attribution. It comprises long-form texts (50K+ words per book) generated by frontier LLMs, and is designed to test generalisation across multiple out-of-…

COVERAGE [1]

Who Wrote the Book? Detecting and Attributing LLM Ghostwriters

RELATED ENTITIES

RELATED TOPICS