A new research paper proposes that analyzing the cognitive processes, rather than just the outputs, is more effective for distinguishing humans from advanced AI agents. The study introduces CogCAPTCHA30, a set of 30 cognitive tasks designed to reveal process-level differences, achieving an 0.88 AUC in distinguishing humans from AI. The research evaluated frontier agents like Claude Sonnet 4.5, GPT-5, and Gemini 2.5 Pro, finding that while fine-tuning on human decisions improves process mimicry, process specification remains a bottleneck for achieving truly human-like cognitive processes. AI
影响 Suggests a new paradigm for AI safety and alignment research, moving beyond output-based evaluations to process-based analysis.
排序理由 Academic paper proposing a new method for distinguishing humans from AI by analyzing cognitive processes.
AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →