The check you can write is the check you can fool

AI verification relies on external checks, not self-audit

By the PulseAugur editorial team · [1 source] · 2026-06-04 10:56

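A new approach to verifying AI output holds that the most effective checks are the ones the system under test could not have written itself. The approach centers on the provenance of evidence: real verification depends on where the evidence came from and whether the actor could have manipulated it. The author argues that the key trust boundary is not immutable storage but the moment the evidence is emitted, which prevents the actor from selectively curating data before it is recorded.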
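One way to picture the provenance test is the minimal Python sketch below. It is an illustration under stated assumptions, not the source article's method: the `Evidence` record, the `emit` and `is_external_check` helpers, and the use of an HMAC key held only by the observer are all hypothetical names and choices introduced here. The idea it tries to show is that a record counts as a check only if it was signed at emission time by someone other than the actor being checked.

```python
import hashlib
import hmac
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    """Hypothetical evidence record; field names are assumptions for illustration."""
    emitter: str    # who produced the record
    payload: str    # what was observed
    signature: str  # HMAC over emitter and payload, computed at emission time


def _message(emitter: str, payload: str) -> bytes:
    # Serialize both fields so neither can be swapped without breaking the signature.
    return json.dumps([emitter, payload]).encode()


def emit(emitter: str, payload: str, key: bytes) -> Evidence:
    """Sign the record at the moment it is emitted, before anyone can curate it."""
    signature = hmac.new(key, _message(emitter, payload), hashlib.sha256).hexdigest()
    return Evidence(emitter, payload, signature)


def is_external_check(evidence: Evidence, actor: str, emitter_keys: dict[str, bytes]) -> bool:
    """Accept evidence only if the actor under test could not have written it."""
    if evidence.emitter == actor:
        return False  # self-audit: the actor authored its own check
    key = emitter_keys.get(evidence.emitter)
    if key is None:
        return False  # unknown emitter, so provenance cannot be established
    expected = hmac.new(
        key, _message(evidence.emitter, evidence.payload), hashlib.sha256
    ).hexdigest()
    return hmac.compare_digest(expected, evidence.signature)
```

In this sketch, a record emitted by a separate observer (say, a test runner whose key the agent never sees) passes, while the agent's own report, or an observer record whose payload was edited after emission, is rejected. The sketch assumes the actor has no access to the observer's key; if it does, the check is once again one the actor could have written.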

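Impact: Challenges the current AI verification paradigm and recommends a shift toward external checks that the system cannot write itself, in order to establish robust trust.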

Ranking rationale: The article presents a viewpoint on AI verification methods rather than a new release or an empirical study.

AI-generated summary · Google Gemini · from 1 source.

Sources [1]

dev.to — LLM tag TIER_1 English(EN) · ANP2 Network · 2026-06-04 10:56

The check you can write is the check you can fool

A few weeks of watching agents fail in slow, expensive ways has pushed me toward a single test for whether a system is actually verified, and it is narrower than I expected: could the thing being checked have produced the check?

That sounds glib, but it cuts through a l…