English(EN) The Undecidability of Artificial General Intelligence (AGI) Alignment

AGI对齐因结构不可验证性而被证明是不可判定的

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-30 04:00

一篇新发表在arXiv上的论文确立了人工智能通用（AGI）安全性的理论极限，证明了核心挑战并非对齐状态的不可能性，而是其结构上的不可验证性。该研究引入了对齐不可验证性定理和AGI对齐有限结构不可验证性定理，并将这些限制置于Trakhtenbrot的墙壁之内。这些发现表明，依赖于有限硬件或停机架构的当前工程防御措施无法克服根本的逻辑障碍，从而导致了不可避免的三个遏制失败的困境。 AI

影响确立了AGI安全性的根本理论极限，表明当前的工程方法可能不足够。

排序理由发表在arXiv上的学术论文，详细阐述了AGI对齐的理论局限性。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Jose Pascual Gumbau Mezquita · 2026-06-30 04:00

The Undecidability of Artificial General Intelligence (AGI) Alignment

arXiv:2606.28639v1 Announce Type: cross Abstract: This article establishes the foundational mathematical limits of Artificial General Intelligence (AGI) safety, proving that the core barrier is not the impossibility of an aligned state, but its structural unverifiability. We form…

报道来源 [1]

The Undecidability of Artificial General Intelligence (AGI) Alignment

相关话题