English(EN) Actionable Interpretability Must Be Defined in Terms of Symmetries

新研究论文通过对称性重新定义AI可解释性

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-15 04:00

一篇新论文提出，应使用对称性框架重新定义AI中的可解释性概念。作者认为，当前的定义对于形式化测试或设计来说是不够的。他们引入了四种特定的对称性——推理等变性、信息不变性、概念闭包不变性以及结构不变性——并相信这些对称性可以将可解释模型形式化为概率模型的一个子集。这种方法旨在统一可解释的推理方法，并为验证是否符合安全和监管标准提供一个正式系统。 AI

影响提出了一种新的AI可解释性形式化框架，可能实现更严格的安全和监管合规性。

排序理由该集群包含一篇提出AI可解释性新理论框架的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Pietro Barbiero, Mateo Espinosa Zarlenga, Francesco Giannini, Alberto Termine, Filippo Bonchi, Mateja Jamnik, Giuseppe Marra · 2026-06-15 04:00

Actionable Interpretability Must Be Defined in Terms of Symmetries

arXiv:2601.12913v4 Announce Type: replace Abstract: This paper argues that interpretability research in Artificial Intelligence (AI) is fundamentally ill-posed as existing definitions of interpretability fail to describe how interpretability can be formally tested or designed for…

报道来源 [1]

Actionable Interpretability Must Be Defined in Terms of Symmetries

相关实体

相关话题