English(EN) Metonymy in vision models undermines attention-based interpretability

研究发现：视觉模型中的转喻破坏了基于注意力的可解释性

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-07 12:14

一篇新发表在arXiv上的研究论文引入了视觉模型中的“视觉转喻”概念，即物体的局部编码了关于整个物体的信息。这种现象破坏了基于注意力的可解释性方法，这些方法假设局部性，即局部信息只应编码其对应的图像区域的信息。研究表明，现代视觉 Transformer 违反了这一假设，使得基于局部的推理和可解释性技术不可靠。 AI

影响突出了视觉模型可解释性中的一个根本性问题，可能需要新的方法来理解模型行为。

排序理由该集群包含一篇研究论文，详细介绍了关于视觉模型可解释性的新发现。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CV TIER_1 English(EN) · Ananthu Aniraj, Cassio F. Dantas, Dino Ienco, Massimiliano Mancini, Diego Marcos · 2026-05-08 04:00

视觉模型中的转喻削弱了基于注意力的可解释性

arXiv:2605.06095v1 Announce Type: new Abstract: Part-based reasoning is a classical strategy to make a computer vision model directly focus on the object parts that are relevant to the downstream task. In the context of deep learning, this also serves to improve by-design interpr…
arXiv cs.CV TIER_1 English(EN) · Diego Marcos · 2026-05-07 12:14

视觉模型中的转喻损害了基于注意力的可解释性

Part-based reasoning is a classical strategy to make a computer vision model directly focus on the object parts that are relevant to the downstream task. In the context of deep learning, this also serves to improve by-design interpretability, often by using part-centric attention…

报道来源 [2]

视觉模型中的转喻削弱了基于注意力的可解释性

视觉模型中的转喻损害了基于注意力的可解释性

相关实体

相关话题