English(EN) Decoding the Multimodal Maze: A Systematic Review on the Adoption of Explainability in Multimodal Attention-based Models

AI多模态模型的可解释性缺乏稳健评估

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-12 04:00

一篇最近发表在arXiv上的系统性文献综述，考察了多模态注意力AI模型的可解释性。该综述涵盖了2020年1月至2024年初的研究，发现大多数研究集中在视觉-语言和纯语言模型上，并频繁采用基于注意力的技术进行解释。然而，这些方法常常难以完全捕捉跨模态交互，并且目前对多模态可解释性的评估实践缺乏一致性和稳健性。作者们提出了建议，以促进该领域更严格和标准化的评估，从而推动负责任的AI发展。 AI

影响强调了在多模态AI可解释性领域标准化评估方法这一关键需求，以确保更具可解释性和问责制性的系统。

排序理由该集群是一篇发表在arXiv上的系统性文献综述，属于研究类别。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Md Raisul Kibria, S\'ebastien Lafond, Janan Arslan · 2026-06-12 04:00

Decoding the Multimodal Maze: A Systematic Review on the Adoption of Explainability in Multimodal Attention-based Models

arXiv:2508.04427v2 Announce Type: replace-cross Abstract: Multimodal learning has witnessed remarkable advancements in recent years, particularly with the integration of attention-based models, leading to significant performance gains across a variety of tasks. Parallel to this p…

报道来源 [1]

Decoding the Multimodal Maze: A Systematic Review on the Adoption of Explainability in Multimodal Attention-based Models

相关实体

相关话题