Disentanglement-Based Equivariant Learning for Compositional VQA

New framework improves compositional VQA by disentangling concepts

By the PulseAugur editorial team · [2 sources] · 2026-06-01 12:28

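Researchers have introduced a new framework, Disentanglement-based Equivariant Learning (DEAL), to improve compositional visual question answering (VQA). The method uses causally inspired interventions to disentangle the concepts in visual and textual inputs, addressing two limitations of current methods: they overlook concept disentanglement, and they require additional training cues. DEAL applies compositional transformations and equivariance constraints to strengthen the model's reasoning ability, and it performs well on benchmark datasets such as CLEVR-CoGenT and GQA-SGL.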
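To make the idea of an equivariance constraint concrete, here is a minimal toy sketch. It is not the DEAL implementation, and every name in it (`swap_colors`, `swap_answers`, `equivariance_gap`, the two toy models) is a hypothetical illustration. It assumes a scene is a list of (color, shape) pairs and the question is "what color is the given shape?". Swapping two colors in the input should swap the same two entries in the answer distribution; the gap between those two paths is the kind of quantity an equivariance constraint would penalize.

```python
COLORS = ["red", "blue", "green"]

def swap_colors(scene, a, b):
    """Input-side transformation: exchange colors a and b on every object."""
    swap = {a: b, b: a}
    return [(swap.get(color, color), shape) for color, shape in scene]

def swap_answers(probs, a, b):
    """Output-side transformation: exchange the answer scores of a and b."""
    out = list(probs)
    i, j = COLORS.index(a), COLORS.index(b)
    out[i], out[j] = probs[j], probs[i]
    return out

def one_hot(color):
    return [1.0 if c == color else 0.0 for c in COLORS]

def disentangled_model(scene, shape):
    """Toy model that reads the color of the queried shape and nothing else."""
    color = next(c for c, s in scene if s == shape)
    return one_hot(color)

def entangled_model(scene, shape):
    """Toy model with a spurious prior: it partly believes cubes are red."""
    uniform = [1.0 / len(COLORS)] * len(COLORS)
    prior = one_hot("red") if shape == "cube" else uniform
    evidence = disentangled_model(scene, shape)
    return [0.5 * e + 0.5 * p for e, p in zip(evidence, prior)]

def equivariance_gap(model, scene, shape, a, b):
    """Squared distance between model(T(x)) and T'(model(x))."""
    transform_then_answer = model(swap_colors(scene, a, b), shape)
    answer_then_transform = swap_answers(model(scene, shape), a, b)
    return sum((u - v) ** 2
               for u, v in zip(transform_then_answer, answer_then_transform))

scene = [("red", "cube"), ("blue", "sphere")]
gap_clean = equivariance_gap(disentangled_model, scene, "cube", "red", "blue")
gap_biased = equivariance_gap(entangled_model, scene, "cube", "red", "blue")
print(gap_clean, gap_biased)
```

In this toy setting the model that keeps color and shape separate has no gap, while the model that ties "cube" to "red" has a positive one. Used as a training penalty, such a gap would push a model toward the disentangled behavior.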

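Impact: This research could lead to more robust and more generalizable VQA systems that can understand complex, novel combinations of concepts.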

Ranking rationale: This cluster contains a research paper detailing a new framework for a specific AI task.


AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

arXiv cs.LG TIER_1 · Zhou Du, Zhaoquan Yuan, Xiao Wu, Changsheng Xu · 2026-06-02 04:00

Disentanglement-Based Equivariant Learning for Compositional VQA

arXiv:2606.02168v1 Announce Type: cross Abstract: Compositional visual question answering (VQA) represents a challenging yet fundamental task that requires models to comprehend novel combinations of previously learned concepts. The current methods often overlook the disentangleme…

arXiv cs.LG TIER_1 · Changsheng Xu · 2026-06-01 12:28

Disentanglement-Based Equivariant Learning for Compositional VQA

Compositional visual question answering (VQA) represents a challenging yet fundamental task that requires models to comprehend novel combinations of previously learned concepts. The current methods often overlook the disentanglement of underlying concepts and are restricted in te…
