Brief · PulseAugur

TOOL · arXiv cs.AI English(EN) · 6h

MVCL-DAF++: Enhancing Multimodal Intent Recognition via Prototype-Aware Contrastive Alignment and Coarse-to-Fine Dynamic Attention Fusion

Researchers have introduced MVCL-DAF++, an advancement in multimodal intent recognition designed to improve semantic grounding and robustness. The new model incorporates prototype-aware contrastive alignment to enhance semantic consistency and a coarse-to-fine attention fusion mechanism for hierarchical cross-modal interaction. This approach has achieved new state-of-the-art results on the MIntRec and MIntRec2.0 benchmarks, notably improving rare-class recognition. AI

IMPACT Enhances multimodal understanding, potentially improving applications that rely on interpreting complex, multi-source inputs.

MVCL-DAF++
MIntRec
MIntRec2.0
Haofeng Huang