Researchers have developed ML-SAN, a model designed to improve emotion recognition in conversations by accounting for individual differences in how speakers express emotion. The Multi-Level Speaker-Adaptive Network works in three stages: it calibrates input features, adapts how much each modality is trusted based on speaker identity, and maintains speaker consistency in the latent space. Tests on the MELD and IEMOCAP datasets indicate improved performance, particularly on less common sentiment categories and across diverse speakers.
Impact: Improves multimodal emotion recognition by adapting to individual speaker expression styles, enhancing machine empathy.
Ranking rationale: This is a research paper introducing a novel model for emotion recognition.
AI-generated summary · Google Gemini · from 2 sources.