Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty
Researchers have explored the impact of upper-face affective cues on audiovisual sentence recognition, particularly when audio quality is degraded. Their study utilized the CREMA-D corpus to train classifiers under various facial cue conditions, including audio only, audio with lower-face features, audio with upper-face features, and audio with both. The findings indicate that while lower-face features significantly improve robustness in noisy audio, upper-face affective cues contribute to better calibration and confidence estimation, suggesting a role for expressive facial information in multimodal interaction systems. AI
IMPACT Suggests affective facial cues could improve robustness and confidence estimation in multimodal AI systems, particularly in noisy environments.