Brief

last 24h

[2/2] 222 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.AI English(EN) · 10h

Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty

Researchers have explored the impact of upper-face affective cues on audiovisual sentence recognition, particularly when audio quality is degraded. Their study utilized the CREMA-D corpus to train classifiers under various facial cue conditions, including audio only, audio with lower-face features, audio with upper-face features, and audio with both. The findings indicate that while lower-face features significantly improve robustness in noisy audio, upper-face affective cues contribute to better calibration and confidence estimation, suggesting a role for expressive facial information in multimodal interaction systems. AI

IMPACT Suggests affective facial cues could improve robustness and confidence estimation in multimodal AI systems, particularly in noisy environments.
- CREMA-D
TOOL · arXiv cs.LG English(EN) · 3w

Mixed-Precision Information Bottlenecks for On-Device Trait-State Disentanglement in Bipolar Agitation Detection

Researchers have developed a new framework called MP-IB for disentangling stable speaker traits from volatile affective states in voice data, specifically for detecting bipolar disorder agitation on resource-constrained devices. The system utilizes mixed-precision quantization, where different numerical precisions (FP16 for traits, INT4 for states) create an information bottleneck to separate these elements. This approach achieved a rho of 0.117 on the Bridge2AI-Voice dataset, outperforming existing methods and enabling real-time monitoring with a small memory footprint. AI

IMPACT Introduces a novel method for on-device AI analysis of voice data, potentially enabling real-time health monitoring on low-power devices.

Brief

Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty

Mixed-Precision Information Bottlenecks for On-Device Trait-State Disentanglement in Bipolar Agitation Detection