ParaBridge: Bridging Paralinguistic Perception and Dialogue Behavior in Speech Language Models
Researchers have developed ParaBridge, a novel on-policy self-distillation method designed to improve speech language models' ability to incorporate paralinguistic cues into dialogue. This technique trains models to better utilize non-lexical information, such as tone of voice or background noise, to generate more appropriate responses. ParaBridge significantly enhances performance on benchmarks like VoxSafeBench and EchoMind, while maintaining general language capabilities. AI
IMPACT Enhances speech models' ability to interpret and respond to nuanced vocal cues, potentially improving human-AI interaction.