PulseAugur
English(EN) ParaBridge: Bridging Paralinguistic Perception and Dialogue Behavior in Speech Language Models

ParaBridge method improves paralinguistic understanding in speech models

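Researchers have developed ParaBridge, a novel on-policy self-distillation method intended to improve how well speech language models incorporate paralinguistic cues into dialogue. The technique trains models to make better use of non-lexical information, such as tone of voice or background noise, so that they generate more appropriate responses. ParaBridge significantly improves performance on benchmarks such as VoxSafeBench and EchoMind while preserving general language capabilities.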

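Impact: Strengthens speech models' ability to interpret and respond to subtle vocal cues, which may improve human-computer interaction.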

Ranking rationale: This cluster contains a paper detailing a new method for speech language models.


AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

  1. arXiv cs.CL TIER_1 English(EN) · Yuxiang Wang, Qinke Ni, Shengbo Cai, Wan Lin, Liqiang Zhang, Zhizheng Wu ·

    ParaBridge: Bridging Paralinguistic Perception and Dialogue Behavior in Speech Language Models

    arXiv:2606.10581v1 Announce Type: new Abstract: Speech carries more information than just words: a child's voice, a fearful tone, or a noisy background should all lead a sufficiently competent spoken-dialogue assistant to different replies. Current Speech Language Models (SLMs) c…

  2. arXiv cs.CL TIER_1 English(EN) · Zhizheng Wu ·

    ParaBridge: Bridging Paralinguistic Perception and Dialogue Behavior in Speech Language Models

    Speech carries more information than just words: a child's voice, a fearful tone, or a noisy background should all lead a sufficiently competent spoken-dialogue assistant to different replies. Current Speech Language Models (SLMs) can recognize such paralinguistic cues but often …