English(EN) FBK's Long-form SpeechLLMs for IWSLT 2026 Instruction Following

FBK 的 SpeechLLMs 在 IWSLT 2026 指令遵循任务中取得优异成绩

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-25 10:01

FBK 研究人员为 IWSLT 2026 指令遵循共享任务开发了 SpeechLLMs，重点关注短篇幅和长篇幅语音指令遵循。对于短篇幅任务，他们的模型取得了 2.0708 的 SIFS 分数。在长篇幅任务中，他们探索了各种分段方法，并引入了 HIFS 分数来评估性能，发现固定的 30 秒分段产生了最佳结果，得分为 2.0663。分析表明，长篇幅生成中的幻觉主要涉及重复插入，尽管短篇幅能力基本保持不变。 AI

影响这项研究有助于推进语音指令遵循模型，可能改进 AI 系统通过语音理解和执行命令的方式。

排序理由该集群包含一篇学术论文，详细介绍了提交给共享任务的新模型。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

FBK 的 SpeechLLMs 在 IWSLT 2026 指令遵循任务中取得优异成绩

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Zhihang Xie, Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli · 2026-06-26 04:00

FBK's Long-form SpeechLLMs for IWSLT 2026 Instruction Following

arXiv:2606.26819v1 Announce Type: new Abstract: This paper describes our submission to the IWSLT 2026 Instruction Following shared task. SpeechLLMs are developed for both short-form and long-form speech instruction following under constrained settings. For the short track, strong…
arXiv cs.CL TIER_1 English(EN) · Luisa Bentivogli · 2026-06-25 10:01

FBK's Long-form SpeechLLMs for IWSLT 2026 Instruction Following

This paper describes our submission to the IWSLT 2026 Instruction Following shared task. SpeechLLMs are developed for both short-form and long-form speech instruction following under constrained settings. For the short track, strong performance is achieved on MCIF, with a SIFS sc…

报道来源 [2]

FBK's Long-form SpeechLLMs for IWSLT 2026 Instruction Following

FBK's Long-form SpeechLLMs for IWSLT 2026 Instruction Following

相关实体

相关话题