English(EN) Measuring the Redundancy of Decoder Layers in SpeechLLMs

语音大语言模型解码器显示出显著的冗余，允许进行层剪枝

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-29 04:00

研究人员调查了语音大语言模型（SpeechLLMs）解码器层的冗余度，这些层通常占模型参数的90%以上。他们对不同模型规模的研究表明，在不显著影响自动语音识别（ASR）性能的情况下，可以剪枝相当一部分解码器层。研究结果表明，即使是70亿至80亿参数的模型，在只有60%的解码器层完好的情况下也能保持良好的ASR能力，这一趋势在不同规模和任务（包括语音翻译）中都有观察到。 AI

影响暗示了更高效的SpeechLLM架构的可能性，降低了计算成本并实现了更广泛的部署。

排序理由学术论文，详细介绍了模型架构的研究发现。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Adel Moumen, Guangzhi Sun, Philip C Woodland · 2026-06-29 04:00

Measuring the Redundancy of Decoder Layers in SpeechLLMs

arXiv:2603.05121v2 Announce Type: replace-cross Abstract: Speech Large Language Models route speech encoder representations into an LLM decoder that typically accounts for over 90% of total parameters. We study how much of this decoder capacity is actually needed for speech tasks…

报道来源 [1]

Measuring the Redundancy of Decoder Layers in SpeechLLMs

相关实体

相关话题