English(EN) ASKD-Whisper: Adaptive Self-knowledge Distillation for Efficient and Low-Latency Automatic Speech Recognition

新的蒸馏方法提高了Whisper ASR模型的效率

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-02 04:00

研究人员开发了自适应自知识蒸馏（ASKD），一种用于压缩大型AI模型的新颖框架。该方法在训练过程中动态减少对教师模型预测的依赖，鼓励学生模型发展独立推理能力。ASKD被应用于将Whisper语音识别模型蒸馏成一个更高效的版本ASKD-Whisper，其推理延迟降低了5倍，词错误率比其教师模型低1.07%。 AI

影响该技术可以实现大型ASR模型在资源受限设备上更高效的部署。

排序理由该集群包含一篇详细介绍模型压缩新颖方法及其在ASR模型应用的新学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Junseok Lee, Nahun Kim, Sangyong Lee, Chang-Jae Chun · 2026-06-02 04:00

ASKD-Whisper: Adaptive Self-knowledge Distillation for Efficient and Low-Latency Automatic Speech Recognition

arXiv:2601.19919v2 Announce Type: replace-cross Abstract: Knowledge distillation (KD) is one of the most effective paradigms for compressing large-scale foundation models into deployable architectures. In the context of Automatic Speech Recognition (ASR), previous studies have pr…

报道来源 [1]

ASKD-Whisper: Adaptive Self-knowledge Distillation for Efficient and Low-Latency Automatic Speech Recognition

相关实体

相关话题