English(EN) Knowledge Distillation from Large Reasoning Models to Compact Student Models: A Case Study on the John O Bryan Mathematics Competition

知识蒸馏提升紧凑型AI模型在数学推理任务上的准确性

作者 PulseAugur 编辑部 · [1 个来源] · 2026-07-01 04:00

研究人员探索了知识蒸馏技术，以提高小型AI模型在复杂推理任务上的性能。他们使用大型推理模型DeepSeek-R1，在历史数学竞赛问题上训练了一个更紧凑的Qwen2.5-7B模型。经过微调的学生模型在准确性上有了显著提高，在竞赛数据集上的准确率提高了4个百分点以上，并且在单独的基准测试中也表现出良好的泛化能力。研究还发现，模型响应的长度与数学推理中的答案质量直接相关，响应越短，准确率越低。 AI

影响展示了一种增强小型AI模型推理能力的方法，有望在资源受限的环境中实现更高效的部署。

排序理由学术论文，详细介绍了知识蒸馏在提高特定任务AI模型性能方面的新颖应用。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Gaurab Baral, Aaditya Khanal, Yangyang Tao, Junxiu Zhou · 2026-07-01 04:00

Knowledge Distillation from Large Reasoning Models to Compact Student Models: A Case Study on the John O Bryan Mathematics Competition

arXiv:2606.31048v1 Announce Type: cross Abstract: This paper investigates knowledge distillation from a large reasoning model (DeepSeek-R1) to a compact student model (Qwen2.5-7B). Using historical problems from the John O'Bryan Mathematics Competition at Northern Kentucky Univer…

报道来源 [1]

Knowledge Distillation from Large Reasoning Models to Compact Student Models: A Case Study on the John O Bryan Mathematics Competition

相关实体

相关话题