RAFT: Data Refinement and Adaptive Distillation for Domain Fine-Tuning with Alleviated Forgetting

New RAFT framework refines domain fine-tuning and reduces model forgetting

By PulseAugur Editorial · [1 source] · 2026-06-02 04:00

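Researchers have introduced RAFT, a two-stage framework designed to improve domain-specific fine-tuning of language models while limiting the loss of performance on general tasks. In the first stage, RAFT refines the domain-specific data through self-conditioned rewriting and semantic filtering, which addresses issues such as supervision compatibility and trajectory preservation. In the second stage, it applies an adaptive distillation process that uses the original model's behavior on the generated trajectories as soft targets, conditioned on the refined answers.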
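The two stages described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the summary does not give the filtering criterion, the loss, or how the distillation weight is adapted, so the cosine-similarity threshold, the function names (`semantic_filter`, `distill_loss`), and the fixed mixing weight `alpha` are all assumptions chosen for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cosine(u, v):
    """Cosine similarity between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def semantic_filter(pairs, threshold=0.8):
    """Stage 1 (assumed form): keep a rewritten answer only if its embedding
    stays close to the embedding of the original domain answer."""
    return [
        p for p in pairs
        if cosine(p["orig_emb"], p["rewrite_emb"]) >= threshold
    ]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same vocabulary."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def distill_loss(student_logits, teacher_logits, target_index, alpha):
    """Stage 2 (assumed form): mix cross-entropy on the refined answer token
    with a KL term that pulls the fine-tuned (student) model toward the
    original (teacher) model's soft targets. `alpha` is a hypothetical
    mixing weight; how RAFT adapts it is not stated in the summary."""
    student = softmax(student_logits)
    teacher = softmax(teacher_logits)
    ce = -math.log(student[target_index])
    kl = kl_divergence(teacher, student)
    return (1 - alpha) * ce + alpha * kl


# Toy data: the second rewrite drifts semantically and is filtered out.
pairs = [
    {"orig_emb": [1.0, 0.0], "rewrite_emb": [0.9, 0.1]},
    {"orig_emb": [1.0, 0.0], "rewrite_emb": [0.0, 1.0]},
]
kept = semantic_filter(pairs)
loss = distill_loss([2.0, 0.0], [1.5, 0.5], target_index=0, alpha=0.5)
```

Setting `alpha` to 0 reduces the loss to ordinary supervised fine-tuning, while setting it to 1 trains only toward the original model's distribution; an adaptive scheme would move between these per example or per token.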

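Impact: This research offers a way to improve domain-specific AI models without sacrificing their general capabilities, which could lead to more capable and more versatile AI applications.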

Ranking rationale: This is an academic paper detailing a new method for fine-tuning language models.


AI-generated summary · Google Gemini · from 1 source.

Sources [1]

arXiv cs.AI · Yuduo Li, Xiaofeng Shi, Qian Kou, Longbin Yu, Hua Zhou · 2026-06-02 04:00

RAFT: Data Refinement and Adaptive Distillation for Domain Fine-Tuning with Alleviated Forgetting

arXiv:2606.00147v1 Announce Type: cross Abstract: Domain-specific supervised fine-tuning (SFT) often improves in-domain performance at the cost of degrading a model's general capabilities. We view this degradation through two practical gaps in domain SFT: a supervision-compatibil…
