English(EN) Model Merging by Output-Space Projection

新研究提供先进的模型合并技术

作者 PulseAugur 编辑部 · [3 个来源] · 2026-05-26 08:33

两篇新研究论文提出了模型合并的新颖方法，该技术可以将多个微调的AI模型合并成一个更强大的模型，而无需重新训练。第一篇论文《通过输出空间投影进行模型合并》将合并表述为凸二次规划，提供了一个封闭形式的诊断方法来预测合并质量。第二篇论文《从损失景观角度看模型合并：几何视角》引入了EpiMer，一个将合并视为黎曼流形上弗雷歇均值的框架，统一了现有方法并在图像分类任务上展示了卓越的性能。 AI

影响这些新的合并技术可以通过组合专业模型而无需昂贵的重新训练，从而实现更高效的AI模型开发和部署。

排序理由该集群包含两篇学术论文，详细介绍了AI模型合并的新研究方法。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。我们如何撰写摘要 →

报道来源 [3]

arXiv cs.LG TIER_1 English(EN) · Bethan Evans, Benjamin Etheridge, Stephen Roberts, Jared Tanner · 2026-05-29 04:00

通过输出空间投影进行模型合并

arXiv:2605.29101v1 Announce Type: new Abstract: Model merging combines fine-tuned checkpoints into a single multi-task model without retraining. Existing methods - such as task arithmetic, model soups, TIES, and DARE - are computationally efficient and empirically successful, but…
arXiv stat.ML TIER_1 English(EN) · Juanwu Lu, Anand Bhaskar, Brian Axelrod, Ekaterina Tolstaya, Tristan Emrich · 2026-05-27 04:00

模型在损失景观上的融合：几何视角

arXiv:2605.26693v1 Announce Type: cross Abstract: Model merging offers a promising avenue for knowledge integration and parallel development without retraining. Yet, existing methods either ignore the geometry of the loss landscape or rely on intractable full-space Hessian approx…
arXiv stat.ML TIER_1 English(EN) · Tristan Emrich · 2026-05-26 08:33

模型在损失景观上的融合：几何视角

Model merging offers a promising avenue for knowledge integration and parallel development without retraining. Yet, existing methods either ignore the geometry of the loss landscape or rely on intractable full-space Hessian approximations. We propose EpiMer, a framework that cast…

报道来源 [3]

通过输出空间投影进行模型合并

模型在损失景观上的融合：几何视角

模型在损失景观上的融合：几何视角

相关话题