English(EN) CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing

CrispEdit 算法通过保留通用能力来增强 LLM 编辑

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-05 04:00

研究人员开发了 CrispEdit，一种用于编辑大型语言模型 (LLM) 的新算法，该算法侧重于在修改特定行为的同时保留通用能力。该方法将编辑构建为一个约束优化问题，使用低曲率投影来确保更改不会破坏模型的广泛功能。通过采用 Kronecker-factored approximate curvature (K-FAC) 和无矩阵投影仪等技术，CrispEdit 实现了高效、可扩展的编辑，并在标准基准测试中以最小的能力下降显著提高了编辑成功率。 AI

影响引入了一种新的 LLM 编辑方法，旨在提高性能并减少意外副作用。

排序理由这是一篇详细介绍 LLM 编辑新算法的研究论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Zarif Ikram, Arad Firouzkouhi, Stephen Tu, Mahdi Soltanolkotabi, Paria Rashidinejad · 2026-05-05 04:00

CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing

arXiv:2602.15823v2 Announce Type: replace Abstract: A central challenge in large language model (LLM) editing is capability preservation: methods that successfully change targeted behavior can quietly game the editing proxy and corrupt general capabilities, producing degenerate b…

报道来源 [1]

CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing

相关实体

相关话题