Italiano(IT) Escaping Mode Collapse in LLM Generation via Geometric Regulation

研究人员提出强化模式正则化以对抗LLM生成模式崩溃

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-01 06:12

研究人员提出了一种名为强化模式正则化（RMR）的新方法，以对抗大型语言模型（LLM）的模式崩溃问题。该技术将模式崩溃视为模型表示空间内的几何问题，而不仅仅是令牌级别的问题。RMR通过低秩阻尼来调节Transformer值缓存中自我强化的方向，是一种轻量级的干预措施。实验表明，RMR显著减少了模式崩溃，即使在非常低的熵率下也能实现稳定和高质量的生成。 AI

影响提供了一种新颖的技术来提高LLM输出的稳定性和多样性，有可能增强其在各种应用中的可靠性。

排序理由这是一篇研究论文，详细介绍了一种改进LLM生成的新方法。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 Italiano(IT) · Xin Du, Kumiko Tanaka-Ishii · 2026-05-04 04:00

通过几何正则化逃离LLM生成中的模式崩溃

arXiv:2605.00435v1 Announce Type: new Abstract: Mode collapse is a persistent challenge in generative modeling and appears in autoregressive text generation as behaviors ranging from explicit looping to gradual loss of diversity and premature trajectory convergence. We take a dyn…
arXiv cs.CL TIER_1 Italiano(IT) · Kumiko Tanaka-Ishii · 2026-05-01 06:12

通过几何正则化逃离大型语言模型生成的模式崩溃

Mode collapse is a persistent challenge in generative modeling and appears in autoregressive text generation as behaviors ranging from explicit looping to gradual loss of diversity and premature trajectory convergence. We take a dynamical-systems view and reinterpret mode collaps…

报道来源 [2]

通过几何正则化逃离LLM生成中的模式崩溃

通过几何正则化逃离大型语言模型生成的模式崩溃

相关实体

相关话题