English(EN) Low Perplexity is Repetition: A One-Dimensional Self-Conditioning Attractor in Continuous Diffusion LMs

新研究发现连续扩散语言模型过度重复文本，提出ACE修复方案

作者 PulseAugur 编辑部 · [2 个来源] · 2026-07-01 08:13

一篇新发表在arXiv上的研究论文指出了连续扩散语言模型（如ELF）的一个关键缺陷，即低困惑度得分因过度重复而具有误导性。研究发现，与人类生成的内容相比，这些模型会显著重复文本，而困惑度指标无意中奖励了这种行为。研究人员提出了一种名为ACE（Attractor-Contrast-Escape）的一维修复方法，该方法从自条件反馈循环中减去一个特定方向，将重复减少到接近人类水平，同时保持了具有竞争力的质量并提高了计算效率。 AI

影响强调了评估语言模型质量的潜在陷阱，并提供了一种改进输出和效率的方法。

排序理由研究论文，详细介绍了语言模型的缺陷和提出的解决方案。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Shuai Zhang, Zijie Chen, Hongliang He, Lun Du, Zhenzhong Lan · 2026-07-02 04:00

Low Perplexity is Repetition: A One-Dimensional Self-Conditioning Attractor in Continuous Diffusion LMs

arXiv:2607.00588v1 Announce Type: new Abstract: Continuous diffusion language models such as ELF report record-low generative perplexity (Gen-PPL). We find a catch: these models repeat far more than human text, and Gen-PPL rewards rather than penalizes that repetition, so its low…
arXiv cs.CL TIER_1 English(EN) · Zhenzhong Lan · 2026-07-01 08:13

低困惑度即重复：连续扩散模型中的一维自吸引子

Continuous diffusion language models such as ELF report record-low generative perplexity (Gen-PPL). We find a catch: these models repeat far more than human text, and Gen-PPL rewards rather than penalizes that repetition, so its low scores overstate quality. Strip the repetition …

报道来源 [2]

Low Perplexity is Repetition: A One-Dimensional Self-Conditioning Attractor in Continuous Diffusion LMs

低困惑度即重复：连续扩散模型中的一维自吸引子

相关实体

相关话题