PulseAugur
EN
LIVE 09:50:14

New research finds continuous diffusion LMs repeat text excessively, proposes ACE fix

A new research paper published on arXiv identifies a critical flaw in continuous diffusion language models, such as ELF, where low perplexity scores are misleading due to excessive repetition. The study found that these models repeat text significantly more than human-generated content, and the perplexity metric inadvertently rewards this behavior. Researchers propose a one-dimensional fix called ACE (Attractor-Contrast-Escape) that subtracts a specific direction from the self-conditioning feedback loop, reducing repetition to near-human levels while maintaining competitive quality and improving compute efficiency. AI

IMPACT Highlights a potential pitfall in evaluating language model quality and offers a method to improve both output and efficiency.

RANK_REASON Research paper detailing a flaw and proposed solution for language models. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

New research finds continuous diffusion LMs repeat text excessively, proposes ACE fix

COVERAGE [2]

  1. arXiv cs.CL TIER_1 English(EN) · Shuai Zhang, Zijie Chen, Hongliang He, Lun Du, Zhenzhong Lan ·

    Low Perplexity is Repetition: A One-Dimensional Self-Conditioning Attractor in Continuous Diffusion LMs

    arXiv:2607.00588v1 Announce Type: new Abstract: Continuous diffusion language models such as ELF report record-low generative perplexity (Gen-PPL). We find a catch: these models repeat far more than human text, and Gen-PPL rewards rather than penalizes that repetition, so its low…

  2. arXiv cs.CL TIER_1 English(EN) · Zhenzhong Lan ·

    Low Perplexity is Repetition: A One-Dimensional Self-Conditioning Attractor in Continuous Diffusion LMs

    Continuous diffusion language models such as ELF report record-low generative perplexity (Gen-PPL). We find a catch: these models repeat far more than human text, and Gen-PPL rewards rather than penalizes that repetition, so its low scores overstate quality. Strip the repetition …