English(EN) Bifocal Diffusion Language Models: Asymmetric Bidirectional Context for Parallel Generation

新的双焦扩散语言模型提高了生成速度和质量

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-26 05:26

研究人员引入了双焦扩散语言模型（dLLMs），以解决离散扩散模型中生成质量和推理速度之间的权衡问题。新的范例，以 R2LM（从右到左的 Mamba）为例，使用非对称双向上下文来实现高质量和高效的 KV 缓存。实验表明，R2LM 在吞吐量方面显著优于双向 dLLMs 和自回归基线，同时保持了具有竞争力的生成质量。 AI

影响引入了一种新颖的架构，可在不牺牲生成质量的情况下显著提高扩散语言模型的推理速度。

排序理由该集群包含一篇详细介绍新模型架构和实验结果的学术论文。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Yuhang Chen, Xianfeng Wu, Jinhao Duan, Mingfu Liang, Xiaohan Wei, Yunchen Pu, Fei Tian, Chonglin Sun, Parish Aggarwal, Frank Shyu, Luke Simon, Sandeep Pandey, Xi Liu, Tianlong Chen · 2026-06-29 04:00

Bifocal Diffusion Language Models: Asymmetric Bidirectional Context for Parallel Generation

arXiv:2606.27732v1 Announce Type: cross Abstract: Discrete diffusion language models (dLLMs) recover masked tokens in parallel, offering significant speedups over autoregressive (AR) generation. However, such promising frameworks face a fundamental architectural design dilemma: \…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Tianlong Chen · 2026-06-26 05:26

双焦扩散语言模型：用于并行生成的非对称双向上下文

Discrete diffusion language models (dLLMs) recover masked tokens in parallel, offering significant speedups over autoregressive (AR) generation. However, such promising frameworks face a fundamental architectural design dilemma: \ding{182} Adopting bidirectional attention achieve…

报道来源 [2]

Bifocal Diffusion Language Models: Asymmetric Bidirectional Context for Parallel Generation

双焦扩散语言模型：用于并行生成的非对称双向上下文

相关实体

相关话题