PulseAugur
实时 07:23:44
English(EN) Bifocal Diffusion Language Models: Asymmetric Bidirectional Context for Parallel Generation

新的双焦扩散语言模型提高了生成速度和质量

研究人员引入了双焦扩散语言模型(dLLMs),以解决离散扩散模型中生成质量和推理速度之间的权衡问题。新的范例,以 R2LM(从右到左的 Mamba)为例,使用非对称双向上下文来实现高质量和高效的 KV 缓存。实验表明,R2LM 在吞吐量方面显著优于双向 dLLMs 和自回归基线,同时保持了具有竞争力的生成质量。 AI

影响 引入了一种新颖的架构,可在不牺牲生成质量的情况下显著提高扩散语言模型的推理速度。

排序理由 该集群包含一篇详细介绍新模型架构和实验结果的学术论文。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 5 个来源。 我们如何撰写摘要 →

新的双焦扩散语言模型提高了生成速度和质量

报道来源 [5]

  1. arXiv cs.CL TIER_1 English(EN) · Yijie Jin, Jiajun Xu, Yuxuan Liu, Chenkai Xu, Yi Tu, Jiajun Li, Dandan Tu, Xiaohui Yan, Kai Yu, Pengfei Liu, Zhijie Deng ·

    Multi-Block Diffusion Language Models

    arXiv:2606.29215v1 Announce Type: cross Abstract: Block Diffusion Language Models (BD-LMs) improve diffusion-based text generation with KV caching and flexible-length generation. A natural next step is to extend them from Single-Block Diffusion (SingleBD) to Multi-Block Diffusion…

  2. arXiv cs.LG TIER_1 English(EN) · Tzu-Tao Chang, Benjamin Yuanyang Hong, Kiet Pham, Shivaram Venkataraman ·

    DiLaServe: High SLO Attainment Serving for Diffusion Language Models

    arXiv:2606.29094v1 Announce Type: new Abstract: Diffusion language models (DLMs) have recently emerged as a promising alternative to conventional autoregressive language models. By generating multiple tokens in parallel during each denoising step, they offer higher inference thro…

  3. arXiv cs.LG TIER_1 English(EN) · Gagan Jain ·

    Adaptive Block Diffusion: Resolving Training-Inference Mismatch in Diffusion Language Models

    arXiv:2606.29275v1 Announce Type: new Abstract: Diffusion Language Models (DLMs) are typically trained under fixed context structures, restricting denoising to predetermined token subsets. This creates a mismatch between training and inference, where models must operate over arbi…

  4. arXiv cs.AI TIER_1 English(EN) · Yuhang Chen, Xianfeng Wu, Jinhao Duan, Mingfu Liang, Xiaohan Wei, Yunchen Pu, Fei Tian, Chonglin Sun, Parish Aggarwal, Frank Shyu, Luke Simon, Sandeep Pandey, Xi Liu, Tianlong Chen ·

    Bifocal Diffusion Language Models: Asymmetric Bidirectional Context for Parallel Generation

    arXiv:2606.27732v1 Announce Type: cross Abstract: Discrete diffusion language models (dLLMs) recover masked tokens in parallel, offering significant speedups over autoregressive (AR) generation. However, such promising frameworks face a fundamental architectural design dilemma: \…

  5. arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Tianlong Chen ·

    双焦扩散语言模型:用于并行生成的非对称双向上下文

    Discrete diffusion language models (dLLMs) recover masked tokens in parallel, offering significant speedups over autoregressive (AR) generation. However, such promising frameworks face a fundamental architectural design dilemma: \ding{182} Adopting bidirectional attention achieve…