English(EN) Why Struggle with Continuous Latents? Interpretable Discrete Latent Reasoning via Rendered Compression

新的DLR方法通过离散潜在令牌增强LLM推理·跟踪2个来源

作者 PulseAugur 编辑部 · [4 个来源] · 2026-06-29 02:34

研究人员推出了一种名为离散潜在推理（DLR）的新颖方法，旨在提高大型语言模型中潜在推理的可解释性和效率。DLR将连续潜在状态转换为离散令牌，其灵感来源于基于渲染的压缩技术。该方法旨在通过将离散符号监督与离散潜在令牌对齐，来解决连续潜在方法中常见的که instability和可解释性不足的问题。在Qwen3-VL和LLaMA-3模型上进行的多个推理基准测试表明，DLR的压缩率高达20倍，同时保持了可解释的推理轨迹，优于现有的潜在推理基线。 AI

影响该方法可能带来更高效、更易于理解的LLM推理，从而可能降低推理成本并改善模型对齐。

排序理由该集群包含一篇详细介绍LLM推理新方法的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。我们如何撰写摘要 →

报道来源 [4]

arXiv cs.CL TIER_1 English(EN) · Ying Fan, Anej Svete, Kangwook Lee · 2026-07-01 04:00

Bridging the Gap Between Latent and Explicit Reasoning with Looped Transformers

arXiv:2606.31779v1 Announce Type: cross Abstract: Language models typically reason via explicit chain-of-thought (CoT), generating intermediate steps token-by-token. Latent CoT offers an alternative: it performs multi-step reasoning in the model's hidden states, replacing decoded…
arXiv cs.CL TIER_1 English(EN) · Kangwook Lee · 2026-06-30 14:58

Bridging the Gap Between Latent and Explicit Reasoning with Looped Transformers

Language models typically reason via explicit chain-of-thought (CoT), generating intermediate steps token-by-token. Latent CoT offers an alternative: it performs multi-step reasoning in the model's hidden states, replacing decoded tokens with continuous representations for greate…
arXiv cs.CL TIER_1 English(EN) · Shuochen Chang, Qingyang Liu, Shaobo Wang, Bingjie Gao, Qianli Ma, Haonan Zhao, Yibo Miao, Yulin Sun, Zelin Peng, Jiangtong Li, Li Niu · 2026-06-30 04:00

Why Struggle with Continuous Latents? Interpretable Discrete Latent Reasoning via Rendered Compression

arXiv:2606.29712v1 Announce Type: new Abstract: Large language models achieve high reasoning performance via explicit chain-of-thought and reinforcement learning, but require long output sequences and extended inference time. Latent reasoning reduces this cost by shifting computa…
arXiv cs.CL TIER_1 English(EN) · Li Niu · 2026-06-29 02:34

Why Struggle with Continuous Latents? Interpretable Discrete Latent Reasoning via Rendered Compression

Large language models achieve high reasoning performance via explicit chain-of-thought and reinforcement learning, but require long output sequences and extended inference time. Latent reasoning reduces this cost by shifting computation into a latent space; however, continuous la…

报道来源 [4]

Bridging the Gap Between Latent and Explicit Reasoning with Looped Transformers

Bridging the Gap Between Latent and Explicit Reasoning with Looped Transformers

Why Struggle with Continuous Latents? Interpretable Discrete Latent Reasoning via Rendered Compression

Why Struggle with Continuous Latents? Interpretable Discrete Latent Reasoning via Rendered Compression

相关实体

相关话题