PulseAugur
实时 11:11:17
English(EN) On the Cost and Benefit of Chain of Thought: A Learning-Theoretic Perspective

新理论解释思维链推理何时帮助或损害人工智能

研究人员开发了一个新的学习理论框架来分析人工智能模型中的思维链(CoT)推理。该框架将与CoT相关的风险分解为两个组成部分:从最优推理路径获得的收益和沿错误路径累积错误所产生的成本。该分析表明,CoT的有效性高度依赖于其组件的稳定性,并确定了有界、线性、指数误差增长的特定条件。 AI

影响 为理解和提高人工智能模型中复杂推理的可靠性提供了理论基础。

排序理由 该集群包含一篇学术论文,详细介绍了分析人工智能推理技术的理论框架。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

报道来源 [4]

  1. arXiv cs.AI TIER_1 English(EN) · Jingyi Sun, Qianli Wang, Pepa Atanasova, Nils Feldhus, Isabelle Augenstein ·

    Investigating the Interplay between Contextual and Parametric Chain-of-Thought Faithfulness under Optimization

    arXiv:2605.24960v1 Announce Type: cross Abstract: Chain-of-Thought (CoT) faithfulness, i.e., whether CoTs genuinely reflect large language models' (LLM) underlying behavior, is typically evaluated under two disjoint paradigms: contextual faithfulness, measured by perturbing the i…

  2. arXiv cs.CL TIER_1 English(EN) · Jinghan Jia, Joe Benton, Eric Easley ·

    Faithfulness as Information Flow: Evaluating and Training Faithful Chain-of-Thought Reasoning

    arXiv:2605.24286v1 Announce Type: cross Abstract: Chain-of-thought (CoT) reasoning is useful for monitoring language models only when the reasoning trace faithfully reflects the computation that produces the final answer. However, models can rely on prompt-to-answer shortcuts tha…

  3. arXiv cs.LG TIER_1 English(EN) · Yongyi Mao ·

    On the Cost and Benefit of Chain of Thought: A Learning-Theoretic Perspective

    We develop a learning-theoretic framework for understanding Chain of Thought (CoT). We model CoT as the interaction between an answer map and a chain rule that generates intermediate questions autoregressively, and define the reasoning risk of a hypothesis under this interaction.…

  4. Hugging Face Daily Papers TIER_1 English(EN) ·

    On the Cost and Benefit of Chain of Thought: A Learning-Theoretic Perspective

    We develop a learning-theoretic framework for understanding Chain of Thought (CoT). We model CoT as the interaction between an answer map and a chain rule that generates intermediate questions autoregressively, and define the reasoning risk of a hypothesis under this interaction.…