English(EN) The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models

小型LLM使用位置复制捷径进行算术，绕过CoT逻辑

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-20 00:32

一篇新的研究论文揭示了小型语言模型在使用链式思考（CoT）提示执行算术任务时存在一个重要的捷径。这些模型倾向于复制答案分隔符之前的位置数字，而不是依赖逻辑排序，即使中间的推理步骤不正确或被打乱，这种位置复制也能解释其准确性的很大一部分，这突显了评估CoT忠实度时潜在的故障模式。 AI

影响揭示了评估小型LLM算术推理的一个关键缺陷，表明当前的忠实度评估可能具有误导性。

排序理由该集群包含一篇学术论文，详细介绍了关于语言模型行为的新发现。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Ming Liu · 2026-05-25 04:00

读取捷径：位置编号复制在小型语言模型算术CoT读取中占主导地位

arXiv:2605.22870v1 Announce Type: cross Abstract: Chain-of-thought (CoT) prompting is necessary for arithmetic in small language models, yet shuffling its steps preserves most performance. What does CoT contribute if not logical sequencing? In three 1-3B instruction-tuned LMs on …
Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-20 00:32

读取捷径：位置数字复制在小型语言模型算术CoT读取中占主导地位

Chain-of-thought (CoT) prompting is necessary for arithmetic in small language models, yet shuffling its steps preserves most performance. What does CoT contribute if not logical sequencing? In three 1-3B instruction-tuned LMs on GSM8K, we isolate the answer-readout stage via pre…