New metrics reveal RLVR doesn't guarantee reliable reasoning in LLMs

By the PulseAugur editorial team · [2 sources] · 2026-04-23 21:01

A new paper questions whether Reinforcement Learning from Verifiable Rewards (RLVR) ensures that a language model's reasoning chains faithfully reflect how it arrives at its answers. The researchers introduce metrics such as Causal Importance of Reasoning (CIR) and Sufficiency of Reasoning (SR) to evaluate this, and find that although RLVR boosts accuracy, it does not consistently improve these reasoning metrics. The study suggests that fine-tuning before RLVR, or adding auxiliary rewards alongside outcome-based rewards, can lead to more reliable and causally important reasoning.

Impact: Challenges the assumption that RLVR guarantees reliable reasoning and suggests modifications for more trustworthy AI outputs.

Ranking rationale: Academic paper introducing new metrics and experimental findings on language model reasoning.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

arXiv cs.CL TIER_1 English(EN) · Qinan Yu, Alexa Tartaglini, Peter Hase, Carlos Guestrin, Christopher Potts · 2026-04-27 04:00

Outcome Rewards Do Not Guarantee Verifiable or Causally Important Reasoning

arXiv:2604.22074v1. Abstract: Reinforcement Learning from Verifiable Rewards (RLVR) on chain-of-thought reasoning has become a standard part of language model post-training recipes. A common assumption is that the reasoning chains trained through RLVR reliably r…

arXiv cs.CL TIER_1 English(EN) · Christopher Potts · 2026-04-23 21:01

Outcome Rewards Do Not Guarantee Verifiable or Causally Important Reasoning

Reinforcement Learning from Verifiable Rewards (RLVR) on chain-of-thought reasoning has become a standard part of language model post-training recipes. A common assumption is that the reasoning chains trained through RLVR reliably represent how a model gets to its answer. In this…
