PulseAugur
EN
LIVE 02:24:56

New research shows LLMs causally use scratchpad reasoning steps

Researchers have investigated whether large language models utilize their intermediate 'scratchpad' reasoning steps as intended for subsequent computations. By editing internal representations of these scratchpad states and observing the model's predictions, they found that models trained to use scratchpads causally adjust their subsequent steps based on these edited states. This effect was observed across different model families, suggesting that scratchpad oversight can indeed train models to use written states as part of their computational process, rather than just for human legibility. AI

IMPACT This research suggests that current methods for training LLMs to use intermediate reasoning steps may be effective, potentially leading to more reliable and interpretable AI systems.

RANK_REASON Academic paper detailing a new research finding on LLM internal reasoning. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New research shows LLMs causally use scratchpad reasoning steps

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Benjamin Shih, John Winnicki, Eric Darve ·

    Do Models Read What They Write? Causal Registers in Scratchpad Reasoning

    arXiv:2606.29522v1 Announce Type: cross Abstract: A central hope behind process supervision is that models can expose intermediate variables that matter for their later behavior. For this to help with alignment, a scratchpad must be tied to the computation: when the model writes …