Brief

last 24h

[2/2] 222 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.AI English(EN) · 1d

When Do LLMs Reason? A Dynamical Systems View via Entropy Phase Transitions

Researchers have developed a new framework called EDRM that uses early-stage entropy dynamics to determine when Large Language Models (LLMs) should engage in explicit reasoning. They observed that tasks benefiting from Chain-of-Thought (CoT) reasoning show a consistent reduction in entropy during generation, indicating a shift to a structured reasoning state. EDRM leverages this entropy reduction signal to adaptively select inference strategies, leading to significant token reductions and accuracy improvements across various benchmarks and LLMs. AI

IMPACT Optimizes LLM inference by selectively invoking reasoning, potentially reducing costs and improving efficiency for AI operators.
- LLMs
- Chain-of-Thought (CoT)
RESEARCH · Hugging Face Daily Papers English(EN) · 1w · [2 sources]

The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models

A new research paper reveals a significant shortcut in how small language models perform arithmetic tasks using chain-of-thought (CoT) prompting. Instead of relying on logical sequencing, these models tend to copy the number positioned just before the answer delimiter, regardless of the intermediate reasoning steps. This positional copying accounts for a large portion of their accuracy, even when the preceding steps are incorrect or shuffled, highlighting a potential failure mode in evaluating CoT faithfulness. AI

IMPACT Reveals a critical flaw in evaluating arithmetic reasoning in small LLMs, suggesting current faithfulness evaluations may be misleading.

Brief

When Do LLMs Reason? A Dynamical Systems View via Entropy Phase Transitions

The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models