Process Supervision via Verbal Critique Improves Reasoning in Large Language Models

By PulseAugur Editorial · [1 sources] · 2026-04-23 12:36

Researchers have developed a new framework called Verbal Process Supervision (VPS) that enhances the reasoning capabilities of large language models without requiring gradient updates. This method utilizes structured natural-language critiques from a more powerful AI to guide an iterative generate-critique-refine process. Experiments on benchmarks like GPQA Diamond and AIME 2025 demonstrated significant improvements, with VPS surpassing existing state-of-the-art results and outperforming other methods like Reflexion and Self-Consistency. AI

IMPACT Introduces a new method for improving LLM reasoning performance without retraining, potentially reducing inference costs and improving accuracy on complex tasks.

RANK_REASON The cluster describes a new academic paper detailing a novel method for improving LLM reasoning.

Read on arXiv cs.CL →

paper
other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Hao-Yuan Chen · 2026-04-23 12:36

Process Supervision via Verbal Critique Improves Reasoning in Large Language Models

Inference-time scaling for LLM reasoning has focused on three axes: chain depth, sample breadth, and learned step-scorers (PRMs). We introduce a fourth axis, granularity of external verbal supervision, via Verbal Process Supervision (VPS), a training-free framework that uses stru…

COVERAGE [1]

Process Supervision via Verbal Critique Improves Reasoning in Large Language Models

RELATED ENTITIES

RELATED TOPICS