Process Reward Models
PulseAugur coverage of Process Reward Models — every cluster mentioning Process Reward Models across labs, papers, and developer communities, ranked by signal.
1 day with sentiment data
-
AI researchers develop controllable data synthesis for process reward models
Researchers have developed a new framework for creating synthetic process supervision data tailored for Process Reward Models (PRMs). This method allows for controlled injection of errors into reasoning chains, ensuring…
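The controlled-error idea can be sketched as follows. This is a minimal illustration, not the paper's actual pipeline: the corruption function, label scheme, and function names are assumptions.

```python
import random

def inject_error(steps, error_step=None, seed=0):
    # Corrupt one step of a correct reasoning chain and emit
    # step-level PRM labels: 1 for steps before the error, 0 from
    # the error onward (an error is assumed to invalidate the rest).
    rng = random.Random(seed)
    if error_step is None:
        error_step = rng.randrange(len(steps))
    corrupted = list(steps)
    corrupted[error_step] += " [injected error]"
    labels = [1] * error_step + [0] * (len(steps) - error_step)
    return corrupted, labels

chain = ["2 + 3 = 5", "5 * 4 = 20", "20 - 1 = 19"]
bad_chain, labels = inject_error(chain, error_step=1)
```

Choosing `error_step` explicitly is what makes the synthesis "controllable": the trainer decides where and how often errors appear in the PRM's training data.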
-
Unsupervised Process Reward Models reduce need for human supervision
Researchers have developed a method for training unsupervised Process Reward Models (uPRMs) that eliminates the need for human annotation of step-by-step reasoning. The approach uses LLM next-token pro…
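One way an unsupervised next-token signal could work, as a toy sketch only: assuming the model is prompted with something like "Is this step correct?", the renormalized probability of a "Yes" verdict token serves as the step reward. The prompt framing and verdict tokens are assumptions, not the paper's formulation.

```python
import math

def step_reward(next_token_logprobs, yes_token="Yes", no_token="No"):
    # Toy unsupervised step score: compare the model's next-token
    # log-probabilities for the "Yes" and "No" verdict tokens and
    # return the renormalized probability of "Yes".
    p_yes = math.exp(next_token_logprobs.get(yes_token, -math.inf))
    p_no = math.exp(next_token_logprobs.get(no_token, -math.inf))
    return p_yes / (p_yes + p_no)
```

No human labels enter the loop: the reward comes entirely from the language model's own token probabilities.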
-
Survey details process reward models for fine-grained LLM reasoning alignment
This survey paper systematically reviews Process Reward Models (PRMs), which evaluate and guide Large Language Models (LLMs) at the reasoning step or trajectory level, unlike traditional outcome-based models. It details…
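The step-level versus outcome-level distinction the survey draws can be illustrated with a short sketch. The aggregation choices and function names below are illustrative assumptions, though "min" and product are aggregations commonly discussed in the PRM literature.

```python
def trajectory_score(step_rewards, agg="min"):
    # Collapse per-step PRM scores into one trajectory score.
    # "min" (weakest step) and "prod" (joint step correctness)
    # are two common aggregation choices.
    if agg == "min":
        return min(step_rewards)
    if agg == "prod":
        out = 1.0
        for r in step_rewards:
            out *= r
        return out
    raise ValueError(f"unknown aggregation: {agg}")

def best_of_n(candidate_step_scores):
    # Rank candidate solutions by their trajectory score and keep
    # the best; an outcome-based model would instead score only
    # each candidate's final answer.
    return max(candidate_step_scores, key=trajectory_score)
```

Under "min" aggregation, a trajectory with one very weak step loses to a uniformly solid one, which is exactly the fine-grained signal an outcome-only model cannot provide.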