ENTITY Process Reward Model

Process Reward Model

PulseAugur coverage of Process Reward Model — every cluster mentioning Process Reward Model across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

2 over 90d

Releases · 30d

0 over 90d

Papers · 30d

2 over 90d

TIER MIX · 90D

SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 2 TOTAL

RESEARCH · CL_44959 · May 11 · 00:00

New VRPRM model enhances LLM reasoning with visual cues

Researchers have developed VRPRM, a novel process reward model that utilizes visual reasoning to enhance the fine-grained evaluation of Large Language Model (LLM) reasoning steps. This approach significantly reduces the…
RESEARCH · CL_15892 · May 4 · 08:51

New method debiases LLMs at decoding time, improving fairness without model retraining

Researchers have developed a novel method to mitigate biases in large language models during the decoding phase, without altering the model's weights. This approach uses a separate Process Reward Model (PRM) to score to…

New VRPRM model enhances LLM reasoning with visual cues

New method debiases LLMs at decoding time, improving fairness without model retraining