ENTITY
Process Reward Model
Process Reward Model
PulseAugur coverage of Process Reward Model — every cluster mentioning Process Reward Model across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
New VRPRM model enhances LLM reasoning with visual cues
Researchers have developed VRPRM, a novel process reward model that utilizes visual reasoning to enhance the fine-grained evaluation of Large Language Model (LLM) reasoning steps. This approach significantly reduces the…
-
New method debiases LLMs at decoding time, improving fairness without model retraining
Researchers have developed a novel method to mitigate biases in large language models during the decoding phase, without altering the model's weights. This approach uses a separate Process Reward Model (PRM) to score to…