PulseAugur
EN
LIVE 03:28:03
ENTITY Process Reward Model

Process Reward Model

PulseAugur coverage of Process Reward Model — every cluster mentioning Process Reward Model across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 2 TOTAL
  1. RESEARCH · CL_44959 ·

    New VRPRM model enhances LLM reasoning with visual cues

    Researchers have developed VRPRM, a novel process reward model that utilizes visual reasoning to enhance the fine-grained evaluation of Large Language Model (LLM) reasoning steps. This approach significantly reduces the…

  2. RESEARCH · CL_15892 ·

    New method debiases LLMs at decoding time, improving fairness without model retraining

    Researchers have developed a novel method to mitigate biases in large language models during the decoding phase, without altering the model's weights. This approach uses a separate Process Reward Model (PRM) to score to…