PulseAugur
LIVE 13:08:06
ENTITY GR-Ben

GR-Ben

PulseAugur coverage of GR-Ben — every cluster mentioning GR-Ben across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 1 TOTAL
  1. TOOL · CL_15917 ·

    New GR-Ben benchmark evaluates AI's general reasoning and error detection

    Researchers have introduced GR-Ben, a new benchmark designed to evaluate the error detection capabilities of process reward models (PRMs) across a wider range of reasoning tasks beyond just mathematics. The benchmark co…