PulseAugur
实时 14:33:17
实体 FormalRewardBench

FormalRewardBench

PulseAugur coverage of FormalRewardBench — every cluster mentioning FormalRewardBench across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
1
90 天内 1
发布 · 30天
0
90 天内 0
论文 · 30天
1
90 天内 1
层级分布 · 90 天
时间线
  1. 2026-05-11 research_milestone Introduction of FormalRewardBench, the first benchmark for evaluating reward models in formal theorem proving. 来源
情绪 · 30 天

1 天有情绪数据

最近 · 第 1/1 页 · 共 1 条
  1. TOOL · CL_27514 ·

    FormalRewardBench benchmark evaluates LLM reward models for theorem proving

    Researchers have introduced FormalRewardBench, a new benchmark designed to evaluate reward models used in formal theorem proving. This benchmark addresses the challenge of sparse credit assignment in reinforcement learn…