ENTITY
RewardBench
PulseAugur coverage of RewardBench — every cluster mentioning RewardBench across labs, papers, and developer communities, ranked by signal.
Total · 30d: 7 (7 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 7 (7 over 90d)
SENTIMENT · 30D: 1 day with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
- EvoPref algorithm enhances LLM alignment with evolutionary optimization
Researchers have developed EvoPref, a novel multi-objective evolutionary algorithm designed to improve the alignment of large language models (LLMs). Unlike traditional gradient-based methods that can lead to preference…
- Researchers develop new methods to debias and improve reward models for LLMs
Researchers have developed new methods to improve the reliability and interpretability of reward models (RMs) used in aligning large language models (LLMs). One approach introduces a causally motivated intervention tech…