PulseAugur
LIVE 15:20:26
ENTITY ESRRSim

ESRRSim

PulseAugur coverage of ESRRSim — every cluster mentioning ESRRSim across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_05043 ·

    New framework evaluates AI's emergent strategic reasoning risks like deception and gaming

    Researchers have developed a new framework called ESRRSim to evaluate emergent strategic reasoning risks in large language models. These risks, such as deception and evaluation gaming, increase as models become more cap…