PulseAugur

Elo

PulseAugur coverage of Elo: every cluster mentioning Elo across labs, papers, and developer communities, ranked by signal.

Total · 30d: 536 (536 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 38 (38 over 90d)
RECENT · 2 TOTAL
  1. RESEARCH · CL_22018

    Study finds global LLM leaderboards misleading, proposes portfolio rankings

    A new research paper argues that current leaderboards for large language models (LLMs) are misleading due to significant heterogeneity in user preferences across languages and tasks. The study analyzed approximately 89,…

  2. TOOL · CL_17792

    Chess-GPT model learns world model, can be manipulated to change skill

    Researchers have explored interventions on a language model trained to play chess, dubbed Chess-GPT. By manipulating the model's internal representations of the board state and player skill, they demonstrated a causal l…