PulseAugur

MMLU-Hard

PulseAugur coverage of MMLU-Hard — every cluster mentioning MMLU-Hard across labs, papers, and developer communities, ranked by signal.

Total · 30d: 1 (1 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 1 (1 over 90d)
RECENT · PAGE 1/1 · 1 TOTAL
  1. TOOL · CL_18587

    Homogeneous multi-agent debate is less effective than self-correction

    A new research paper, "The Cost of Consensus," reports that homogeneous multi-agent debate among LLMs is less effective and more costly than isolated self-correction. The study, using models like Qwen2.5-7B and Llama-3.…