PulseAugur
LIVE 04:24:03
ENTITY GSM-Hard

GSM-Hard

PulseAugur coverage of GSM-Hard — every cluster mentioning GSM-Hard across labs, papers, and developer communities, ranked by signal.

Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
  1. TOOL · CL_18587 ·

    Homogeneous multi-agent debate is less effective than self-correction

    A new research paper, "The Cost of Consensus," reveals that homogeneous multi-agent debate among LLMs is less effective and more costly than isolated self-correction. The study, using models like Qwen2.5-7B and Llama-3.…

  2. TOOL · CL_15467 ·

    New SGDe framework compiles workflows for small language models

    Researchers have developed Semantic Gradient Descent (SGDe), a novel teacher-student framework designed to compile complex agentic workflows into deterministic structures for enterprise deployment of smaller language mo…