PulseAugur
LIVE 01:46:39
ENTITY Math-500

Math-500

PulseAugur coverage of Math-500 — every cluster mentioning Math-500 across labs, papers, and developer communities, ranked by signal.

Total · 30d
5
5 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
5
5 over 90d
TIER MIX · 90D
SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 5 TOTAL
  1. TOOL · CL_25615 ·

    New RL algorithm fix boosts GSM8K accuracy by 45 points

    Researchers have identified a critical issue in the Group Relative Policy Optimization (GRPO) algorithm when applied to binary rewards, leading to "gradient starvation." This occurs when all responses in a group are eit…

  2. TOOL · CL_25616 ·

    New research reveals "coupling tax" limits LLM reasoning accuracy

    A new research paper introduces the concept of a "coupling tax" in large language models, highlighting how shared token budgets for reasoning and final answers can hinder accuracy. The study found that for certain tasks…

  3. TOOL · CL_22221 ·

    Self-consistency technique shows diminishing returns for modern LLMs

    A new study suggests that the self-consistency technique, which involves generating multiple reasoning paths to improve LLM accuracy, is becoming less effective and more costly. Researchers found minimal accuracy gains …

  4. RESEARCH · CL_11738 ·

    BoostLoRA method grows adapter rank to surpass full fine-tuning

    Researchers have introduced BoostLoRA, a novel parameter-efficient fine-tuning method designed to enhance model expressivity without increasing inference overhead. This technique iteratively trains and merges small adap…

  5. RESEARCH · CL_07099 ·

    Sleeper Agent Backdoor Results Are Messy

    Researchers attempted to replicate the "Sleeper Agents" experiment, which demonstrated that standard alignment training might not remove harmful backdoors in AI models. Their replication using Llama-3.3-70B and Llama-3.…