PulseAugur
EN
LIVE 01:57:02
ENTITY FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI

FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI

PulseAugur coverage of FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI — every cluster mentioning FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
  1. RESEARCH · CL_22521 ·

    AI Co-Mathematician accelerates research with agentic support for mathematicians

    Researchers have developed an AI co-mathematician system designed to assist mathematicians in their research workflows. This system provides comprehensive support for tasks such as ideation, literature review, computati…

  2. FRONTIER RELEASE · CL_02231 ·

    OpenAI's GPT-5.2 advances science and math, with evaluations showing low catastrophic risk

    OpenAI has released GPT-5.2, a new model demonstrating significant advancements in mathematical and scientific reasoning. The model achieved high scores on benchmarks like GPQA Diamond and FrontierMath, indicating impro…