Google DeepMind has released an AI system called "AI Co-Mathematician" designed to collaborate with human mathematicians on complex problems. This system, built on Gemini 3.1 Pro, achieved a new state-of-the-art score of 48% on the challenging FrontierMath Tier 4 benchmark, significantly outperforming existing models like GPT-5.5 Pro. The AI functions as an asynchronous workspace with a coordinator agent that breaks down tasks, manages parallel research streams, and persistently stores failed hypotheses, mirroring workflows seen in software development. AI
影响 This system demonstrates a new paradigm for AI collaboration in research, potentially accelerating discoveries in complex fields like mathematics.
排序理由 The cluster describes a new AI system for mathematical research and its performance on a specialized benchmark, including its use in solving a previously unsolved problem.
- AI Co-Mathematician
- Alex Davies
- Claude Opus 4.6
- Claude Opus 4.7
- Daniel M. Roy
- Daniel Zheng
- Epoch AI
- FrontierMath Tier 4
- Gemini 3.1 Pro
- Google DeepMind
- GPT-5.4 Pro
- GPT-5.5 Pro
- Kourovka Notebook
- Marc Lackenby
- Pushmeet Kohli
- Claude Code
- Oxford
AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →