Researchers have introduced Formal Conjectures, a new benchmark designed to evaluate automated reasoning systems in mathematics. This evolving dataset, formalized in Lean 4, comprises over 2600 mathematical problem statements, including 1029 open research conjectures and 836 solved problems. The benchmark facilitates collaboration between mathematicians and AI systems, and has already contributed to resolving open conjectures, demonstrating its potential for advancing AI-driven mathematical discovery. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT Advances AI capabilities in formal mathematics and aids in discovering new mathematical proofs.
RANK_REASON The cluster describes a new benchmark for AI in mathematics, including its methodology and initial results. [lever_c_demoted from research: ic=1 ai=1.0]