Formal Conjectures: An Open and Evolving Benchmark for Verified Discovery in Mathematics
Researchers have introduced Formal Conjectures, a new benchmark for evaluating automated reasoning systems in mathematics. The evolving dataset, formalized in Lean 4, comprises more than 2,600 mathematical problem statements, including 1,029 open research conjectures and 836 solved problems. The benchmark supports collaboration between mathematicians and AI systems and has already contributed to resolving open conjectures, which points to its potential for AI-driven mathematical discovery.
AI IMPACT: Advances AI capabilities in formal mathematics and aids in discovering new mathematical proofs.
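
To give a sense of what a formalized open conjecture can look like in Lean 4, here is a minimal sketch. It is a hypothetical illustration and is not taken from the benchmark: the names `IsPrime` and `goldbach` are assumptions made for this example, and the benchmark's own definitions and conventions may differ. The statement is fully precise, while the proof is left as `sorry`, which marks it as unproven.

```lean
-- Hypothetical illustration; not an entry from the Formal Conjectures dataset.
-- `IsPrime` is defined locally so the snippet needs no external library.
def IsPrime (n : Nat) : Prop :=
  2 ≤ n ∧ ∀ d : Nat, d ∣ n → d = 1 ∨ d = n

/-- Goldbach's conjecture: every even natural number greater than 2
is the sum of two primes. -/
theorem goldbach (n : Nat) (hgt : 2 < n) (heven : n % 2 = 0) :
    ∃ p q : Nat, IsPrime p ∧ IsPrime q ∧ p + q = n := by
  sorry -- open problem: no proof is known
```

A system evaluated on a statement of this kind would have to replace `sorry` with a proof that Lean's kernel accepts, so any claimed solution can be checked mechanically.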