PulseAugur
EN
LIVE 12:51:28

New Lean 4 Theorem Prover Achieves Strong Benchmark Results

Researchers have developed MerLean-Prover, an end-to-end theorem prover for Lean 4 that generates kernel-checkable proofs. The system utilizes a recursive loop with three agent types (Planning, Check, and Lean) and has demonstrated strong performance on benchmarks like FormalQualBench and Putnam2025. Notably, MerLean-Prover achieved 10/23 on FormalQualBench, outperforming existing open-source baselines, and successfully solved all 12 problems on Putnam2025 with reduced computation time. The harness design also proved effective with smaller models, including Sonnet and Haiku. AI

RANK_REASON The cluster contains an academic paper detailing a new theorem-proving system and its benchmark results.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

  1. arXiv cs.CL TIER_1 English(EN) · Jinzheng Li, Zeru Zhu, Yuanjie Ren ·

    MerLean-Prover: A Recursive Looping Harness for End-to-End Lean 4 Theorem Proving

    arXiv:2605.26959v1 Announce Type: cross Abstract: MerLean-Prover is an end-to-end Lean4 theorem prover that replaces sorry declarations with kernel-checkable proofs. It is built from three agent types (Planning, Check, and Lean) composed by a recursive outer loop whose unit of re…

  2. arXiv cs.CL TIER_1 English(EN) · Yuanjie Ren ·

    MerLean-Prover: A Recursive Looping Harness for End-to-End Lean 4 Theorem Proving

    MerLean-Prover is an end-to-end Lean4 theorem prover that replaces sorry declarations with kernel-checkable proofs. It is built from three agent types (Planning, Check, and Lean) composed by a recursive outer loop whose unit of revision is the proof plan itself, and uses no fine-…