English(EN) Optimizing the Cost-Quality Tradeoff of Agentic Theorem Provers in Lean

LLM针对Lean中的高效形式定理证明进行了优化

作者 PulseAugur 编辑部 · [3 个来源] · 2026-06-01 04:00

两篇新研究论文探讨了在Lean环境中提高大型语言模型（LLM）在形式定理证明中的效率和有效性的方法。第一篇论文介绍了一种动作路由代理，该代理通过使用编译器反馈来指导搜索并降低计算成本，从而优化成本-质量权衡。第二篇论文提出了一种“反馈蒸馏”训练方法，该方法利用语言模型的反馈来改进令牌级别的监督和探索，在生成多样化且成功的证明轨迹方面优于传统的强化学习技术。 AI

影响这些论文提出了使LLM在形式定理证明等复杂推理任务中更高效、更有效的新技术，有可能加速AI在数学和科学发现中的应用。

排序理由两篇在arXiv上发表的学术论文，详细介绍了改进LLM在形式定理证明中性能的新颖方法。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。我们如何撰写摘要 →

报道来源 [3]

arXiv cs.CL TIER_1 English(EN) · K\'ari R\"ognvaldsson, Chenhao Sun, Jasper Dekoninck, Martin Vechev · 2026-06-04 04:00

优化Lean中智能定理证明器的成本-质量权衡

arXiv:2606.04883v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used in workflows for generating formal proofs in Lean. These workflows often decompose problems into smaller lemmas, sample many proof attempts, and use compiler feedback to guide searc…
arXiv cs.CL TIER_1 English(EN) · Martin Vechev · 2026-06-03 13:46

优化Lean中智能定理证明器的成本-质量权衡

Large language models (LLMs) are increasingly used in workflows for generating formal proofs in Lean. These workflows often decompose problems into smaller lemmas, sample many proof attempts, and use compiler feedback to guide search. However, they can be prohibitively expensive,…
arXiv cs.AI TIER_1 English(EN) · Gaetan Narozniak, G\'erard Biau, R\'emi Munos, Ahmad Rammal, Pierre Marion · 2026-06-01 04:00

提炼大型语言模型反馈以实现精简定理证明

arXiv:2605.30861v1 Announce Type: new Abstract: Post-training for reasoning models typically combines supervised fine-tuning with reinforcement learning from verifiable rewards, most commonly with GRPO. However, this algorithm suffers from sparse rewards, limited exploration, and…

报道来源 [3]

优化Lean中智能定理证明器的成本-质量权衡

优化Lean中智能定理证明器的成本-质量权衡

提炼大型语言模型反馈以实现精简定理证明

相关实体

相关话题