English(EN) Evaluating the Robustness of Proof Autoformalization in Lean 4

新研究测试AI证明形式化模型的鲁棒性

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-16 04:00

arXiv上的一项新研究评估了证明自动形式化模型的鲁棒性，这些模型将自然语言数学证明翻译成Lean 4等形式化语言。研究人员对非正式证明引入了全局和局部扰动，以测试模型的_一致性_和_忠实性_。评估发现，七个近期模型对全局释义敏感，并且在很大程度上未能准确反映符号或证明步骤的局部变化。 AI

排序理由该集群包含一篇学术论文，详细介绍了新的AI模型评估方法和基准。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Zhengtao Gui, Sheng Yang, Zhouxing Shi · 2026-06-16 04:00

Evaluating the Robustness of Proof Autoformalization in Lean 4

arXiv:2606.14867v1 Announce Type: cross Abstract: Proof autoformalization aims to translate a mathematical informal proof written in natural language into a formal proof in a formal language such as Lean~4. Several works have developed LLM-based models for proof autoformalization…