Lean 4 autoformalization sensitive to surface phrasing, not semantics

By PulseAugur Editorial · [1 source] · 2026-04-28 04:00

Researchers investigated how natural-language variation affects Lean 4 autoformalization and found that semantically equivalent paraphrases of the same theorem statement can yield different formal outputs. The study, which evaluated GPT-family models and open-weight autoformalizers on the ProofNet# and miniF2F datasets, attributes this sensitivity primarily to compilation failures rather than to semantic disagreement between the outputs. The findings suggest that future work should focus on compilation reliability rather than on the semantic layer of these systems.

IMPACT Suggests focusing training on compilation rather than semantic layers for autoformalization tools.

RANK_REASON Academic paper on autoformalization in Lean 4.



AI-generated summary · Google Gemini · from 1 source.


COVERAGE [1]

arXiv cs.LG TIER_1 English(EN) · William Feng, Ethan Lou, Aryan Sharma · 2026-04-28 04:00

Surface Sensitivity in Lean 4 Autoformalization

arXiv:2604.23135v1. Abstract: Natural-language variation poses a key challenge in Lean autoformalization: semantically equivalent paraphrases of the same theorem statements can induce divergent formal outputs, yet it remains unclear whether this variation reflec…

