Lean 4 Programming Language
PulseAugur coverage of Lean 4 Programming Language — every cluster mentioning Lean 4 Programming Language across labs, papers, and developer communities, ranked by signal.
15 day(s) with sentiment data
-
New study tests AI proof formalization models for robustness
A new study on arXiv evaluates the robustness of proof autoformalization models, which translate natural language mathematical proofs into formal languages like Lean 4. Researchers introduced global and local perturbati…
-
New framework certifies faithfulness in AI-generated math proofs
Researchers have introduced Bidirectional Provability Fingerprinting (BPF), a new framework designed to certify the faithfulness of autoformalized mathematical statements. This method addresses the challenge where trans…
-
Formal statistical learning theory formalized in Lean 4 with AI aid
Researchers have developed a formalization of statistical learning theory using Lean 4, a proof assistant, to establish a rigorous foundation for machine learning theory. This project involved a human-AI collaboration w…
-
New Architecture Achieves Near-Infinite Temporal Consistency in World Models
A new research paper introduces the Physics-Grounded Symbolic Architecture (PGSA), which overcomes limitations in current statistical World Models. Unlike existing models that require Gaussian dynamics for linear identi…
-
AI agents can use signed compression progress for robust intrinsic motivation
A new research paper proposes a method called "signed compression progress" as a more robust form of intrinsic motivation for AI agents. This approach aims to ensure that an agent's reward is directly tied to genuine le…
-
LLMs evaluated for formal math proofs in Lean 4
A new research paper evaluates the performance of various Large Language Models (LLMs) in generating formal mathematical proofs using the Lean 4 theorem prover. The study employed pass@k and refine@k metrics on subsets …
-
Developer uses AI to formally verify polygon intersection algorithm
A developer has created a formally verified implementation for polygon intersection, a standard feature in vector graphics editors. This project utilized AI agents, with recent models capable of generating algorithm imp…
-
Dataset formalizes graduate algebra problems for AI verification
Researchers have developed Lean-GAP, a dataset containing 430 formalized graduate-level algebra problems derived from the textbook "Abstract Algebra" by Dummit and Foote. The process involved a pipeline for PDF-to-LaTeX…
-
AI framework resolves open math problem using formal verification
Researchers have developed a novel framework that merges informal reasoning with formal verification to tackle complex mathematical problems. This system, comprising an informal agent named Rethlas and a formal agent ca…
-
New FVSpec benchmark tests AI on formal software verification
Researchers have developed a new benchmark called FVSpec to evaluate AI models on formal software verification tasks. The benchmark was created by translating over 2,700 real-world Python property-based tests into more …
-
New EVA method improves LLM reward modeling for math verification
Researchers have introduced Expected Value Alignment (EVA), a new procedure for training reward models used with large language models in formal mathematics verification. EVA addresses a trade-off in existing models by …
-
AI frameworks boost formal theorem proving with new techniques
Researchers have developed new frameworks to enhance formal theorem proving capabilities using large language models. Goedel-Architect utilizes a blueprint generation and refinement strategy, achieving state-of-the-art …
-
ProofWala framework enables multilingual theorem-proving research
Researchers have developed ProofWala, a new framework designed to facilitate multilingual proof data synthesis and theorem-proving for neural approaches. This framework includes a reusable library for interacting with i…
-
New FVSpec Benchmark Tests AI in Formal Software Verification
Researchers have introduced FVSpec, a new benchmark designed to evaluate AI models and agents in formal software verification tasks. The benchmark involves translating property-based tests from Python into specification…
-
Lean 4 library offers verified mathematical finance theorems
Researchers have developed a comprehensive library of mathematical finance theorems using the Lean 4 proof assistant. This library, built upon Mathlib and the BrownianMotion package, includes over two hundred theorems c…
-
AI agents formalize 26 math textbooks, creating verified library
Researchers have developed AutoformBot, a multi-agent system that uses LLMs and formal verification tools to translate informal mathematical prose into machine-checked code. This system has been applied to 26 mathematic…
-
New Lean 4 Theorem Prover Achieves Strong Benchmark Results
Researchers have developed MerLean-Prover, an end-to-end theorem prover for Lean 4 that generates kernel-checkable proofs. The system utilizes a recursive loop with three agent types (Planning, Check, and Lean) and has …
-
TorchLean framework formalizes neural networks in Lean 4 theorem prover
Researchers have developed TorchLean, a framework that formalizes neural networks within the Lean 4 theorem prover. This system allows for the execution and verification of neural networks directly within the same envir…
-
Formalization of ML generalization bounds achieved in Lean 4
Researchers have formalized generalization error bounds using Rademacher complexity in the Lean 4 proof assistant. This work builds upon measure-theoretic probability theory within the Mathlib library. The formalization…
-
Lean 4 theorem proving accelerated with proof-state snapshotting
Researchers have developed a new method called proof-state snapshotting to significantly speed up automated theorem proving in Lean 4. This technique addresses the inefficiency of repeatedly reconstructing proof states …