Dafny

ENTITY Dafny

Dafny

PulseAugur coverage of Dafny — every cluster mentioning Dafny across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

6

6 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

6

6 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/1 · 6 TOTAL

TOOL · CL_109991 · Jun 25 · 04:00

New benchmark MINIF2F-DAFNY tests LLMs for mathematical theorem proving

Researchers have developed MINIF2F-DAFNY, a new benchmark for evaluating Large Language Models (LLMs) in mathematical theorem proving. This system translates the miniF2F benchmark to Dafny, an auto-active verifier, enab…
TOOL · CL_70372 · Jun 4 · 04:00

New benchmark reveals AI struggles with verified code generation

A new benchmark called AlgoVeri has been developed to evaluate the performance of AI models in generating formally verified code for classical algorithms. The benchmark tests models across three languages: Dafny, Verus,…
RESEARCH · CL_62929 · Jun 1 · 04:00

AI models improve code generation with new verification techniques

Researchers have developed new methods to improve the ability of large language models to generate correct code and proofs. One approach, TTRL-CoCoV, uses confidence-conditioned verification to enhance coverage and accu…
RESEARCH · CL_09852 · Apr 29 · 11:59

Researchers develop graph construction for imperative programs using neural methods

Researchers have developed a pipeline to convert imperative programs and their annotations into typed, attributed graphs. This process combines abstract syntax tree parsing with semantic embeddings from models like Sent…
RESEARCH · CL_06893 · Apr 28 · 04:00

SEVerA framework verifies self-evolving AI agents for safety and correctness

Researchers have introduced SEVerA, a framework designed to synthesize self-evolving AI agents with formal safety and correctness guarantees. This approach treats agentic code generation as a constrained learning proble…
RESEARCH · CL_05024 · Apr 24 · 14:28

AI models achieve high verification success with formal code generation

Researchers have developed a new dataset, NL2VC-60, containing 60 algorithmic problems to aid in generating verified code from natural language. They evaluated seven open-weight LLMs using various prompting strategies, …