PulseAugur
Dafny

PulseAugur coverage of Dafny: every cluster mentioning Dafny across labs, papers, and developer communities, ranked by signal.

Total · 30d: 6 (6 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 6 (6 over 90d)
SENTIMENT · 30D: 3 days with sentiment data

RECENT · PAGE 1/1 · 6 TOTAL
  1. TOOL · CL_109991

    New benchmark MINIF2F-DAFNY tests LLMs for mathematical theorem proving

    Researchers have developed MINIF2F-DAFNY, a new benchmark for evaluating large language models (LLMs) on mathematical theorem proving. The benchmark translates miniF2F to Dafny, an auto-active verifier, enab…

  2. TOOL · CL_70372

    New benchmark reveals AI struggles with verified code generation

    AlgoVeri is a new benchmark that evaluates how well AI models generate formally verified code for classical algorithms. It tests models across three languages: Dafny, Verus,…

  3. RESEARCH · CL_62929

    AI models improve code generation with new verification techniques

    Researchers have developed new methods to improve the ability of large language models to generate correct code and proofs. One approach, TTRL-CoCoV, uses confidence-conditioned verification to enhance coverage and accu…

  4. RESEARCH · CL_09852

    Researchers develop graph construction for imperative programs using neural methods

    Researchers have developed a pipeline to convert imperative programs and their annotations into typed, attributed graphs. This process combines abstract syntax tree parsing with semantic embeddings from models like Sent…

  5. RESEARCH · CL_06893

    SEVerA framework verifies self-evolving AI agents for safety and correctness

    Researchers have introduced SEVerA, a framework designed to synthesize self-evolving AI agents with formal safety and correctness guarantees. This approach treats agentic code generation as a constrained learning proble…

  6. RESEARCH · CL_05024

    AI models achieve high verification success with formal code generation

    Researchers have developed a new dataset, NL2VC-60, containing 60 algorithmic problems to aid in generating verified code from natural language. They evaluated seven open-weight LLMs using various prompting strategies, …