PulseAugur
EN
LIVE 21:58:31

Wonda pipeline enhances SLM program verification with curated data

Researchers have developed a data curation pipeline called Wonda to improve the training of Small Language Models (SLMs) for program verification. This pipeline normalizes raw verifier output and uses LLMs to rewrite and augment invariants, ensuring provable quality. Fine-tuning SLMs like Qwen3, Llama-3.1, and Mistral AI on Wonda-curated data significantly boosts invariant correctness and speedup rates. Notably, a 4B Qwen3 model achieved performance comparable to much larger models like GPT-OSS-120B and even matched the verification time of GPT-5.2 on the InvBench suite. AI

IMPACT This research could accelerate the development and adoption of smaller, more efficient language models for specialized tasks like program verification.

RANK_REASON The cluster contains an academic paper detailing a new method for data curation to improve SLM performance on program verification tasks. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Wonda pipeline enhances SLM program verification with curated data

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Ido Pinto, Yizhak Yisrael Elboher, Haoze Wu, Nina Narodytska, Guy Katz ·

    Not All Invariants Are Equal: Curating Training Data to Accelerate Program Verification with SLMs

    arXiv:2603.15510v2 Announce Type: replace Abstract: The synthesis of inductive loop invariants remains a critical bottleneck in automated program verification. While Large Language Models (LLMs) show promise in mitigating this issue, they often fail on complex programs, producing…