PulseAugur
LIVE 07:18:18
research · [3 sources] ·
0
research

New RuC framework generates HDL-agnostic benchmarks for LLM code completion

Researchers have developed RuC, a new framework for generating hardware description language (HDL) code completion benchmarks. This system is grammar-driven and language-agnostic, allowing for controlled evaluation of Large Language Models (LLMs) in Register Transfer Level (RTL) development. RuC masks code regions based on HDL grammar and prompts models to regenerate them, enabling assessment of capabilities from simple assignments to entire logic blocks. A study using RuC on SystemVerilog benchmarks from Tiny Tapeout and a RISC-V core showed that completion performance is influenced by model type, masked region structure, and prompting strategy, with Fill-in-the-Middle (FIM) yielding the best results. AI

Summary written by gemini-2.5-flash-lite from 3 sources. How we write summaries →

IMPACT Provides a more granular and controlled method for evaluating LLMs in RTL development, potentially improving model performance for hardware design tasks.

RANK_REASON Academic paper introducing a new benchmark generation framework for LLMs in hardware description languages.

Read on arXiv cs.AI →

COVERAGE [3]

  1. arXiv cs.AI TIER_1 · Arnau Ayguad\'e Domingo, Miquel Alberti-Binimelis, Cristian Gutierrez-Gomez, Emanuele Parisi, Razine Moundir Ghorab, Miquel Moreto, Gokcen Kestor, Dario Garcia-Gasulla ·

    RuC: HDL-Agnostic Rule Completion Benchmark Generation

    arXiv:2604.27780v1 Announce Type: cross Abstract: Large Language Models (LLMs) have rapidly improved in performance across code-related tasks, making their integration into Register Transfer Level (RTL) development increasingly attractive. Mimicking the behavior of inline code as…

  2. arXiv cs.AI TIER_1 · Dario Garcia-Gasulla ·

    RuC: HDL-Agnostic Rule Completion Benchmark Generation

    Large Language Models (LLMs) have rapidly improved in performance across code-related tasks, making their integration into Register Transfer Level (RTL) development increasingly attractive. Mimicking the behavior of inline code assistants, many benchmarks evaluate LLMs' capabilit…

  3. Hugging Face Daily Papers TIER_1 ·

    RuC: HDL-Agnostic Rule Completion Benchmark Generation

    Large Language Models (LLMs) have rapidly improved in performance across code-related tasks, making their integration into Register Transfer Level (RTL) development increasingly attractive. Mimicking the behavior of inline code assistants, many benchmarks evaluate LLMs' capabilit…