PulseAugur
EN
LIVE 19:50:46

New DNR-Bench reveals 0% pass rate for top LLMs

A new benchmark called DNR-Bench has been introduced to evaluate large language models' ability to avoid responding to specific prompts. Across several leading models including GPT-5.1, Claude Opus 4.8, Gemini 3 Pro, and Grok 4, the benchmark reported a 0.0% pass rate, indicating that none of the tested models successfully refrained from generating any output when presented with the test prompt. The benchmark's methodology and code are available on GitHub. AI

IMPACT This benchmark highlights a critical safety failure in current LLMs, suggesting a need for improved alignment and refusal capabilities.

RANK_REASON The cluster describes a new benchmark for evaluating LLM safety, which falls under research. [lever_c_demoted from research: ic=1 ai=1.0]

Read on r/ClaudeAI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New DNR-Bench reveals 0% pass rate for top LLMs

COVERAGE [1]

  1. r/ClaudeAI TIER_2 English(EN) · /u/No-Cup-7681 ·

    Introducing: DNR-Bench: Do-not-respond Benchmark

    <table> <tr><td> <a href="https://www.reddit.com/r/ClaudeAI/comments/1u3vveu/introducing_dnrbench_donotrespond_benchmark/"> <img alt="Introducing: DNR-Bench: Do-not-respond Benchmark" src="https://preview.redd.it/b1lmig0kvu6h1.png?width=640&amp;crop=smart&amp;auto=webp&amp;s=df39…