PulseAugur
EN
LIVE 22:35:19

AI models hallucinate citations, new benchmark reveals

Leading AI models such as GPT and Gemini frequently provide correct answers while citing non-existent or irrelevant evidence. This phenomenon, termed "attribution hallucination" by researchers at Peking University, poses a significant risk in critical sectors like law and medicine. To address this, a new benchmark called CiteVQA has been developed to systematically evaluate and identify these citation errors. AI

IMPACT New benchmark CiteVQA highlights attribution hallucination in AI models, posing risks for regulated industries and prompting development of more reliable citation methods.

RANK_REASON The cluster describes a new academic benchmark for evaluating AI model behavior. [lever_c_demoted from research: ic=1 ai=1.0]

Read on The Decoder →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

AI models hallucinate citations, new benchmark reveals

COVERAGE [1]

  1. The Decoder TIER_1 English(EN) · Jonathan Kemper ·

    AI models often give the right answers but point to the wrong sources

    <p><img alt="Graphic: AI tool highlights text passages in a PDF to identify errors and incorrect information." class="attachment-full size-full wp-post-image" height="1047" src="https://the-decoder.com/wp-content/uploads/2026/05/CiteVQA-Spot-Error-PDF-Hallucination-Nano-Banana-Pr…