New metric detects and reduces LLM legal citation hallucinations

By PulseAugur Editorial · [1 sources] · 2026-06-02 04:00

Researchers have developed a new metric called Citation Grounding (CG) to detect and reduce hallucinations in Large Language Models (LLMs) when generating legal citations. This metric, tested against a large dataset of Ukrainian court decisions, breaks down hallucinations into precision, relevance, and temporality issues. To address these issues without human annotation, they also introduced Citation Grounding DPO (CG-DPO), a method that fine-tunes LLMs using algorithmically generated preference pairs, achieving high accuracy in distinguishing correct from corrupted citations. AI

IMPACT Introduces a novel evaluation framework for LLM legal citation accuracy, potentially improving reliability in legal AI applications.

RANK_REASON Academic paper introducing a new metric and method for evaluating LLM safety. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Volodymyr Ovcharov · 2026-06-02 04:00

Citation Grounding: Detecting and Reducing LLM Citation Hallucinations via Legal Citation Graphs

arXiv:2606.00898v1 Announce Type: new Abstract: Large language models systematically hallucinate legal citations -- fabricating statute references, citing repealed provisions, and confusing jurisdictions -- yet no automated method exists to measure or reduce this behavior at scal…

COVERAGE [1]

Citation Grounding: Detecting and Reducing LLM Citation Hallucinations via Legal Citation Graphs

RELATED ENTITIES

RELATED TOPICS