New research tackles LLM hallucinations with novel methods and benchmarks

arXiv cs.CL TIER_1 English(EN) · Hao Yin, Guangzong Si, Zilei Wang · 2026-06-05 04:00

The Mirage of Performance Gains: Why Contrastive Decoding Fails to Mitigate Object Hallucinations in MLLMs?

arXiv:2504.10020v4 Announce Type: replace Abstract: Contrastive decoding strategies are widely used to reduce object hallucinations in multimodal large language models (MLLMs). These methods work by constructing contrastive samples to induce hallucinations and then suppressing th…

arXiv cs.AI TIER_1 English(EN) · Saroj Mishra · 2026-06-04 04:00

Cascading Hallucination in Agentic RAG: The CHARM Framework for Detection and Mitigation

arXiv:2606.04435v1 Announce Type: new Abstract: Multi-step agentic retrieval-augmented generation (RAG) pipelines have demonstrated significant capability for complex reasoning tasks, yet remain vulnerable to a class of failure that existing hallucination detection mechanisms sys…

arXiv cs.AI TIER_1 English(EN) · Bodla Krishna Vamshi, Rohan Bhatnagar, Haizhao Yang · 2026-06-04 04:00

Geometry-Aware Hallucination Detection in Large Language Models

arXiv:2601.06196v3 Announce Type: replace-cross Abstract: Large language models (LLMs) frequently generate factually incorrect or unsupported content, commonly referred to as hallucinations. Prior work has explored decoding strategies, retrieval augmentation, and supervised fine-…

arXiv cs.CL TIER_1 English(EN) · Litian Liu, Reza Pourreza, Sunny Panchal, Apratim Bhattacharyya, Yubing Jian, Yao Qin, Roland Memisevic · 2026-06-04 04:00

Enhancing Hallucination Detection through Noise Injection

arXiv:2502.03799v4 Announce Type: replace Abstract: Large Language Models (LLMs) are prone to generating plausible yet incorrect responses, known as hallucinations. Effectively detecting hallucinations is therefore crucial for the safe deployment of LLMs. Recent research has link…

arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Saroj Mishra · 2026-06-03 04:33

Cascading Hallucination in Agentic RAG: The CHARM Framework for Detection and Mitigation

Multi-step agentic retrieval-augmented generation (RAG) pipelines have demonstrated significant capability for complex reasoning tasks, yet remain vulnerable to a class of failure that existing hallucination detection mechanisms systematically miss: cascading hallucination, where…

arXiv cs.AI TIER_1 English(EN) · Chenshuang Zhang, Kyeong Seon Kim, Chengxin Liu, Tae-Hyun Oh · 2026-06-03 04:00

SVHalluc: Benchmarking Speech-Vision Hallucination in Audio-Visual Large Language Models

arXiv:2606.02642v1 Announce Type: cross Abstract: Despite the success of audio-visual large-language models (LLMs), they can produce plausible but ungrounded outputs, termed hallucination. Existing benchmarks focus on environmental sounds (e.g., dog barking) to indicate event occ…

arXiv cs.AI TIER_1 English(EN) · Ruipeng Zhang, Zhihao Li, Haozhang Yuan, C. L. Philip Chen, Tong Zhang · 2026-06-03 04:00

P\textsuperscript{2}-DPO: Grounding Hallucination in Perceptual Processing via Calibration Direct Preference Optimization

arXiv:2606.03376v1 Announce Type: cross Abstract: Hallucination has recently garnered significant research attention in Large Vision-Language Models (LVLMs). Direct Preference Optimization (DPO) aims to learn directly from the corrected preferences provided by humans, thereby add…

arXiv cs.AI TIER_1 English(EN) · Mingkuan Zhao, Wentao Hu, Tianchen Huang, Yuheng Min, Suquan Chen, Yide Gao, Yanbo Zhai, Shuangyong Song, Xuelong Li · 2026-06-03 04:00

Hallucinations as Orthogonal Noise: Inference-Time Manifold Alignment via Dynamic Contextual Orthogonalization

arXiv:2606.03022v1 Announce Type: cross Abstract: Hallucination in Large Language Models (LLMs), characterized by the generation of content inconsistent with contextual facts or logical constraints -- remains a persistent challenge for reliable deployment. In this work, we addres…

arXiv cs.LG TIER_1 English(EN) · Mahdi Erfanian, Nelson Daniel Troncoso, Aashna Garg, Amabel Gale, Xiaoyu Liu, Pareesa Ameneh Golnari, Shengyu Fu · 2026-06-03 04:00

Synthetic Hallucinations, Real Gains: Hard Negatives from Frontier Models for FIM Hallucination Mitigation

arXiv:2606.03130v1 Announce Type: new Abstract: Small open-source code models that power IDE autocomplete still emit hallucinated Fill-in-the-Middle (FIM) completions: syntactically natural calls to methods, parameters, variables, and imports that do not exist in the surrounding …

arXiv cs.CL TIER_1 English(EN) · Aizierjiang Aiersilan · 2026-06-03 04:00

Hallucination Is Linearly Decodable from Mid-Layer Hidden States in Quantized LLMs

arXiv:2606.02628v1 Announce Type: cross Abstract: We investigate whether open-source LLMs encode a linearly separable truthfulness signal in their hidden states, and at which network depth this signal is strongest. Across three $7$B--$8$B instruction-tuned models (Llama-3.1-8B, M…

arXiv cs.AI TIER_1 English(EN) · Yuetian Lu, Yihong Liu, Sebastian Gerstner, Lea Hirlimann, Jonas Rohweder, Hinrich Sch\"utze · 2026-06-03 04:00

Relational Linearity is a Predictor of Hallucinations

arXiv:2601.11429v2 Announce Type: replace-cross Abstract: Hallucination is a central failure mode of language models (LMs). We focus on hallucinations in response to questions like: "Which instrument did Glenn Gould play?", but we ask these questions for synthetic entities design…

arXiv cs.AI TIER_1 English(EN) · Lin Li, Georgia Channing, Suhaas M Bhat, Gabriel Davis Jones, Yarin Gal · 2026-06-03 04:00

Building Reliable Long-Form Generation via Hallucination Rejection Sampling

arXiv:2606.03628v1 Announce Type: cross Abstract: Large language models (LLMs) have achieved remarkable progress in open-ended text generation, yet they remain prone to hallucinating incorrect or unsupported content, which undermines their reliability. This issue is exacerbated i…

arXiv cs.AI TIER_1 English(EN) · Yarin Gal · 2026-06-02 13:26

Building Reliable Long-Form Generation via Hallucination Rejection Sampling

Large language models (LLMs) have achieved remarkable progress in open-ended text generation, yet they remain prone to hallucinating incorrect or unsupported content, which undermines their reliability. This issue is exacerbated in long-form generation due to hallucination snowba…

arXiv cs.CL TIER_1 English(EN) · Tong Zhang · 2026-06-02 09:22

P\textsuperscript{2}-DPO: Grounding Hallucination in Perceptual Processing via Calibration Direct Preference Optimization

Hallucination has recently garnered significant research attention in Large Vision-Language Models (LVLMs). Direct Preference Optimization (DPO) aims to learn directly from the corrected preferences provided by humans, thereby addressing the hallucination issue. Despite its succe…

arXiv cs.AI TIER_1 English(EN) · Kaixiang Zhao, Tianrun Yu, Shawn Huang, Porter Jenkins, Yushun Dong, Amanda Hughes · 2026-06-02 04:00

TIGER: Traceable Inference with Graph-Based Evidence Routing for Mitigating Hallucinations in Multimodal Generation

arXiv:2606.00232v1 Announce Type: new Abstract: We study fact-level repair for multimodal generation, where a fluent output may contain specific facts that are not supported by the input. Existing inference-time repair methods often generate feedback by jointly conditioning on th…

arXiv cs.LG TIER_1 English(EN) · Yun-Chen Cheng, Che-Yu Lin, Cheng-Lin Yang · 2026-06-02 04:00

Score $\times$ Decoder: A Unified View of Unsupervised Inference-Time Scaling for Hallucination Mitigation

arXiv:2606.00739v1 Announce Type: new Abstract: Large language models hallucinate even when the answer lies within their parameters. While inference-time scaling can surface this latent knowledge, the most effective methods require supervision: a trained verifier or reward model.…

arXiv cs.LG TIER_1 English(EN) · Wentao Ye, Liyao Li, Zhiqing Xiao, Muzhi Zhu, Jiaqi Hu, Zhanming Shen, Xiaomeng Hu, Sean Du, Haobo Wang · 2026-06-02 04:00

FLaG: Fine-Grained Latent Grouping for Hallucination Detection

arXiv:2606.00301v1 Announce Type: new Abstract: Hallucinations in large language models (LLMs) arise from heterogeneous failure mechanisms, making reliable detection difficult for any single global uncertainty score. In this work, we formulate hallucination detection as a mechani…

arXiv cs.CL TIER_1 English(EN) · Yasser Hamidullah, Koel Dutta Chowdhury, Yusser Al Ghussin, Shakib Yazdani, Cennet Oguz, Josef van Genabith, Cristina Espa\~na-Bonet · 2026-06-02 04:00

Grounding or Guessing? Visual Signals for Detecting Hallucinations in Sign Language Translation

arXiv:2510.18439v3 Announce Type: replace Abstract: Hallucination, where models generate fluent text unsupported by visual evidence, remains a major flaw in vision-language models and is particularly critical in sign language translation (SLT). In SLT, meaning depends on precise …

arXiv cs.CL TIER_1 English(EN) · Mohit Singh Chauhan · 2026-06-02 04:00

DECK: A Consistency x Confidence Taxonomy of LLM Hallucinations

arXiv:2606.02289v1 Announce Type: new Abstract: Existing hallucination taxonomies classify LLM errors by what is wrong with the output -- memorised misconceptions, reasoning failures, fluent fabrications. These taxonomies are useful for diagnosis but cannot answer a different que…

arXiv cs.CL TIER_1 English(EN) · Yiming Liao, Zeno Franco, Jose Eduardo Lizarraga Mazaba, Keke Chen · 2026-06-02 04:00

Med-HEAL: Analyzing and Mitigating Hallucinations in Medical LLMs with Hallucination-Aware In-Context Learning

arXiv:2606.01301v1 Announce Type: new Abstract: Hallucinations in medical large language models (LLMs) pose serious risks for clinical decision support, particularly when models must reason over complex electronic health records (EHRs). However, existing benchmarks often lack a r…

arXiv cs.CL TIER_1 English(EN) · S M Tahmid Siddiqui, Akib Jawad Ononto, Anoop Singhal, Latifur Khan · 2026-06-02 04:00

Towards Lightweight Reliability: Using Soft Prompts for Hallucination Mitigation in Large Language Models

arXiv:2606.00919v1 Announce Type: new Abstract: Large language models (LLMs) have seen widespread adoption across various domains, yet their reliability is frequently undermined by hallucinations - responses that are plausible-sounding but factually incorrect. In high-stakes doma…

arXiv cs.AI TIER_1 English(EN) · Buyun Liang, Jinqi Luo, Liangzu Peng, Kwan Ho Ryan Chan, Darshan Thaker, Kaleab A. Kinfu, Fengrui Tian, Hamed Hassani, Ren\'e Vidal · 2026-06-02 04:00

REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations

arXiv:2605.12813v2 Announce Type: replace-cross Abstract: Large language models (LLMs) achieve strong performance across many tasks but remain vulnerable to hallucinations, making it important to systematically evaluate their reliability under realistic adversarial inputs. We for…

arXiv cs.AI TIER_1 English(EN) · Bohan Yang, Yijun Gong, Zhi Zhang, Ge Zhang, Wenpeng Xing, Meng Han · 2026-06-02 04:00

TriLens: Per-Layer Logit-Lens Entropy for White-Box Hallucination Detection

arXiv:2606.01033v1 Announce Type: new Abstract: When a language model hallucinates, the final answer is wrong, but the mistake is not necessarily invisible inside the model. Different internal pathways may remain uncertain, disagree in how quickly they sharpen, or commit to compe…

arXiv cs.AI TIER_1 English(EN) · Hanze Li, Jinhao You, Yichen Guo, Kai Tang, Shuangyang Xie, Xiande Huang · 2026-06-02 04:00

Mitigating Hallucinations in Large Language Models Via Decoder Layer Skipping

arXiv:2606.00819v1 Announce Type: new Abstract: Large Language Models (LLMs) have achieved strong performance across diverse natural language tasks, yet their outputs often suffer from hallucinations -- content that is misaligned with factual information. In this work, we conduct…

arXiv cs.CL TIER_1 English(EN) · Mohit Singh Chauhan · 2026-06-01 14:11

DECK: A Consistency x Confidence Taxonomy of LLM Hallucinations

Existing hallucination taxonomies classify LLM errors by what is wrong with the output -- memorised misconceptions, reasoning failures, fluent fabrications. These taxonomies are useful for diagnosis but cannot answer a different question: which uncertainty scorer would have caugh…

arXiv cs.AI TIER_1 English(EN) · Jiaming Li, Jiacheng Zhang, Zequn Jie, Lin Ma, Guanbin Li · 2026-06-01 04:00

Cross-Modal Attention Calibration for LVLM Hallucination Mitigation

arXiv:2501.01926v3 Announce Type: replace-cross Abstract: Large vision-language models (LVLMs) have shown remarkable capabilities in visual-language understanding. Despite their success, LVLMs still suffer from generating hallucinations in complex generation tasks, leading to inc…

arXiv cs.AI TIER_1 English(EN) · Litian Liu, Reza Pourreza, Yubing Jian, Yao Qin, Roland Memisevic · 2026-06-01 04:00

From Out-of-Distribution Detection to Hallucination Detection: A Geometric View

arXiv:2602.07253v2 Announce Type: replace Abstract: Detecting hallucinations in large language models is a critical open problem with significant implications for safety and reliability. While existing hallucination detection methods achieve strong performance in question-answeri…

arXiv cs.AI TIER_1 English(EN) · Soorya Ram Shimgekar, Agam Goyal, Amruta Parulekar, Joshua Chen, Yian Wang, Navin Kumar, Hari Sundaram, Eshwar Chandrasekharan, Koustuv Saha · 2026-06-01 04:00

Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits

arXiv:2605.30913v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed in conversational settings where user tone ranges from polite to adversarial or toxic, yet less is known about whether toxic language in otherwise semantically equivalent prom…

arXiv cs.CL TIER_1 English(EN) · Shefayat E Shams Adib, Ahmed Alfey Sani, Ekramul Alam Esham, Ajwad Abrar, Ishmam Tashdeed, Md Taukir Azam Chowdhury · 2026-06-01 04:00

BenHalluEval: A Multi-Task Hallucination Evaluation Framework for Large Language Models on Bengali

arXiv:2605.31483v1 Announce Type: new Abstract: Despite Bengali being the sixth most spoken language in the world, no prior work has systematically evaluated hallucination in large language models (LLMs) for Bengali. We introduce BenHalluEval, a fine-grained hallucination evaluat…

arXiv cs.CL TIER_1 English(EN) · Haolin Deng, Xin Zou, Zhiwei Jin, Chen Chen, Haonan Lu, Xuming Hu · 2026-06-01 04:00

Learning from Fine-Grained Visual Discrepancies: Mitigating Multimodal Hallucinations via In-Context Visual Contrastive Optimization

arXiv:2605.31312v1 Announce Type: cross Abstract: Multimodal hallucination remains a persistent challenge for Vision-Language Models (VLMs). Standard textual Direct Preference Optimization (DPO) often fails to mitigate it due to a lack of explicit visual supervision. While existi…

arXiv cs.AI TIER_1 English(EN) · Yusheng He, Jizhe Zhou, Xia Du, Zheng Lin, Jun Luo, Jiancheng Lv · 2026-06-01 04:00

What Makes LVLMs Hallucinate Less? Unveiling the Architectural Factors Behind Hallucination Robustness

arXiv:2605.30911v1 Announce Type: cross Abstract: Hallucination remains one of the key challenges undermining the reliability of Large Vision-Language Models (LVLMs). But what makes an LVLM hallucinate less? Many existing efforts focus on improving internal components of the mode…

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-29 16:07

BenHalluEval: A Multi-Task Hallucination Evaluation Framework for Large Language Models on Bengali

Despite Bengali being the sixth most spoken language in the world, no prior work has systematically evaluated hallucination in large language models (LLMs) for Bengali. We introduce BenHalluEval, a fine-grained hallucination evaluation framework for Bengali covering four tasks: G…

arXiv cs.CL TIER_1 English(EN) · Md Taukir Azam Chowdhury · 2026-05-29 16:07

BenHalluEval: A Multi-Task Hallucination Evaluation Framework for Large Language Models on Bengali

Despite Bengali being the sixth most spoken language in the world, no prior work has systematically evaluated hallucination in large language models (LLMs) for Bengali. We introduce BenHalluEval, a fine-grained hallucination evaluation framework for Bengali covering four tasks: G…

arXiv cs.CL TIER_1 English(EN) · Xuming Hu · 2026-05-29 13:44

Learning from Fine-Grained Visual Discrepancies: Mitigating Multimodal Hallucinations via In-Context Visual Contrastive Optimization

Multimodal hallucination remains a persistent challenge for Vision-Language Models (VLMs). Standard textual Direct Preference Optimization (DPO) often fails to mitigate it due to a lack of explicit visual supervision. While existing works introduce visual preference DPO by contra…

arXiv cs.CL TIER_1 English(EN) · Chaodong Tong, Qi Zhang, Zhuojun Jiang, Lei Jiang, Yanbing Liu · 2026-05-29 04:00

HaluNet: Learning Hallucination Risk from Internal Signals in LLM Question Answering

arXiv:2512.24562v2 Announce Type: replace Abstract: Large language models (LLMs) achieve strong question answering (QA) performance but can produce fluent answers unsupported by available evidence. Existing hallucination detectors often rely on external verification, repeated sam…

arXiv cs.LG TIER_1 English(EN) · Eunbyeol Cho, Yunseung Lee, Mirae Kim, Jeewon Yang, Youngjun Kwak, Edward Choi · 2026-05-29 04:00

K-FinHallu: A Hallucination Detection Benchmark for Multi-Turn RAG in Korean Finance

arXiv:2605.29523v1 Announce Type: new Abstract: Large Language Models (LLMs) have advanced financial automation through Retrieval-Augmented Generation (RAG), yet hallucinations remain a critical barrier to deployment in high-stakes environments. Existing benchmarks focus on singl…

arXiv cs.LG TIER_1 English(EN) · Chia-Yi Hsu, Chia-Mu Yu, Chun-Ying Huang, Jun Sakuma · 2026-05-29 04:00

Harmless Yet Harmful: Neutral Prompting Attacks for Stealthy Hallucination Steering in Agent Skills

arXiv:2605.29354v1 Announce Type: cross Abstract: LLM-powered coding agents increasingly participate in software development workflows by generating code, selecting dependencies, and producing package installation commands. This creates a new software supply chain risk: when an a…

arXiv cs.AI TIER_1 English(EN) · Zhe Qian, Yanbiao Ma, Zhuohan Ouyang, Zhonghua Wang, Zhongxing Xu, Fei Luo, Xinyu Liu, Zongyuan Ge, Yike Guo, Jungong Han · 2026-05-29 04:00

Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models

arXiv:2604.10219v2 Announce Type: replace Abstract: Multimodal Large Reasoning Models (MLRMs) have achieved remarkable strides in visual reasoning through test time compute scaling, yet long chain reasoning remains prone to hallucinations. We identify a concerning phenomenon term…

arXiv cs.AI TIER_1 English(EN) · Soumyadeep Jana, Pulkit Mittal, Sanasam Ranbir Singh · 2026-05-29 04:00

Mitigating Hallucination in Vision-Language Models through Barrier-Regulated Adaptive Closed-form Steering

arXiv:2605.29881v1 Announce Type: cross Abstract: Large vision-language models (LVLMs) often hallucinate objects that are not present in the input image, largely because visual grounding weakens as decoding progresses. Existing inference-time mitigation methods modify logits or h…

arXiv cs.AI TIER_1 English(EN) · Shamanth Kuthpadi Seethakantha, Dung Ngoc Thai, Vara Prasad Gudi, Simran Tiwari, Rami Matar, Avijit Mitra, Wenlong Zhao, Wael Salloum, Andrew McCallum · 2026-05-29 04:00

Hallucination Detection-Guided Preference Optimization for Clinical Summarization

arXiv:2605.28910v1 Announce Type: cross Abstract: Large language models (LLMs) have shown promise on summarization tasks, but they often produce hallucinations, which are unsupported or incorrect statements that limit their reliability in specialized healthcare applications. We i…

arXiv cs.AI TIER_1 English(EN) · Diego Gosmar, Deborah A. Dahl · 2026-05-29 04:00

Hallucination Mitigation with Agentic AI, Nested Learning, and AI Sustainability via Semantic Caching

arXiv:2605.29055v1 Announce Type: new Abstract: Hallucination remains a major reliability barrier for production LLM systems, particularly in multi-agent pipelines where unsupported claims can propagate unchecked across stages. This paper adapts a HOPE-inspired Nested Learning ar…

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-29 00:00

Score-Control for Hallucination Reduction in Diffusion Models

Variance-Guided Score Modulation reduces hallucinations in diffusion models by controlling score function smoothness through Jacobian modulation while maintaining image quality.

arXiv cs.CL TIER_1 English(EN) · Saptarshi Sengupta, Suhang Wang · 2026-05-28 04:00

Can Hallucinations Be Useful? Solving Multi-Hop Questions With SLMs By Chaining System-I/II Reasoning

arXiv:2605.27596v1 Announce Type: new Abstract: Recently, there has been increased interest in Small Language Models (SLMs), which are fast, show good performance, and have lower hardware demands than large language models (LLMs). However, SLMs hallucinate more frequently than LL…

arXiv cs.CL TIER_1 English(EN) · Yuang Huang, Yafeng Zhang, Yu Zilan · 2026-05-28 04:00

Risk-aware Selective Prompting for Hallucination Mitigation in Large Vision-Language Models

arXiv:2605.28123v1 Announce Type: new Abstract: Prompt-based verification is widely used to mitigate hallucinations in large vision-language models (LVLMs), yet when it helps remains poorly understood. We systematically study verification prompting across two representative LVLM …

arXiv cs.AI TIER_1 English(EN) · Partho Ghose, Al Bashir, Prem Raj, Azlan Zahid · 2026-05-28 04:00

Hallucination Behavior in Multimodal LLMs Across Agricultural Image Interpretation and Generation Tasks

arXiv:2605.27595v1 Announce Type: cross Abstract: Large Language Models (LLMs) are being rapidly adopted in agricultural imaging applications, ranging from crop interpretation to synthetic field image generation. However, these models frequently exhibit hallucinations outputs tha…

arXiv cs.CL TIER_1 English(EN) · Jingwen Wu, Xijun Zhang, Ge Song · 2026-05-28 04:00

Rethinking Visual Neglect: Steering via Context-Preference for MLLM Hallucination Mitigation

arXiv:2605.27993v1 Announce Type: new Abstract: Object hallucination remains a primary obstacle to the reliable deployment of Multimodal Large Language Models (MLLMs). Current inference-time mitigation methods mainly assume hallucinations stem from visual neglect, steering models…

arXiv cs.CL TIER_1 English(EN) · Joan Vendrell Gallart, Solmaz Kia, Russell Bent, Michael Grosskopf · 2026-05-28 04:00

Chain-based Adaptive Reconfiguration Over Lattices for Hallucination Reduction

arXiv:2605.27706v1 Announce Type: new Abstract: We introduce CAROL (Chain-based Adaptive Reconfiguration Over Lattices), a probabilistic framework for test-time hallucination reduction in large language models. Rather than relying on token-level uncertainty, CAROL defines a seman…

arXiv cs.AI TIER_1 English(EN) · Mattia J. Villani, Pranav Deshpande, Akshay Seshadri, Romina Yalovetzky, Niraj Kumar · 2026-05-28 04:00

Entropy Distribution as a Fingerprint for Hallucinations in Generative Models

arXiv:2605.28264v1 Announce Type: new Abstract: Large Language Models (LLMs) often generate factually incorrect outputs, commonly termed hallucinations, that undermine trust and limit deployment in high-stakes settings. Existing hallucination detection methods typically require m…

arXiv cs.MA (Multiagent) TIER_1 English(EN) · Deborah A. Dahl · 2026-05-27 20:01

Hallucination Mitigation with Agentic AI, Nested Learning, and AI Sustainability via Semantic Caching

Hallucination remains a major reliability barrier for production LLM systems, particularly in multi-agent pipelines where unsupported claims can propagate unchecked across stages. This paper adapts a HOPE-inspired Nested Learning architecture with Continuum Memory Systems (CMS) a…

arXiv cs.AI TIER_1 English(EN) · Xinpeng Wang, William Cao, Andrew Gordon Wilson, Zhe Zeng · 2026-05-27 04:00

Automatic Layer Selection for Hallucination Detection

arXiv:2605.26366v1 Announce Type: new Abstract: Recent studies on hallucination detection have shown that hallucination-related signals are more strongly encoded in intermediate layers than in the final layer of large language models (LLMs). Although a growing body of work has so…

arXiv cs.AI TIER_1 English(EN) · Nishant P. Das, Piyush Srivastava · 2026-05-27 04:00

Innovation: An Almost Characterization of Hallucination

arXiv:2605.26808v1 Announce Type: cross Abstract: Hallucination is a central limitation of large language models (LLMs), and substantial effort has been devoted to understanding and mitigating it. Towards this, Kalai and Vempala (STOC 2024) introduced a probabilistic framework fo…

arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Michael Grosskopf · 2026-05-26 21:28

Chain-based Adaptive Reconfiguration Over Lattices for Hallucination Reduction

We introduce CAROL (Chain-based Adaptive Reconfiguration Over Lattices), a probabilistic framework for test-time hallucination reduction in large language models. Rather than relying on token-level uncertainty, CAROL defines a semantic uncertainty measure based on the consistency…

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-26 19:07

Hallucination Behavior in Multimodal LLMs Across Agricultural Image Interpretation and Generation Tasks

Large Language Models (LLMs) are being rapidly adopted in agricultural imaging applications, ranging from crop interpretation to synthetic field image generation. However, these models frequently exhibit hallucinations outputs that appear confident yet deviate from biological or …

arXiv cs.LG TIER_1 English(EN) · Piyush Srivastava · 2026-05-26 10:24

Innovation: An Almost Characterization of Hallucination

Hallucination is a central limitation of large language models (LLMs), and substantial effort has been devoted to understanding and mitigating it. Towards this, Kalai and Vempala (STOC 2024) introduced a probabilistic framework formalizing calibration and hallucination, and showe…

arXiv cs.CL TIER_1 English(EN) · Riasad Alvi, Nurul Labib Sayeedi, Md. Faiyaz Abdullah Sayeedi · 2026-05-26 04:00

MultiHaluDet: Multilingual Hallucination Detection via LLM Hidden State Probing

arXiv:2605.24919v1 Announce Type: new Abstract: Hallucinations in Large Language Models (LLMs) represent a critical barrier to their reliable deployment, a vulnerability heavily exacerbated in non-English and resource-constrained contexts. Existing detection approaches that rely …

arXiv cs.AI TIER_1 English(EN) · Yuanzhi Xu, Qian Gao, Jun Fan, Guohui Ding, Zhenyu Yang, Sixue Lin, Yuteng Xiao · 2026-05-26 04:00

Mitigating Object Hallucinations in Vision-Language Models through Region-Aware Attention Recalibration

arXiv:2605.24957v1 Announce Type: new Abstract: The generation of factually incorrect objects, commonly known as object hallucination, remains a persistent challenge in Large Vision-Language Models (LVLMs). Current approaches to address this issue - ranging from expensive data-dr…

arXiv cs.AI TIER_1 English(EN) · Hinduja Nirujan, Shreyas Patil, Abdallah Ayoub, Ahmad Abdel Latif, Gouri Ginde · 2026-05-26 04:00

Empirical Analysis and Detection of Hallucinations in LLM-Generated Bug Report Summaries

arXiv:2605.24137v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used to generate summaries of software bug reports, including sections such as Steps-to-Reproduce (S2R), Actual Behavior (AB), and Expected Behavior (EB). However, these models frequen…

arXiv cs.AI TIER_1 English(EN) · Quanjiang Li, Zhiming Liu, Wei Luo, Tingjin Luo, Chenping Hou · 2026-05-26 04:00

Correcting Visual Blur Induced by Attention Distraction to Reduce Hallucinations: Algorithm and Theory

arXiv:2605.24602v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) frequently suffer from object hallucinations, yet the visual perceptual mechanism underlying this failure remains poorly understood. In this work, we reveal that hallucinations are strongly…

arXiv cs.AI TIER_1 English(EN) · Yuhao Zhan, Tianyu Fan, Linxuan Huang, Zirui Guo, Chao Huang · 2026-05-26 04:00

Why Your Deep Research Agent Fails? On Hallucination Evaluation in Full Research Trajectory

arXiv:2601.22984v2 Announce Type: replace Abstract: Diagnosing failure patterns in Deep Research Agents (DRAs) remains a critical challenge. Existing benchmarks predominantly rely on end-to-end evaluation, obscuring intermediate hallucinations that accumulate throughout the resea…

arXiv cs.AI TIER_1 English(EN) · Shuqi Zhu, Yi Zhong, Ziyi Ye, Bangde Du, Yujia Zhou, Qingyao Ai, Yiqun Liu · 2026-05-26 04:00

How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study

arXiv:2605.16953v2 Announce Type: replace Abstract: While AI-generated hallucinations pose considerable risks, the underlying cognitive mechanisms by which humans can successfully recognize or be misled by these hallucinations remain unclear. To address this problem, this paper e…

arXiv cs.CL TIER_1 English(EN) · Musarrat Zeba, Abdullah Al Mamun, Kishoar Jahan Tithee, Debopom Sutradhar, Mohaimenul Azam Khan Raiaan, Saddam Mukta, Reem E. Mohamed, Md Rafiqul Islam, Yakub Sebastian, Mukhtar Hussain, Sami Azam · 2026-05-26 04:00

Mitigating Hallucinations in Healthcare LLMs with Granular Fact-Checking and Domain-Specific Adaptation

arXiv:2512.16189v3 Announce Type: replace Abstract: In healthcare, it is essential for any LLM-generated output to be reliable and accurate, particularly in cases involving decision-making and patient safety. However, the outputs are often unreliable in such critical areas due to…

arXiv cs.AI TIER_1 English(EN) · Yutong Xie, Zhenglin Hua, Ran Wang, Wing W. Y. Ng, Xizhao Wang, Yuheng Jia · 2026-05-22 04:00

Finding the Correct Visual Evidence Without Forgetting: Mitigating Hallucination in LVLMs via Inter-Layer Visual Attention Discrepancy

arXiv:2605.20965v1 Announce Type: cross Abstract: Large Vision-Language Models (LVLMs) have shown remarkable performance on a wide range of vision-language tasks. Despite this progress, they are still prone to hallucination, generating responses that are inconsistent with visual …

arXiv cs.AI TIER_1 English(EN) · Jianfei Li, Ines Rosellon-Inclan, Gitta Kutyniok, Jean-Luc Starck · 2026-05-22 04:00

CHEM: Estimating and Understanding Hallucinations in Deep Learning for Image Processing

arXiv:2512.09806v2 Announce Type: replace-cross Abstract: Deep learning-based methods have recently achieved significant success in image reconstruction problems. However, challenges have emerged, as these methods may generate unrealistic artifacts or hallucinations, which can in…

arXiv cs.AI TIER_1 English(EN) · Yuheng Jia · 2026-05-20 09:50

Finding the Correct Visual Evidence Without Forgetting: Mitigating Hallucination in LVLMs via Inter-Layer Visual Attention Discrepancy

Large Vision-Language Models (LVLMs) have shown remarkable performance on a wide range of vision-language tasks. Despite this progress, they are still prone to hallucination, generating responses that are inconsistent with visual content. In this work, we find that LVLMs tend to …

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-20 06:14

VIHD: Visual Intervention-based Hallucination Detection for Medical Visual Question Answering

While medical Multimodal Large Language Models (MLLMs) have shown promise in assisting diagnosis, they still frequently generate hallucinated responses that appear linguistically plausible but lack visual evidence. Such hallucinations pose risks to clinical decision-making and ne…

arXiv cs.CL TIER_1 English(EN) · Tej Sanibh Ranade · 2026-05-18 10:08

TRACE: Trajectory Correction from Cross-layer Evidence for Hallucination Reduction

Hallucination correction is not a one-direction problem. We show that intermediate layers are neither uniformly more truthful than final layers nor uniformly less trustworthy. Yet hallucination reduction is usually instantiated through one fixed intervention form: contrast one la…

arXiv cs.CL TIER_1 English(EN) · Lijie Wen · 2026-05-14 09:37

Do We Really Need External Tools to Mitigate Hallucinations? SIRA: Shared-Prefix Internal Reconstruction of Attribution

Large vision-language models (LVLMs) often hallucinate when language priors dominate weak or ambiguous visual evidence. Existing contrastive decoding methods mitigate this problem by comparing predictions from the original image with those from externally perturbed visual inputs,…

arXiv cs.CL TIER_1 English(EN) · Yubin Xia · 2026-05-14 06:44

When Answers Stray from Questions: Hallucination Detection via Question-Answer Orthogonal Decomposition

Hallucination detection in large language models (LLMs) requires balancing accu racy, efficiency, and robustness to distribution shift. Black-box consistency methods are effective but demand repeated inference; single-pass white-box probes are effi cient yet treat answer represen…

arXiv cs.AI TIER_1 English(EN) · Ali Baheri · 2026-05-13 16:48

Where Does Reasoning Break? Step-Level Hallucination Detection via Hidden-State Transport Geometry

Large language models hallucinate during multi-step reasoning, but most existing detectors operate at the trace level: they assign one confidence score to a full output, fail to localize the first error, and often require multiple sampled completions. We frame hallucination inste…

arXiv cs.AI TIER_1 English(EN) · Amine Trabelsi · 2026-05-12 17:32

CAAFC: Chronological Actionable Automated Fact-Checker for misinformation / non-factual hallucination detection and correction

With the vast amount of content uploaded every hour, along with the AI generated content that can include hallucinations, Automated Fact-Checking (AFC) has become increasingly vital, as it is infeasible for human fact-checkers to manually verify the sheer volume of information ge…

arXiv cs.AI TIER_1 English(EN) · Yi R. Fung · 2026-05-12 16:47

Scalable Token-Level Hallucination Detection in Large Language Models

Large language models (LLMs) have demonstrated remarkable capabilities, but they still frequently produce hallucinations. These hallucinations are difficult to detect in reasoning-intensive tasks, where the content appears coherent but contains errors like logical flaws and unrel…

arXiv cs.LG TIER_1 English(EN) · Ruixuan Wang · 2026-05-12 15:27

Instruction Lens Score: Your Instruction Contributes a Powerful Object Hallucination Detector for Multimodal Large Language Models

Multimodal large language models (MLLMs) have achieved remarkable progress, yet the object hallucination remains a critical challenge for reliable deployment. In this paper, we present an in-depth analysis of instruction token embeddings and reveal that they implicitly encode vis…

arXiv cs.AI TIER_1 English(EN) · Yian Yin · 2026-05-08 13:26

LLM hallucinations in the wild: Large-scale evidence from non-existent citations

Large language models (LLMs) are known to generate plausible but false information across a wide range of contexts, yet the real-world magnitude and consequences of this hallucination problem remain poorly understood. Here we leverage a uniquely verifiable object - scientific cit…

arXiv cs.CL TIER_1 English(EN) · Brandon C. Colelough, Davis Bartels, Dina Demner-Fushman · 2026-05-08 04:00

Quantifying Hallucinations in Language Language Models on Medical Textbooks

arXiv:2603.09986v2 Announce Type: replace Abstract: Hallucinations, the tendency for large language models to provide responses with factually incorrect and unsupported claims, is a serious problem within natural language processing for which we do not yet have an effective solut…

arXiv cs.CL TIER_1 English(EN) · Erik Nielsen, Elia Cunegatti, Marcus Vukojevic, Giovanni Iacca · 2026-05-08 04:00

Hallucination as an Anomaly: Dynamic Intervention via Probabilistic Circuits

arXiv:2605.05953v1 Announce Type: new Abstract: One of the most critical challenges in Large Language Models is their tendency to hallucinate, i.e., produce factually incorrect responses. Existing approaches show promising results in terms of hallucination correction, but still s…

arXiv cs.CL TIER_1 English(EN) · Giovanni Iacca · 2026-05-07 10:02

Hallucination as an Anomaly: Dynamic Intervention via Probabilistic Circuits

One of the most critical challenges in Large Language Models is their tendency to hallucinate, i.e., produce factually incorrect responses. Existing approaches show promising results in terms of hallucination correction, but still suffer from a main limitation: they apply correct…

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-07 10:02

Hallucination as an Anomaly: Dynamic Intervention via Probabilistic Circuits

One of the most critical challenges in Large Language Models is their tendency to hallucinate, i.e., produce factually incorrect responses. Existing approaches show promising results in terms of hallucination correction, but still suffer from a main limitation: they apply correct…

arXiv cs.CL TIER_1 English(EN) · Mina Gabriel · 2026-05-07 04:00

The First Token Knows: Single-Decode Confidence for Hallucination Detection

arXiv:2605.05166v1 Announce Type: new Abstract: Self-consistency detects hallucinations by generating multiple sampled answers to a question and measuring agreement, but this requires repeated decoding and can be sensitive to lexical variation. Semantic self-consistency improves …

arXiv cs.LG TIER_1 English(EN) · Dan Wilson, Mohamed Akrout · 2026-05-07 04:00

Low-Cost Black-Box Detection of LLM Hallucinations via Dynamical System Prediction

arXiv:2605.05134v1 Announce Type: new Abstract: Large Language Models (LLMs) frequently generate plausible but non-factual content, a phenomenon known as hallucination. While existing detection methods typically rely on computationally expensive sampling-based consistency checks …

arXiv cs.CL TIER_1 English(EN) · Gijs van Dijk · 2026-05-07 04:00

Detecting Hallucinations in Large Language Models via Internal Attention Divergence Signals

arXiv:2605.05025v1 Announce Type: new Abstract: We propose a lightweight and single-pass uncertainty quantification method for detecting hallucinations in Large Language Models. The method uses attention matrices to estimate uncertainty without requiring repeated sampling or exte…

arXiv cs.LG TIER_1 English(EN) · Linggang Kong, Lei Wu, Yunlong Zhang, Xiaofeng Zhong, Zhen Wang, Yongjie Wang, Yao Pan · 2026-05-07 04:00

CausalGaze: Unveiling Hallucinations via Counterfactual Graph Intervention in Large Language Models

arXiv:2604.11087v2 Announce Type: replace Abstract: Despite the groundbreaking advancements made by large language models (LLMs), hallucination remains a critical bottleneck for their deployment in high-stakes domains. Existing classification-based methods mainly rely on static a…

arXiv cs.CL TIER_1 English(EN) · Philip Wootaek Shin, Ajay Narayanan Sridhar, Sivani Devarapalli, Rui Zhang, Jack Sampson, Vijaykrishnan Narayanan · 2026-05-07 04:00

When Relations Break: Analyzing Relation Hallucination in Vision-Language Model Under Rotation and Noise

arXiv:2605.05045v1 Announce Type: cross Abstract: Vision-language models (VLMs) achieve strong multimodal performance but remain prone to relation hallucination, which requires accurate reasoning over inter-object interactions. We study the impact of visual perturbations, specifi…

arXiv cs.CL TIER_1 English(EN) · Mina Gabriel · 2026-05-06 17:34

The First Token Knows: Single-Decode Confidence for Hallucination Detection

Self-consistency detects hallucinations by generating multiple sampled answers to a question and measuring agreement, but this requires repeated decoding and can be sensitive to lexical variation. Semantic self-consistency improves this by clustering sampled answers by meaning us…

arXiv cs.LG TIER_1 English(EN) · Mohamed Akrout · 2026-05-06 17:07

Low-Cost Black-Box Detection of LLM Hallucinations via Dynamical System Prediction

Large Language Models (LLMs) frequently generate plausible but non-factual content, a phenomenon known as hallucination. While existing detection methods typically rely on computationally expensive sampling-based consistency checks or external knowledge retrieval, we propose a ne…

arXiv cs.CL TIER_1 English(EN) · Gijs van Dijk · 2026-05-06 15:21

Detecting Hallucinations in Large Language Models via Internal Attention Divergence Signals

We propose a lightweight and single-pass uncertainty quantification method for detecting hallucinations in Large Language Models. The method uses attention matrices to estimate uncertainty without requiring repeated sampling or external models. Specifically, we measure the Kullba…

arXiv cs.AI TIER_1 English(EN) · Ahmed Ibrahim · 2026-05-06 04:00

Neuro-Symbolic Agents for Hallucination-Free Requirements Reuse

arXiv:2605.01562v1 Announce Type: cross Abstract: The Object-Oriented Method for Requirements Authoring and Management (OOMRAM) is a requirements reuse framework that relies on exact identifier matching and rigid templates, limiting its ability to adapt specifications across dive…

arXiv cs.CL TIER_1 English(EN) · Severin Ye, Xiao Kong, Xiaopeng He, Guangsu Yan, Dongsuk Oh · 2026-05-06 04:00

CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification

arXiv:2605.03476v1 Announce Type: new Abstract: Discharge summaries require extracting critical information from lengthy electronic health records (EHRs), a process that is labor-intensive when performed manually. Large language models (LLMs) can improve generation efficiency; ho…

arXiv cs.CL TIER_1 English(EN) · Hao Mi, Qiang Sheng, Shaofei Wang, Beizhe Hu, Yifan Sun, Zhengjia Wang, Hengqi Zeng, Yang Li, Danding Wang, Juan Cao · 2026-05-06 04:00

Logical Consistency as a Bridge: Improving LLM Hallucination Detection via Label Constraint Modeling between Responses and Self-Judgments

arXiv:2605.03971v1 Announce Type: new Abstract: Large Language Models (LLMs) are prone to factual hallucinations, risking their reliability in real-world applications. Existing hallucination detectors mainly extract micro-level intrinsic patterns for uncertainty quantification or…

arXiv cs.CL TIER_1 English(EN) · Juan Cao · 2026-05-05 16:53

Logical Consistency as a Bridge: Improving LLM Hallucination Detection via Label Constraint Modeling between Responses and Self-Judgments

Large Language Models (LLMs) are prone to factual hallucinations, risking their reliability in real-world applications. Existing hallucination detectors mainly extract micro-level intrinsic patterns for uncertainty quantification or elicit macro-level self-judgments through verba…

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-05 08:05

CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification

Discharge summaries require extracting critical information from lengthy electronic health records (EHRs), a process that is labor-intensive when performed manually. Large language models (LLMs) can improve generation efficiency; however, they are prone to producing faithfulness …

arXiv cs.CL TIER_1 English(EN) · Dongsuk Oh · 2026-05-05 08:05

CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification

Discharge summaries require extracting critical information from lengthy electronic health records (EHRs), a process that is labor-intensive when performed manually. Large language models (LLMs) can improve generation efficiency; however, they are prone to producing faithfulness …

arXiv cs.CL TIER_1 English(EN) · Alexandra Bazarova, Aleksandr Yugay, Andrey Shulga, Alina Ermilova, Andrei Volodichev, Konstantin Polev, Julia Belikova, Rauf Parchiev, Dmitry Simakov, Maxim Savchenko, Andrey Savchenko, Serguei Barannikov, Alexey Zaytsev · 2026-05-05 04:00

Hallucination Detection in LLMs with Topological Divergence on Attention Graphs

arXiv:2504.10063v4 Announce Type: replace Abstract: Hallucination, i.e., generating factually incorrect content, remains a critical challenge for large language models (LLMs). We introduce TOHA, a TOpology-based HAllucination detector in the RAG setting, which leverages a topolog…

arXiv cs.CL TIER_1 English(EN) · Ahmed Cherif · 2026-05-05 04:00

HalluScan: A Systematic Benchmark for Detecting and Mitigating Hallucinations in Instruction-Following LLMs

arXiv:2605.02443v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse natural language processing tasks, yet they remain susceptible to hallucinations -- generating content that is factually incorrect, unfaithful to …

arXiv cs.CL TIER_1 English(EN) · Freja Thoresen, Dan Saattrup Smart · 2026-05-05 04:00

A multilingual hallucination benchmark: MultiWikiQHalluA

arXiv:2605.02504v1 Announce Type: new Abstract: Most hallucination evaluations focus on English, leaving it unclear whether findings transfer to lower-resource languages. We investigate faithfulness hallucinations, defined as model-generated content that is fluent and plausible b…

arXiv cs.CL TIER_1 English(EN) · Joseph Spracklen, Pedram Aghazadeh, Farinaz Koushanfar, Murtuza Jadliwala · 2026-05-05 04:00

LLM Ghostbusters: Surgical Hallucination Suppression via Adaptive Unlearning

arXiv:2605.01047v1 Announce Type: cross Abstract: Hallucinations, outputs that sound plausible but are factually incorrect, remain an open challenge for deployed LLMs. In code generation, models frequently hallucinate non-existent software packages, recommending imports and insta…

arXiv cs.LG TIER_1 English(EN) · Yee Zhing Liew, Andrew Huey Ping Tan, Anwar P. P Abdul Majeed · 2026-05-05 04:00

From Flat Facts to Sharp Hallucinations: Detecting Stubborn Errors via Gradient Sensitivity

arXiv:2605.00939v1 Announce Type: new Abstract: Traditional hallucination detection fails on "Stubborn Hallucinations" -- errors where LLMs are confidently wrong. We propose a geometric solution: Embedding-Perturbed Gradient Sensitivity (EPGS). We hypothesize that while robust fa…

arXiv cs.LG TIER_1 English(EN) · Jianxiong Zhang, Bing Guo, Yuming Jiang, Haobo Wang, Bo An, Sean Du · 2026-05-05 04:00

Harnessing Reasoning Trajectories for Hallucination Detection via Answer-agreement Representation Shaping

arXiv:2601.17467v2 Announce Type: replace Abstract: Large reasoning models (LRMs) often generate long, seemingly coherent reasoning traces yet still produce incorrect answers, making hallucination detection challenging. Although trajectories contain useful signals, directly using…

arXiv cs.CL TIER_1 English(EN) · Dan Saattrup Smart · 2026-05-04 11:57

A multilingual hallucination benchmark: MultiWikiQHalluA

Most hallucination evaluations focus on English, leaving it unclear whether findings transfer to lower-resource languages. We investigate faithfulness hallucinations, defined as model-generated content that is fluent and plausible but diverges from the provided input or is intern…

arXiv cs.CL TIER_1 English(EN) · Ahmed Cherif · 2026-05-04 10:43

HalluScan: A Systematic Benchmark for Detecting and Mitigating Hallucinations in Instruction-Following LLMs

Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse natural language processing tasks, yet they remain susceptible to hallucinations -- generating content that is factually incorrect, unfaithful to provided context, or misaligned with user instru…

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-03 07:58

Mitigating Multimodal LLMs Hallucinations via Relevance Propagation at Inference Time

Multimodal large language models (MLLMs) have revolutionized the landscape of AI, demonstrating impressive capabilities in tackling complex vision and audio-language tasks. However, a critical challenge remains: these models often suffer from hallucinations, generating outputs th…

arXiv cs.CL TIER_1 English(EN) · Guoshenghui Zhao, Weijie Zhao, Tan Yu · 2026-04-30 04:00

HIVE: Hidden-Evidence Verification for Hallucination Detection in Diffusion Large Language Models

arXiv:2604.26139v1 Announce Type: new Abstract: Diffusion large language models generate text through multi-step denoising, where hallucination signals may emerge throughout the trajectory rather than only in the final output. Existing detectors mainly rely on output uncertainty …

arXiv cs.CL TIER_1 English(EN) · Jiawei Li, Akshayaa Magesh, Venugopal V. Veeravalli · 2026-04-29 04:00

Principled Detection of Hallucinations in Large Language Models via Multiple Testing

arXiv:2508.18473v3 Announce Type: replace Abstract: While Large Language Models (LLMs) have emerged as powerful foundational models to solve a variety of tasks, they have also been shown to be prone to hallucinations, i.e., generating responses that sound confident but are actual…

arXiv cs.CL TIER_1 English(EN) · Tan Yu · 2026-04-28 21:57

HIVE: Hidden-Evidence Verification for Hallucination Detection in Diffusion Large Language Models

Diffusion large language models generate text through multi-step denoising, where hallucination signals may emerge throughout the trajectory rather than only in the final output. Existing detectors mainly rely on output uncertainty or coarse trace statistics, which often fail to …

arXiv cs.AI TIER_1 English(EN) · Federico A. Kamelhar · 2026-04-28 04:00

GSAR: Typed Grounding for Hallucination Detection and Recovery in Multi-Agent LLMs

arXiv:2604.23366v1 Announce Type: new Abstract: Autonomous multi-agent LLM systems are increasingly deployed to investigate operational incidents and produce structured diagnostic reports. Their trustworthiness hinges on whether each claim is grounded in observed evidence rather …

Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-27 12:23

Global Context or Local Detail? Adaptive Visual Grounding for Hallucination Mitigation

Vision-Language Models (VLMs) are frequently undermined by object hallucination--generating content that contradicts visual reality--due to an over-reliance on linguistic priors. We introduce Positive-and-Negative Decoding (PND), a training-free inference framework that intervene…

Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-25 16:20

GSAR: Typed Grounding for Hallucination Detection and Recovery in Multi-Agent LLMs

Autonomous multi-agent LLM systems are increasingly deployed to investigate operational incidents and produce structured diagnostic reports. Their trustworthiness hinges on whether each claim is grounded in observed evidence rather than model-internal inference. Existing grounded…

arXiv cs.CV TIER_1 English(EN) · Yue Jiang, Xue Jiang, Lihua Zhang, Zhiqiang Wang, Yuhang Lu, Peng Wang, Bo Han, Feng Zheng, Dingkang Yang · 2026-06-02 04:00

MM-Snowball: Evaluating and Mitigating Hallucination Snowballing in Multimodal Multi-Turn Dialogue

arXiv:2606.00622v1 Announce Type: new Abstract: Multimodal large language models (MLLMs) demonstrate remarkable visual understanding, yet their reliability in interactive settings is severely undermined by hallucination snowballing: a phenomenon where initial errors amplify acros…

arXiv cs.CV TIER_1 English(EN) · Mahesh Bhosale, Naresh Kumar Devulapally, Abdul Wasi, Chau Pham, Vishnu Suresh Lokhande, David Doermann · 2026-06-02 04:00

Score-Control for Hallucination Reduction in Diffusion Models

arXiv:2606.00377v1 Announce Type: new Abstract: Diffusion models have emerged as the backbone of modern generative AI, powering advances in vision, language, audio and other modalities. Despite their success, they suffer from hallucinations, implausible samples that lie outside t…

arXiv cs.CV TIER_1 English(EN) · Ting Chen, Geng Li, Guohao Chen, Yu Hu, Guan Huang, Mai Chen, Langsheng Lei, Jun Du · 2026-06-01 04:00

YARD: Y-Architecture Register Decoding for Efficient Hallucination Mitigation in Large Vision-Language Models

arXiv:2605.31429v1 Announce Type: new Abstract: Contrastive decoding (CD) seeks to mitigate hallucinations in Large Vision-Language Models (LVLMs) by contrasting the output distributions of a standard model and a visually degraded model. However, existing training-free CD methods…

arXiv cs.CV TIER_1 English(EN) · Zheng Qi, Chao Shang, Evangelia Spiliopoulou, Nikolaos Pappas · 2026-06-01 04:00

Capturing Gaze Shifts for Guidance: Cross-Modal Fusion Enhancement for VLM Hallucination Mitigation

arXiv:2510.22067v3 Announce Type: replace Abstract: Vision language models (VLMs) often generate hallucination, i.e., content that cannot be substantiated by either textual or visual inputs. Prior work primarily attributes this to over-reliance on linguistic prior knowledge rathe…

arXiv cs.CV TIER_1 English(EN) · Jun Du · 2026-05-29 15:23

YARD: Y-Architecture Register Decoding for Efficient Hallucination Mitigation in Large Vision-Language Models

Contrastive decoding (CD) seeks to mitigate hallucinations in Large Vision-Language Models (LVLMs) by contrasting the output distributions of a standard model and a visually degraded model. However, existing training-free CD methods suffer from sub-optimal degraded branches: comp…

arXiv cs.CV TIER_1 English(EN) · Shizhe Zhou, Bohan Jia, Kai Wu, Yan Shen, Tongyun Li, Yuyang Wu, Shaohui Lin · 2026-05-29 04:00

ReactBench: A Cause-Driven Benchmark for Multimodal Hallucination via Systematic Evaluation

arXiv:2605.29579v1 Announce Type: new Abstract: While multimodal large language models (MLLMs) have achieved rapid progress in vision-language understanding, they remain prone to multimodal hallucinations, producing responses that are inconsistent with the visual input. Existing …

arXiv cs.CV TIER_1 English(EN) · Jiacheng Zhang, Feng Liu, Chao Du, Tianyu Pang · 2026-05-29 04:00

SAVAA: Mitigating Hallucinations in LVLMs via Step-wise Adaptive Visual Attention Amplification

arXiv:2602.13600v2 Announce Type: replace Abstract: A line of recent training-free methods for mitigating hallucinations in large vision-language models (LVLMs) operates by amplifying attention to visual tokens during autoregressive generation within a single forward pass. We ref…

arXiv cs.CV TIER_1 English(EN) · Sanasam Ranbir Singh · 2026-05-28 13:07

Mitigating Hallucination in Vision-Language Models through Barrier-Regulated Adaptive Closed-form Steering

Large vision-language models (LVLMs) often hallucinate objects that are not present in the input image, largely because visual grounding weakens as decoding progresses. Existing inference-time mitigation methods modify logits or hidden states throughout generation, but they suffe…

arXiv cs.CV TIER_1 English(EN) · Hyunmin Cho, Donghoon Ahn, Susung Hong, Jee Eun Kim, Seungryong Kim, Kyong Hwan Jin · 2026-05-27 04:00

TAG: Tangential Amplifying Guidance for Hallucination-Resistant Sampling

arXiv:2510.04533v2 Announce Type: replace Abstract: Diffusion models achieve state-of-the-art image generation but often produce semantic inconsistencies, or hallucinations. Existing inference-time guidance methods rely on external signals or architectural modifications, adding c…

arXiv stat.ML TIER_1 English(EN) · Yedidia Agnimo, Anna Korba, Annabelle Blangero, Nicolas Chesneau, Karteek Alahari · 2026-05-27 04:00

Evaluating the Relevance of Uncertainty Estimators for LLM Hallucination

arXiv:2605.27016v1 Announce Type: cross Abstract: Large language models (LLMs) are prone to hallucinations, i.e., statements unsupported by the input or training data, hindering reliable deployment. In parallel, numerous uncertainty estimation (UE) methods have been proposed to q…

arXiv stat.ML TIER_1 English(EN) · Karteek Alahari · 2026-05-26 13:34

Evaluating the Relevance of Uncertainty Estimators for LLM Hallucination

Large language models (LLMs) are prone to hallucinations, i.e., statements unsupported by the input or training data, hindering reliable deployment. In parallel, numerous uncertainty estimation (UE) methods have been proposed to quantify model confidence and are often implicitly …

arXiv cs.CV TIER_1 English(EN) · Jiayi Chen, Benteng Ma, Zehui Liao, Winston Chong, Yasmeen George, Jianfei Cai · 2026-05-26 04:00

VIHD: Visual Intervention-based Hallucination Detection for Medical Visual Question Answering

arXiv:2605.20772v2 Announce Type: replace Abstract: While medical Multimodal Large Language Models (MLLMs) have shown promise in assisting diagnosis, they still frequently generate hallucinated responses that appear linguistically plausible but lack visual evidence. Such hallucin…

arXiv cs.CV TIER_1 English(EN) · Zhe Cheng, Wenyu Chen, Fode Zhang, Dehuan Shen · 2026-05-26 04:00

Mitigating Hallucinations in Large Vision-Language Models via Causal Route Gating

arXiv:2605.24024v1 Announce Type: new Abstract: Large vision-language models (LVLMs) often hallucinate content that is fluent yet unsupported by the image, limiting their reliability in real-world deployment. We show that a key failure mode arises from route competition: even whe…

arXiv cs.CV TIER_1 English(EN) · Shangpin Peng, Senqiao Yang, Li Jiang, Zhuotao Tian · 2026-05-25 04:00

Mitigating Object Hallucinations via Sentence-Level Early Intervention

arXiv:2507.12455v3 Announce Type: replace Abstract: Multimodal large language models (MLLMs) have revolutionized cross-modal understanding but continue to struggle with hallucinations - fabricated content contradicting visual inputs. Existing hallucination mitigation methods eith…

arXiv cs.CV TIER_1 English(EN) · Deepu Rajan · 2026-05-20 15:29

Reducing Object Hallucination in LVLMs via Emphasizing Image-negative Tokens

Object hallucination is a significant challenge that hinders the application of large vision-language models (LVLMs) in practice. We hypothesize that one possible origin of hallucination is the model's tendency to prioritize text generation over meaningful interaction with images…

arXiv stat.ML TIER_1 English(EN) · Emmy Liu, Varun Gangal, Michael Yu, Zhuofu Tao, Karan Singh, Sachin Kumar, Steven Y. Feng · 2026-05-20 04:00

HalluWorld: A Controlled Benchmark for Hallucination via Reference World Models

arXiv:2605.19341v1 Announce Type: cross Abstract: Hallucination remains a central failure mode of large language models, but existing benchmarks operationalize it inconsistently across summarization, question answering, retrieval-augmented generation, and agentic interaction. Thi…

arXiv stat.ML TIER_1 English(EN) · Steven Y. Feng · 2026-05-19 04:29

HalluWorld: A Controlled Benchmark for Hallucination via Reference World Models

Hallucination remains a central failure mode of large language models, but existing benchmarks operationalize it inconsistently across summarization, question answering, retrieval-augmented generation, and agentic interaction. This fragmentation makes it unclear whether a mitigat…

arXiv cs.CV TIER_1 English(EN) · Yu Wang · 2026-05-14 15:31

MHSA: A Lightweight Framework for Mitigating Hallucinations via Steered Attention in LVLMs

Large vision-language models (LVLMs) have achieved remarkable performance across diverse multimodal tasks, yet they continue to suffer from hallucinations, generating content that is inconsistent with the visual input. Prior work DHCP (Detecting Hallucinations by Cross-modal Atte…

arXiv cs.CV TIER_1 English(EN) · Aofan Liu · 2026-05-13 08:20

Dual-Pathway Circuits of Object Hallucination in Vision-Language Models

Vision-language models (VLMs) have demonstrated remarkable capabilities in bridging visual perception and natural language understanding, enabling a wide range of multimodal reasoning tasks. However, they often produce object hallucinations, describing content absent from the inp…

arXiv cs.CV TIER_1 English(EN) · Jing Li · 2026-05-11 14:16

Vocabulary Hijacking in LVLMs: Unveiling Critical Attention Heads by Excluding Inert Tokens to Mitigate Hallucination

Large Vision-Language Models (LVLMs) have achieved remarkable progress in multimodal tasks, yet their reliability is persistently undermined by hallucinations-generating text that contradicts visual input. Recent studies often attribute these errors to inadequate visual attention…

arXiv stat.ML TIER_1 English(EN) · Prabhat Kc, Rongping Zeng, Nirmal Soni, Aldo Badano · 2026-05-08 04:00

sFRC for assessing hallucinations in medical image restoration

arXiv:2603.04673v2 Announce Type: replace-cross Abstract: Deep learning (DL) methods are currently being explored to restore images from sparse-view-, limited-data-, and undersampled-based acquisitions in medical applications. Although outputs from DL may appear visually appealin…

arXiv cs.CV TIER_1 English(EN) · Vijaykrishnan Narayanan · 2026-05-06 15:41

When Relations Break: Analyzing Relation Hallucination in Vision-Language Model Under Rotation and Noise

Vision-language models (VLMs) achieve strong multimodal performance but remain prone to relation hallucination, which requires accurate reasoning over inter-object interactions. We study the impact of visual perturbations, specifically rotation and noise, and show that even mild …

arXiv cs.CV TIER_1 English(EN) · Itai Allouche, Joseph Keshet · 2026-05-05 04:00

Mitigating Multimodal LLMs Hallucinations via Relevance Propagation at Inference Time

arXiv:2605.01766v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) have revolutionized the landscape of AI, demonstrating impressive capabilities in tackling complex vision and audio-language tasks. However, a critical challenge remains: these models often…

arXiv cs.CV TIER_1 English(EN) · Jianfei Zhao, Feng Zhang, Xin Sun, Chong Feng, Zhixing Tan · 2026-04-30 04:00

Tell Model Where to Look: Mitigating Hallucinations in MLLMs by Vision-Guided Attention

arXiv:2511.20032v3 Announce Type: replace Abstract: Visual attention serves as the primary mechanism through which MLLMs interpret visual information; however, its limited localization capability often leads to hallucinations. We observe that although MLLMs can accurately extract…

arXiv cs.CV TIER_1 English(EN) · Chengsheng Zhang, Chenghao Sun, Xinyan Jiang, Wei Li, Xinmei Tian · 2026-04-29 04:00

Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models

arXiv:2604.25642v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) have achieved remarkable progress in visual-textual understanding, yet their reliability is critically undermined by hallucinations, i.e., the generation of factually incorrect or inconsistent re…

arXiv cs.CV TIER_1 English(EN) · Xinmei Tian · 2026-04-28 13:42

Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models

Large Vision-Language Models (LVLMs) have achieved remarkable progress in visual-textual understanding, yet their reliability is critically undermined by hallucinations, i.e., the generation of factually incorrect or inconsistent responses. While recent studies using steering vec…

arXiv cs.CV TIER_1 English(EN) · Jiawei Chen, Dingkang Yang, Tong Wu, Yue Jiang, Xiaolu Hou, Mingcheng Li, Shunli Wang, Dongling Xiao, Ke Li, Lihua Zhang · 2026-04-28 04:00

Detecting and Evaluating Medical Hallucinations in Large Vision Language Models

arXiv:2406.10185v2 Announce Type: replace Abstract: Large Vision Language Models (LVLMs) are increasingly integral to healthcare applications, including medical visual question answering and imaging report generation. While these models inherit the robust capabilities of foundati…

arXiv cs.CV TIER_1 English(EN) · Zhiyuan Jiang, Weihao Hong, Xinlei Guan, Tejaswi Dhandu, Miles Q. Li, Meng Xu, Kuan Huang, Umamaheswara Rao Tida, Bingyu Shen, Daehan Kwak, Boyang Li · 2026-04-28 04:00

LLM-as-Judge Framework for Evaluating Tone-Induced Hallucination in Vision-Language Models

arXiv:2604.18803v3 Announce Type: replace Abstract: Vision-Language Models (VLMs) are increasingly deployed in settings where reliable visual grounding carries operational consequences, yet their behavior under progressively coercive prompt phrasing remains undercharacterized. Ex…

arXiv cs.CV TIER_1 English(EN) · Yubo Jiang, Xin Yang, Abudukelimu Wuerkaixi, Zheming Yuan, Xuxin Cheng, Fengying Xie, Zhiguo Jiang, Cao Liu, Ke Zeng, Haopeng Zhang · 2026-04-28 04:00

Global Context or Local Detail? Adaptive Visual Grounding for Hallucination Mitigation

arXiv:2604.24396v1 Announce Type: new Abstract: Vision-Language Models (VLMs) are frequently undermined by object hallucination--generating content that contradicts visual reality--due to an over-reliance on linguistic priors. We introduce Positive-and-Negative Decoding (PND), a …

arXiv cs.CV TIER_1 English(EN) · JiYang Wang, Jiawei Chen, Mengqi Xiao, Yu Cheng, Yangfu Li, Zhaoxia Yin · 2026-04-28 04:00

DO-Bench: An Attributable Benchmark for Diagnosing Object Hallucination in Vision-Language Models

arXiv:2604.22822v1 Announce Type: new Abstract: Object level hallucination remains a central reliability challenge for vision language models (VLMs), particularly in binary object existence verification. Existing benchmarks emphasize aggregate accuracy but rarely disentangle whet…

arXiv cs.CV TIER_1 English(EN) · Haopeng Zhang · 2026-04-27 12:23

Global Context or Local Detail? Adaptive Visual Grounding for Hallucination Mitigation

Vision-Language Models (VLMs) are frequently undermined by object hallucination--generating content that contradicts visual reality--due to an over-reliance on linguistic priors. We introduce Positive-and-Negative Decoding (PND), a training-free inference framework that intervene…

Eugene Yan TIER_1 English(EN) · 2023-11-05 00:00

Out-of-Domain Finetuning to Bootstrap Hallucination Detection

How to use open-source, permissive-use data and collect less labeled samples for our tasks.

Medium — MLOps tag TIER_1 English(EN) · Anil Nayak · 2026-06-01 05:54

Beyond ChatGPT Wrappers: Architecting a Cost-Optimized, Zero-Hallucination AI Platform

<div class="medium-feed-item"><a href="https://medium.com/@nayakanil43603/beyond-chatgpt-wrappers-architecting-a-cost-optimized-zero-hallucination-ai-platform-c3a51fcc639c?source=rss------mlops-5"><img src="https://cdn-images-1.medium.com/max/1672/1*R…

Medium — Claude tag TIER_1 English(EN) · Akashshettyonline · 2026-05-25 16:41

1/10 Ways to Reduce Hallucinations in LLM Applications: Grounding with RAG

<div class="medium-feed-item"><a href="https://medium.com/@akashshettyonline22/1-10-ways-to-reduce-hallucinations-in-llm-applications-grounding-with-rag-051102434e6f?source=rss------claude-5"><img src="https://cdn-images-1.medium.com/max/1402/1*-HGVib…

Towards AI TIER_1 English(EN) · Faheem Ahmed · 2026-05-19 05:38

When AI Lies to You: Hallucination, Data Quality, and the Old School Fix

<figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*g6-CMPTHlab61BBb" /><figcaption>Photo by <a href="https://unsplash.com/@galka_nz?utm_source=medium&utm_medium=referral">Galina Nelyubova</a> on <a href="https://unsplash.com?utm_source=medium&utm_medium=r…

Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-06-05 18:22

"Hallucination is a Consequence of Space-Optimality: A Rate-Distortion Theorem for Membership Testing" This paper says that hallucinations are an inevitable con

"Hallucination is a Consequence of Space-Optimality: A Rate-Distortion Theorem for Membership Testing" This paper says that hallucinations are an inevitable consequence of the way that information is compressed in a lossy way to be stored in LLMs by comparing this to the math of …

LINKS arxiv.org/…/2602.00906

dev.to — LLM tag TIER_1 English(EN) · Muhammad Muzammil · 2026-06-04 11:08

LongTracer: Open-Source RAG Hallucination Detection Without LLM-as-a-Judge

Stop paying to evaluate your LLM outputs. Stop tolerating non-deterministic quality gates. LongTracer is the MIT-licensed Python library that catches RAG hallucinations at inference time — no API calls, no cloud dependency, no per-verification cost. The Hallucin…

dev.to — LLM tag TIER_1 English(EN) · AlterLab · 2026-05-28 11:06

Optimizing Chunking and Data Extraction for Zero-Hallucination RAG

<h2> TL;DR </h2> To achieve near-zero hallucination in RAG pipelines, you must extract web content as structured Markdown or JSON rather than raw HTML, and apply DOM-aware semantic chunking. This preserves contextual boundaries and prevents irrelevant boilerplate or bot-challe…

Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-05-26 10:04

New methodology paper: The Calculator Discipline. A four-class taxonomy of AI-assisted disclosure hallucinations, a pre-send filter that catches the mechanical

New methodology paper: The Calculator Discipline. A four-class taxonomy of AI-assisted disclosure hallucinations, a pre-send filter that catches the mechanical ones, and two real withdrawals from my own OpenBSD work — including the one Theo de Raadt asked the right question about…

LINKS stuart-thomas.com/…/calculator-discipline

dev.to — LLM tag TIER_1 English(EN) · Spicy · 2026-05-24 16:10

Why Does AI Make Things Up? A Dev's Guide to Hallucination

Quick version: LLMs don't look things up. They predict probable token sequences. When the model's training data is thin or absent on a topic, it doesn't stop — it keeps predicting. Fluently. Confidently. Incorrectly. If you've been building with LLMs for more than a few…

dev.to — LLM tag TIER_1 English(EN) · Gabriel Anhaia · 2026-05-24 09:36

Hallucination Detection at the Trace Layer: 4 Detectors You Can Ship Today

<ul> <li> Book: <a href="https://www.amazon.com/dp/B0GYLHMLMT" rel="noopener noreferrer">LLM Observability Pocket Guide: Picking the Right Tracing & Evals Tools for Your Team</a> </li> <li> Also by me: Thinking in Go (2-book series) …

dev.to — LLM tag TIER_1 English(EN) · CapeStart · 2026-05-21 07:08

A Guide to Preventing AI Hallucinations

<h2> What Are AI Hallucinations? </h2> Last quarter, something happened that made us rethink our entire approach to AI deployment. During a routine audit, we found out our customer support AI had confidently recommended a non-existent product feature to an enterprise client. T…

dev.to — LLM tag TIER_1 English(EN) · Mansi Somayajula · 2026-05-18 18:16

What Production ML Systems Taught Me About AI Hallucinations

Most discussions about AI hallucinations stay at the chatbot level. “ChatGPT made up a legal case.” “The AI invented a research paper.” “The model confidently gave the wrong answer.” Interesting? Sure. But after working on production ML systems…

dev.to — LLM tag TIER_1 English(EN) · Thousand Miles AI · 2026-05-17 06:06

The cracked mirror: why AI hallucination is structural, not a bug

There is a particular kind of error a language model makes that feels different from every other kind of software failure. A database returns the wrong row and you can trace the query. A null pointer crashes and the stack tells you where. But when a model confidently cites a p…

Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-05-16 13:23

LLM hallucinations in the wild: Large-scale evidence from non-existent citations Zhenyue Zhao, Yihe Wang, Toby Stuart, Mathijs De Vaan, Paul Ginsparg, Yian Yin

LLM hallucinations in the wild: Large-scale evidence from non-existent citations Zhenyue Zhao, Yihe Wang, Toby Stuart, Mathijs De Vaan, Paul Ginsparg, Yian Yin "Large language models (LLMs) are known to generate plausible but false information across a wide range of contexts, yet…

LINKS arxiv.org/…/2605.07723

dev.to — LLM tag TIER_1 Português(PT) · Marcelo Cabral Ghilardi · 2026-05-08 11:07

When AI Lies and Asks You to CALM DOWN: a straight talk about hallucinations

E aí, gurizada! Tudo tranquilo? Hoje eu quero trocar uma ideia com vocês sobre umas paradas que andei percebendo com as IAs, e que me motivaram a gravar um vídeo e até escrever um post lá no meu site, o marcelocabral.com.br. Sabe quando a inteligência artificial solta…

r/Anthropic TIER_1 English(EN) · /u/RouXanthica · 2026-05-19 22:10

Fixed the viral Opus 4.7 hallucination/reasoning error using neurosymbolic AI

submitted by <a href="https://www.reddit.com/user/RouXanthica"> /u/RouXanthica </a> <a href="https://www.reddit.com/gallery/1ti228y">[link]</a> <a href="https://www.reddit.com/r/Anthropic/comments/1ti23mi/fixed_the_viral_opus_47_halluci…

COVERAGE [153]