PulseAugur
实时 02:37:34
English(EN) GSAR: Typed Grounding for Hallucination Detection and Recovery in Multi-Agent LLMs

新研究通过新颖的检测和缓解技术解决大语言模型幻觉问题

2026年5月发布的多篇研究论文提出了检测和缓解大语言模型(LLMs)幻觉的新方法。这些方法包括内部重建技术(如SIRA)、问答分解(QAOD)和隐藏状态轨迹分析。其他方法侧重于token级检测、按时间顺序的事实核查以及使用指令嵌入作为检测器。一项研究还量化了大语言模型生成的科学论文中不存在引用的普遍问题,突显了问题的规模。 AI

影响 这些用于幻觉检测和缓解的各种方法可以显著提高大语言模型在各种应用中输出的可靠性和可信度。

排序理由 多篇研究论文发布在arXiv上,详细介绍了检测和缓解大语言模型幻觉的新方法。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 74 个来源。 我们如何撰写摘要 →

新研究通过新颖的检测和缓解技术解决大语言模型幻觉问题

报道来源 [74]

  1. arXiv cs.AI TIER_1 English(EN) · Jianfei Li, Ines Rosellon-Inclan, Gitta Kutyniok, Jean-Luc Starck ·

    CHEM: Estimating and Understanding Hallucinations in Deep Learning for Image Processing

    arXiv:2512.09806v2 Announce Type: replace-cross Abstract: Deep learning-based methods have recently achieved significant success in image reconstruction problems. However, challenges have emerged, as these methods may generate unrealistic artifacts or hallucinations, which can in…

  2. arXiv cs.AI TIER_1 English(EN) · Yutong Xie, Zhenglin Hua, Ran Wang, Wing W. Y. Ng, Xizhao Wang, Yuheng Jia ·

    Finding the Correct Visual Evidence Without Forgetting: Mitigating Hallucination in LVLMs via Inter-Layer Visual Attention Discrepancy

    arXiv:2605.20965v1 Announce Type: cross Abstract: Large Vision-Language Models (LVLMs) have shown remarkable performance on a wide range of vision-language tasks. Despite this progress, they are still prone to hallucination, generating responses that are inconsistent with visual …

  3. arXiv cs.AI TIER_1 English(EN) · Yuheng Jia ·

    Finding the Correct Visual Evidence Without Forgetting: Mitigating Hallucination in LVLMs via Inter-Layer Visual Attention Discrepancy

    Large Vision-Language Models (LVLMs) have shown remarkable performance on a wide range of vision-language tasks. Despite this progress, they are still prone to hallucination, generating responses that are inconsistent with visual content. In this work, we find that LVLMs tend to …

  4. Hugging Face Daily Papers TIER_1 English(EN) ·

    VIHD: Visual Intervention-based Hallucination Detection for Medical Visual Question Answering

    While medical Multimodal Large Language Models (MLLMs) have shown promise in assisting diagnosis, they still frequently generate hallucinated responses that appear linguistically plausible but lack visual evidence. Such hallucinations pose risks to clinical decision-making and ne…

  5. arXiv cs.CL TIER_1 English(EN) · Tej Sanibh Ranade ·

    TRACE: Trajectory Correction from Cross-layer Evidence for Hallucination Reduction

    Hallucination correction is not a one-direction problem. We show that intermediate layers are neither uniformly more truthful than final layers nor uniformly less trustworthy. Yet hallucination reduction is usually instantiated through one fixed intervention form: contrast one la…

  6. arXiv cs.CL TIER_1 English(EN) · Lijie Wen ·

    Do We Really Need External Tools to Mitigate Hallucinations? SIRA: Shared-Prefix Internal Reconstruction of Attribution

    Large vision-language models (LVLMs) often hallucinate when language priors dominate weak or ambiguous visual evidence. Existing contrastive decoding methods mitigate this problem by comparing predictions from the original image with those from externally perturbed visual inputs,…

  7. arXiv cs.CL TIER_1 English(EN) · Yubin Xia ·

    When Answers Stray from Questions: Hallucination Detection via Question-Answer Orthogonal Decomposition

    Hallucination detection in large language models (LLMs) requires balancing accu racy, efficiency, and robustness to distribution shift. Black-box consistency methods are effective but demand repeated inference; single-pass white-box probes are effi cient yet treat answer represen…

  8. arXiv cs.AI TIER_1 English(EN) · Ali Baheri ·

    Where Does Reasoning Break? Step-Level Hallucination Detection via Hidden-State Transport Geometry

    Large language models hallucinate during multi-step reasoning, but most existing detectors operate at the trace level: they assign one confidence score to a full output, fail to localize the first error, and often require multiple sampled completions. We frame hallucination inste…

  9. arXiv cs.AI TIER_1 English(EN) · Amine Trabelsi ·

    CAAFC: Chronological Actionable Automated Fact-Checker for misinformation / non-factual hallucination detection and correction

    With the vast amount of content uploaded every hour, along with the AI generated content that can include hallucinations, Automated Fact-Checking (AFC) has become increasingly vital, as it is infeasible for human fact-checkers to manually verify the sheer volume of information ge…

  10. arXiv cs.AI TIER_1 English(EN) · Yi R. Fung ·

    Scalable Token-Level Hallucination Detection in Large Language Models

    Large language models (LLMs) have demonstrated remarkable capabilities, but they still frequently produce hallucinations. These hallucinations are difficult to detect in reasoning-intensive tasks, where the content appears coherent but contains errors like logical flaws and unrel…

  11. arXiv cs.LG TIER_1 English(EN) · Ruixuan Wang ·

    Instruction Lens Score: Your Instruction Contributes a Powerful Object Hallucination Detector for Multimodal Large Language Models

    Multimodal large language models (MLLMs) have achieved remarkable progress, yet the object hallucination remains a critical challenge for reliable deployment. In this paper, we present an in-depth analysis of instruction token embeddings and reveal that they implicitly encode vis…

  12. arXiv cs.AI TIER_1 English(EN) · Yian Yin ·

    LLM hallucinations in the wild: Large-scale evidence from non-existent citations

    Large language models (LLMs) are known to generate plausible but false information across a wide range of contexts, yet the real-world magnitude and consequences of this hallucination problem remain poorly understood. Here we leverage a uniquely verifiable object - scientific cit…

  13. arXiv cs.CL TIER_1 English(EN) · Brandon C. Colelough, Davis Bartels, Dina Demner-Fushman ·

    Quantifying Hallucinations in Language Language Models on Medical Textbooks

    arXiv:2603.09986v2 Announce Type: replace Abstract: Hallucinations, the tendency for large language models to provide responses with factually incorrect and unsupported claims, is a serious problem within natural language processing for which we do not yet have an effective solut…

  14. arXiv cs.CL TIER_1 English(EN) · Erik Nielsen, Elia Cunegatti, Marcus Vukojevic, Giovanni Iacca ·

    Hallucination as an Anomaly: Dynamic Intervention via Probabilistic Circuits

    arXiv:2605.05953v1 Announce Type: new Abstract: One of the most critical challenges in Large Language Models is their tendency to hallucinate, i.e., produce factually incorrect responses. Existing approaches show promising results in terms of hallucination correction, but still s…

  15. arXiv cs.CL TIER_1 English(EN) · Giovanni Iacca ·

    Hallucination as an Anomaly: Dynamic Intervention via Probabilistic Circuits

    One of the most critical challenges in Large Language Models is their tendency to hallucinate, i.e., produce factually incorrect responses. Existing approaches show promising results in terms of hallucination correction, but still suffer from a main limitation: they apply correct…

  16. Hugging Face Daily Papers TIER_1 English(EN) ·

    Hallucination as an Anomaly: Dynamic Intervention via Probabilistic Circuits

    One of the most critical challenges in Large Language Models is their tendency to hallucinate, i.e., produce factually incorrect responses. Existing approaches show promising results in terms of hallucination correction, but still suffer from a main limitation: they apply correct…

  17. arXiv cs.LG TIER_1 English(EN) · Linggang Kong, Lei Wu, Yunlong Zhang, Xiaofeng Zhong, Zhen Wang, Yongjie Wang, Yao Pan ·

    CausalGaze: Unveiling Hallucinations via Counterfactual Graph Intervention in Large Language Models

    arXiv:2604.11087v2 Announce Type: replace Abstract: Despite the groundbreaking advancements made by large language models (LLMs), hallucination remains a critical bottleneck for their deployment in high-stakes domains. Existing classification-based methods mainly rely on static a…

  18. arXiv cs.CL TIER_1 English(EN) · Mina Gabriel ·

    The First Token Knows: Single-Decode Confidence for Hallucination Detection

    arXiv:2605.05166v1 Announce Type: new Abstract: Self-consistency detects hallucinations by generating multiple sampled answers to a question and measuring agreement, but this requires repeated decoding and can be sensitive to lexical variation. Semantic self-consistency improves …

  19. arXiv cs.CL TIER_1 English(EN) · Gijs van Dijk ·

    Detecting Hallucinations in Large Language Models via Internal Attention Divergence Signals

    arXiv:2605.05025v1 Announce Type: new Abstract: We propose a lightweight and single-pass uncertainty quantification method for detecting hallucinations in Large Language Models. The method uses attention matrices to estimate uncertainty without requiring repeated sampling or exte…

  20. arXiv cs.LG TIER_1 English(EN) · Dan Wilson, Mohamed Akrout ·

    Low-Cost Black-Box Detection of LLM Hallucinations via Dynamical System Prediction

    arXiv:2605.05134v1 Announce Type: new Abstract: Large Language Models (LLMs) frequently generate plausible but non-factual content, a phenomenon known as hallucination. While existing detection methods typically rely on computationally expensive sampling-based consistency checks …

  21. arXiv cs.CL TIER_1 English(EN) · Philip Wootaek Shin, Ajay Narayanan Sridhar, Sivani Devarapalli, Rui Zhang, Jack Sampson, Vijaykrishnan Narayanan ·

    When Relations Break: Analyzing Relation Hallucination in Vision-Language Model Under Rotation and Noise

    arXiv:2605.05045v1 Announce Type: cross Abstract: Vision-language models (VLMs) achieve strong multimodal performance but remain prone to relation hallucination, which requires accurate reasoning over inter-object interactions. We study the impact of visual perturbations, specifi…

  22. arXiv cs.CL TIER_1 English(EN) · Mina Gabriel ·

    The First Token Knows: Single-Decode Confidence for Hallucination Detection

    Self-consistency detects hallucinations by generating multiple sampled answers to a question and measuring agreement, but this requires repeated decoding and can be sensitive to lexical variation. Semantic self-consistency improves this by clustering sampled answers by meaning us…

  23. arXiv cs.LG TIER_1 English(EN) · Mohamed Akrout ·

    Low-Cost Black-Box Detection of LLM Hallucinations via Dynamical System Prediction

    Large Language Models (LLMs) frequently generate plausible but non-factual content, a phenomenon known as hallucination. While existing detection methods typically rely on computationally expensive sampling-based consistency checks or external knowledge retrieval, we propose a ne…

  24. arXiv cs.CL TIER_1 English(EN) · Gijs van Dijk ·

    Detecting Hallucinations in Large Language Models via Internal Attention Divergence Signals

    We propose a lightweight and single-pass uncertainty quantification method for detecting hallucinations in Large Language Models. The method uses attention matrices to estimate uncertainty without requiring repeated sampling or external models. Specifically, we measure the Kullba…

  25. arXiv cs.CL TIER_1 English(EN) · Hao Mi, Qiang Sheng, Shaofei Wang, Beizhe Hu, Yifan Sun, Zhengjia Wang, Hengqi Zeng, Yang Li, Danding Wang, Juan Cao ·

    Logical Consistency as a Bridge: Improving LLM Hallucination Detection via Label Constraint Modeling between Responses and Self-Judgments

    arXiv:2605.03971v1 Announce Type: new Abstract: Large Language Models (LLMs) are prone to factual hallucinations, risking their reliability in real-world applications. Existing hallucination detectors mainly extract micro-level intrinsic patterns for uncertainty quantification or…

  26. arXiv cs.AI TIER_1 English(EN) · Ahmed Ibrahim ·

    Neuro-Symbolic Agents for Hallucination-Free Requirements Reuse

    arXiv:2605.01562v1 Announce Type: cross Abstract: The Object-Oriented Method for Requirements Authoring and Management (OOMRAM) is a requirements reuse framework that relies on exact identifier matching and rigid templates, limiting its ability to adapt specifications across dive…

  27. arXiv cs.CL TIER_1 English(EN) · Severin Ye, Xiao Kong, Xiaopeng He, Guangsu Yan, Dongsuk Oh ·

    CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification

    arXiv:2605.03476v1 Announce Type: new Abstract: Discharge summaries require extracting critical information from lengthy electronic health records (EHRs), a process that is labor-intensive when performed manually. Large language models (LLMs) can improve generation efficiency; ho…

  28. arXiv cs.CL TIER_1 English(EN) · Juan Cao ·

    Logical Consistency as a Bridge: Improving LLM Hallucination Detection via Label Constraint Modeling between Responses and Self-Judgments

    Large Language Models (LLMs) are prone to factual hallucinations, risking their reliability in real-world applications. Existing hallucination detectors mainly extract micro-level intrinsic patterns for uncertainty quantification or elicit macro-level self-judgments through verba…

  29. arXiv cs.CL TIER_1 English(EN) · Dongsuk Oh ·

    CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification

    Discharge summaries require extracting critical information from lengthy electronic health records (EHRs), a process that is labor-intensive when performed manually. Large language models (LLMs) can improve generation efficiency; however, they are prone to producing faithfulness …

  30. Hugging Face Daily Papers TIER_1 English(EN) ·

    CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification

    Discharge summaries require extracting critical information from lengthy electronic health records (EHRs), a process that is labor-intensive when performed manually. Large language models (LLMs) can improve generation efficiency; however, they are prone to producing faithfulness …

  31. arXiv cs.LG TIER_1 English(EN) · Yee Zhing Liew, Andrew Huey Ping Tan, Anwar P. P Abdul Majeed ·

    From Flat Facts to Sharp Hallucinations: Detecting Stubborn Errors via Gradient Sensitivity

    arXiv:2605.00939v1 Announce Type: new Abstract: Traditional hallucination detection fails on "Stubborn Hallucinations" -- errors where LLMs are confidently wrong. We propose a geometric solution: Embedding-Perturbed Gradient Sensitivity (EPGS). We hypothesize that while robust fa…

  32. arXiv cs.LG TIER_1 English(EN) · Jianxiong Zhang, Bing Guo, Yuming Jiang, Haobo Wang, Bo An, Sean Du ·

    Harnessing Reasoning Trajectories for Hallucination Detection via Answer-agreement Representation Shaping

    arXiv:2601.17467v2 Announce Type: replace Abstract: Large reasoning models (LRMs) often generate long, seemingly coherent reasoning traces yet still produce incorrect answers, making hallucination detection challenging. Although trajectories contain useful signals, directly using…

  33. arXiv cs.CL TIER_1 English(EN) · Alexandra Bazarova, Aleksandr Yugay, Andrey Shulga, Alina Ermilova, Andrei Volodichev, Konstantin Polev, Julia Belikova, Rauf Parchiev, Dmitry Simakov, Maxim Savchenko, Andrey Savchenko, Serguei Barannikov, Alexey Zaytsev ·

    Hallucination Detection in LLMs with Topological Divergence on Attention Graphs

    arXiv:2504.10063v4 Announce Type: replace Abstract: Hallucination, i.e., generating factually incorrect content, remains a critical challenge for large language models (LLMs). We introduce TOHA, a TOpology-based HAllucination detector in the RAG setting, which leverages a topolog…

  34. arXiv cs.CL TIER_1 English(EN) · Joseph Spracklen, Pedram Aghazadeh, Farinaz Koushanfar, Murtuza Jadliwala ·

    LLM Ghostbusters: Surgical Hallucination Suppression via Adaptive Unlearning

    arXiv:2605.01047v1 Announce Type: cross Abstract: Hallucinations, outputs that sound plausible but are factually incorrect, remain an open challenge for deployed LLMs. In code generation, models frequently hallucinate non-existent software packages, recommending imports and insta…

  35. arXiv cs.CL TIER_1 English(EN) · Freja Thoresen, Dan Saattrup Smart ·

    A multilingual hallucination benchmark: MultiWikiQHalluA

    arXiv:2605.02504v1 Announce Type: new Abstract: Most hallucination evaluations focus on English, leaving it unclear whether findings transfer to lower-resource languages. We investigate faithfulness hallucinations, defined as model-generated content that is fluent and plausible b…

  36. arXiv cs.CL TIER_1 English(EN) · Ahmed Cherif ·

    HalluScan: A Systematic Benchmark for Detecting and Mitigating Hallucinations in Instruction-Following LLMs

    arXiv:2605.02443v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse natural language processing tasks, yet they remain susceptible to hallucinations -- generating content that is factually incorrect, unfaithful to …

  37. arXiv cs.CL TIER_1 English(EN) · Dan Saattrup Smart ·

    A multilingual hallucination benchmark: MultiWikiQHalluA

    Most hallucination evaluations focus on English, leaving it unclear whether findings transfer to lower-resource languages. We investigate faithfulness hallucinations, defined as model-generated content that is fluent and plausible but diverges from the provided input or is intern…

  38. arXiv cs.CL TIER_1 English(EN) · Ahmed Cherif ·

    HalluScan: A Systematic Benchmark for Detecting and Mitigating Hallucinations in Instruction-Following LLMs

    Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse natural language processing tasks, yet they remain susceptible to hallucinations -- generating content that is factually incorrect, unfaithful to provided context, or misaligned with user instru…

  39. Hugging Face Daily Papers TIER_1 English(EN) ·

    Mitigating Multimodal LLMs Hallucinations via Relevance Propagation at Inference Time

    Multimodal large language models (MLLMs) have revolutionized the landscape of AI, demonstrating impressive capabilities in tackling complex vision and audio-language tasks. However, a critical challenge remains: these models often suffer from hallucinations, generating outputs th…

  40. arXiv cs.CL TIER_1 English(EN) · Guoshenghui Zhao, Weijie Zhao, Tan Yu ·

    HIVE: Hidden-Evidence Verification for Hallucination Detection in Diffusion Large Language Models

    arXiv:2604.26139v1 Announce Type: new Abstract: Diffusion large language models generate text through multi-step denoising, where hallucination signals may emerge throughout the trajectory rather than only in the final output. Existing detectors mainly rely on output uncertainty …

  41. arXiv cs.CL TIER_1 English(EN) · Jiawei Li, Akshayaa Magesh, Venugopal V. Veeravalli ·

    Principled Detection of Hallucinations in Large Language Models via Multiple Testing

    arXiv:2508.18473v3 Announce Type: replace Abstract: While Large Language Models (LLMs) have emerged as powerful foundational models to solve a variety of tasks, they have also been shown to be prone to hallucinations, i.e., generating responses that sound confident but are actual…

  42. arXiv cs.CL TIER_1 English(EN) · Tan Yu ·

    HIVE: Hidden-Evidence Verification for Hallucination Detection in Diffusion Large Language Models

    Diffusion large language models generate text through multi-step denoising, where hallucination signals may emerge throughout the trajectory rather than only in the final output. Existing detectors mainly rely on output uncertainty or coarse trace statistics, which often fail to …

  43. arXiv cs.AI TIER_1 English(EN) · Federico A. Kamelhar ·

    GSAR: Typed Grounding for Hallucination Detection and Recovery in Multi-Agent LLMs

    arXiv:2604.23366v1 Announce Type: new Abstract: Autonomous multi-agent LLM systems are increasingly deployed to investigate operational incidents and produce structured diagnostic reports. Their trustworthiness hinges on whether each claim is grounded in observed evidence rather …

  44. Hugging Face Daily Papers TIER_1 English(EN) ·

    Global Context or Local Detail? Adaptive Visual Grounding for Hallucination Mitigation

    Vision-Language Models (VLMs) are frequently undermined by object hallucination--generating content that contradicts visual reality--due to an over-reliance on linguistic priors. We introduce Positive-and-Negative Decoding (PND), a training-free inference framework that intervene…

  45. Hugging Face Daily Papers TIER_1 English(EN) ·

    GSAR: Typed Grounding for Hallucination Detection and Recovery in Multi-Agent LLMs

    Autonomous multi-agent LLM systems are increasingly deployed to investigate operational incidents and produce structured diagnostic reports. Their trustworthiness hinges on whether each claim is grounded in observed evidence rather than model-internal inference. Existing grounded…

  46. arXiv cs.CV TIER_1 English(EN) · Shangpin Peng, Senqiao Yang, Li Jiang, Zhuotao Tian ·

    Mitigating Object Hallucinations via Sentence-Level Early Intervention

    arXiv:2507.12455v3 Announce Type: replace Abstract: Multimodal large language models (MLLMs) have revolutionized cross-modal understanding but continue to struggle with hallucinations - fabricated content contradicting visual inputs. Existing hallucination mitigation methods eith…

  47. arXiv cs.CV TIER_1 English(EN) · Deepu Rajan ·

    Reducing Object Hallucination in LVLMs via Emphasizing Image-negative Tokens

    Object hallucination is a significant challenge that hinders the application of large vision-language models (LVLMs) in practice. We hypothesize that one possible origin of hallucination is the model's tendency to prioritize text generation over meaningful interaction with images…

  48. arXiv stat.ML TIER_1 English(EN) · Emmy Liu, Varun Gangal, Michael Yu, Zhuofu Tao, Karan Singh, Sachin Kumar, Steven Y. Feng ·

    HalluWorld: A Controlled Benchmark for Hallucination via Reference World Models

    arXiv:2605.19341v1 Announce Type: cross Abstract: Hallucination remains a central failure mode of large language models, but existing benchmarks operationalize it inconsistently across summarization, question answering, retrieval-augmented generation, and agentic interaction. Thi…

  49. arXiv stat.ML TIER_1 English(EN) · Steven Y. Feng ·

    HalluWorld: A Controlled Benchmark for Hallucination via Reference World Models

    Hallucination remains a central failure mode of large language models, but existing benchmarks operationalize it inconsistently across summarization, question answering, retrieval-augmented generation, and agentic interaction. This fragmentation makes it unclear whether a mitigat…

  50. arXiv cs.CV TIER_1 English(EN) · Yu Wang ·

    MHSA: A Lightweight Framework for Mitigating Hallucinations via Steered Attention in LVLMs

    Large vision-language models (LVLMs) have achieved remarkable performance across diverse multimodal tasks, yet they continue to suffer from hallucinations, generating content that is inconsistent with the visual input. Prior work DHCP (Detecting Hallucinations by Cross-modal Atte…

  51. arXiv cs.CV TIER_1 English(EN) · Aofan Liu ·

    Dual-Pathway Circuits of Object Hallucination in Vision-Language Models

    Vision-language models (VLMs) have demonstrated remarkable capabilities in bridging visual perception and natural language understanding, enabling a wide range of multimodal reasoning tasks. However, they often produce object hallucinations, describing content absent from the inp…

  52. arXiv cs.CV TIER_1 English(EN) · Jing Li ·

    Vocabulary Hijacking in LVLMs: Unveiling Critical Attention Heads by Excluding Inert Tokens to Mitigate Hallucination

    Large Vision-Language Models (LVLMs) have achieved remarkable progress in multimodal tasks, yet their reliability is persistently undermined by hallucinations-generating text that contradicts visual input. Recent studies often attribute these errors to inadequate visual attention…

  53. arXiv stat.ML TIER_1 English(EN) · Prabhat Kc, Rongping Zeng, Nirmal Soni, Aldo Badano ·

    sFRC for assessing hallucinations in medical image restoration

    arXiv:2603.04673v2 Announce Type: replace-cross Abstract: Deep learning (DL) methods are currently being explored to restore images from sparse-view-, limited-data-, and undersampled-based acquisitions in medical applications. Although outputs from DL may appear visually appealin…

  54. arXiv cs.CV TIER_1 English(EN) · Vijaykrishnan Narayanan ·

    When Relations Break: Analyzing Relation Hallucination in Vision-Language Model Under Rotation and Noise

    Vision-language models (VLMs) achieve strong multimodal performance but remain prone to relation hallucination, which requires accurate reasoning over inter-object interactions. We study the impact of visual perturbations, specifically rotation and noise, and show that even mild …

  55. arXiv cs.CV TIER_1 English(EN) · Itai Allouche, Joseph Keshet ·

    Mitigating Multimodal LLMs Hallucinations via Relevance Propagation at Inference Time

    arXiv:2605.01766v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) have revolutionized the landscape of AI, demonstrating impressive capabilities in tackling complex vision and audio-language tasks. However, a critical challenge remains: these models often…

  56. arXiv cs.CV TIER_1 English(EN) · Jianfei Zhao, Feng Zhang, Xin Sun, Chong Feng, Zhixing Tan ·

    Tell Model Where to Look: Mitigating Hallucinations in MLLMs by Vision-Guided Attention

    arXiv:2511.20032v3 Announce Type: replace Abstract: Visual attention serves as the primary mechanism through which MLLMs interpret visual information; however, its limited localization capability often leads to hallucinations. We observe that although MLLMs can accurately extract…

  57. arXiv cs.CV TIER_1 English(EN) · Chengsheng Zhang, Chenghao Sun, Xinyan Jiang, Wei Li, Xinmei Tian ·

    Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models

    arXiv:2604.25642v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) have achieved remarkable progress in visual-textual understanding, yet their reliability is critically undermined by hallucinations, i.e., the generation of factually incorrect or inconsistent re…

  58. arXiv cs.CV TIER_1 English(EN) · Xinmei Tian ·

    Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models

    Large Vision-Language Models (LVLMs) have achieved remarkable progress in visual-textual understanding, yet their reliability is critically undermined by hallucinations, i.e., the generation of factually incorrect or inconsistent responses. While recent studies using steering vec…

  59. arXiv cs.CV TIER_1 English(EN) · Yubo Jiang, Xin Yang, Abudukelimu Wuerkaixi, Zheming Yuan, Xuxin Cheng, Fengying Xie, Zhiguo Jiang, Cao Liu, Ke Zeng, Haopeng Zhang ·

    Global Context or Local Detail? Adaptive Visual Grounding for Hallucination Mitigation

    arXiv:2604.24396v1 Announce Type: new Abstract: Vision-Language Models (VLMs) are frequently undermined by object hallucination--generating content that contradicts visual reality--due to an over-reliance on linguistic priors. We introduce Positive-and-Negative Decoding (PND), a …

  60. arXiv cs.CV TIER_1 English(EN) · JiYang Wang, Jiawei Chen, Mengqi Xiao, Yu Cheng, Yangfu Li, Zhaoxia Yin ·

    DO-Bench: An Attributable Benchmark for Diagnosing Object Hallucination in Vision-Language Models

    arXiv:2604.22822v1 Announce Type: new Abstract: Object level hallucination remains a central reliability challenge for vision language models (VLMs), particularly in binary object existence verification. Existing benchmarks emphasize aggregate accuracy but rarely disentangle whet…

  61. arXiv cs.CV TIER_1 English(EN) · Zhiyuan Jiang, Weihao Hong, Xinlei Guan, Tejaswi Dhandu, Miles Q. Li, Meng Xu, Kuan Huang, Umamaheswara Rao Tida, Bingyu Shen, Daehan Kwak, Boyang Li ·

    LLM-as-Judge Framework for Evaluating Tone-Induced Hallucination in Vision-Language Models

    arXiv:2604.18803v3 Announce Type: replace Abstract: Vision-Language Models (VLMs) are increasingly deployed in settings where reliable visual grounding carries operational consequences, yet their behavior under progressively coercive prompt phrasing remains undercharacterized. Ex…

  62. arXiv cs.CV TIER_1 English(EN) · Jiawei Chen, Dingkang Yang, Tong Wu, Yue Jiang, Xiaolu Hou, Mingcheng Li, Shunli Wang, Dongling Xiao, Ke Li, Lihua Zhang ·

    Detecting and Evaluating Medical Hallucinations in Large Vision Language Models

    arXiv:2406.10185v2 Announce Type: replace Abstract: Large Vision Language Models (LVLMs) are increasingly integral to healthcare applications, including medical visual question answering and imaging report generation. While these models inherit the robust capabilities of foundati…

  63. arXiv cs.CV TIER_1 English(EN) · Haopeng Zhang ·

    Global Context or Local Detail? Adaptive Visual Grounding for Hallucination Mitigation

    Vision-Language Models (VLMs) are frequently undermined by object hallucination--generating content that contradicts visual reality--due to an over-reliance on linguistic priors. We introduce Positive-and-Negative Decoding (PND), a training-free inference framework that intervene…

  64. Eugene Yan TIER_1 English(EN) ·

    Out-of-Domain Finetuning to Bootstrap Hallucination Detection

    How to use open-source, permissive-use data and collect less labeled samples for our tasks.

  65. Medium — Claude tag TIER_1 English(EN) · Akashshettyonline ·

    1/10 Ways to Reduce Hallucinations in LLM Applications: Grounding with RAG

    <div class="medium-feed-item"><p class="medium-feed-image"><a href="https://medium.com/@akashshettyonline22/1-10-ways-to-reduce-hallucinations-in-llm-applications-grounding-with-rag-051102434e6f?source=rss------claude-5"><img src="https://cdn-images-1.medium.com/max/1402/1*-HGVib…

  66. Towards AI TIER_1 English(EN) · Faheem Ahmed ·

    When AI Lies to You: Hallucination, Data Quality, and the Old School Fix

    <figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*g6-CMPTHlab61BBb" /><figcaption>Photo by <a href="https://unsplash.com/@galka_nz?utm_source=medium&amp;utm_medium=referral">Galina Nelyubova</a> on <a href="https://unsplash.com?utm_source=medium&amp;utm_medium=r…

  67. dev.to — LLM tag TIER_1 English(EN) · Spicy ·

    Why Does AI Make Things Up? A Dev's Guide to Hallucination

    <p>Quick version: LLMs don't look things up. They predict probable token sequences. When the model's training data is thin or absent on a topic, it doesn't stop — it keeps predicting. Fluently. Confidently. Incorrectly.</p> <p>If you've been building with LLMs for more than a few…

  68. dev.to — LLM tag TIER_1 English(EN) · Gabriel Anhaia ·

    Hallucination Detection at the Trace Layer: 4 Detectors You Can Ship Today

    <ul> <li> <strong>Book:</strong> <a href="https://www.amazon.com/dp/B0GYLHMLMT" rel="noopener noreferrer">LLM Observability Pocket Guide: Picking the Right Tracing &amp; Evals Tools for Your Team</a> </li> <li> <strong>Also by me:</strong> <em>Thinking in Go</em> (2-book series) …

  69. dev.to — LLM tag TIER_1 English(EN) · CapeStart ·

    A Guide to Preventing AI Hallucinations

    <h2> What Are AI Hallucinations? </h2> <p>Last quarter, something happened that made us rethink our entire approach to AI deployment. During a routine audit, we found out our customer support AI had confidently recommended a non-existent product feature to an enterprise client. T…

  70. dev.to — LLM tag TIER_1 English(EN) · Mansi Somayajula ·

    What Production ML Systems Taught Me About AI Hallucinations

    <p>Most discussions about AI hallucinations stay at the chatbot level.</p> <p>“ChatGPT made up a legal case.”<br /> “The AI invented a research paper.”<br /> “The model confidently gave the wrong answer.”</p> <p>Interesting? Sure.</p> <p>But after working on production ML systems…

  71. dev.to — LLM tag TIER_1 English(EN) · Thousand Miles AI ·

    The cracked mirror: why AI hallucination is structural, not a bug

    <p>There is a particular kind of error a language model makes that feels different from every other kind of software failure. A database returns the wrong row and you can trace the query. A null pointer crashes and the stack tells you where. But when a model confidently cites a p…

  72. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    LLM hallucinations in the wild: Large-scale evidence from non-existent citations Zhenyue Zhao, Yihe Wang, Toby Stuart, Mathijs De Vaan, Paul Ginsparg, Yian Yin

    LLM hallucinations in the wild: Large-scale evidence from non-existent citations Zhenyue Zhao, Yihe Wang, Toby Stuart, Mathijs De Vaan, Paul Ginsparg, Yian Yin "Large language models (LLMs) are known to generate plausible but false information across a wide range of contexts, yet…

  73. dev.to — LLM tag TIER_1 Português(PT) · Marcelo Cabral Ghilardi ·

    When AI Lies and Asks You to CALM DOWN: a straight talk about hallucinations

    <p> </p> <p>E aí, gurizada! Tudo tranquilo? Hoje eu quero trocar uma ideia com vocês sobre umas paradas que andei percebendo com as IAs, e que me motivaram a gravar um vídeo e até escrever um post lá no meu site, o marcelocabral.com.br. Sabe quando a inteligência artificial solta…

  74. r/Anthropic TIER_1 English(EN) · /u/RouXanthica ·

    Fixed the viral Opus 4.7 hallucination/reasoning error using neurosymbolic AI

    &#32; submitted by &#32; <a href="https://www.reddit.com/user/RouXanthica"> /u/RouXanthica </a> <br /> <span><a href="https://www.reddit.com/gallery/1ti228y">[link]</a></span> &#32; <span><a href="https://www.reddit.com/r/Anthropic/comments/1ti23mi/fixed_the_viral_opus_47_halluci…