PulseAugur
实时 21:17:27

Aligning with Your Own Voice: Self-Corrected Preference Learning for Hallucination Mitigation in LVLMs

Researchers are developing new frameworks to address hallucinations in large language models (LLMs). One approach, termed "LLM Psychosis," categorizes severe reality-boundary failures and proposes a diagnostic scale to evaluate them, with findings from ChatGPT 5 documented. Another method, KARL, uses reinforcement learning to align abstention behavior with a model's knowledge boundary, aiming to reduce hallucinations without sacrificing accuracy. Additionally, PRISM offers a benchmark to disentangle hallucinations into knowledge, reasoning, and instruction-following errors, aiding in understanding their origins. For vision-language models, AVES-DPO focuses on self-correction to mitigate hallucinations using in-distribution data. AI

影响 New diagnostic tools and mitigation strategies for LLM hallucinations could improve the reliability and trustworthiness of deployed AI systems.

排序理由 Multiple academic papers introducing new frameworks and benchmarks for understanding and mitigating LLM hallucinations.

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 8 个来源。 我们如何撰写摘要 →

Aligning with Your Own Voice: Self-Corrected Preference Learning for Hallucination Mitigation in LVLMs

报道来源 [8]

  1. arXiv cs.AI TIER_1 English(EN) · Ashutosh Raj ·

    大语言模型精神病:大型语言模型现实边界故障的理论与诊断框架

    arXiv:2604.25934v1 Announce Type: cross Abstract: The deployment of large language models (LLMs) as interactive agents has exposed a category of behavioral failure that prevailing terminology, principally hallucination, fails to adequately characterize. This paper introduces LLM …

  2. arXiv cs.AI TIER_1 English(EN) · Wentao Hu, Yanbo Zhai, Xiaohui Hu, Mingkuan Zhao, Shanhong yu, Xue Liu, Kaidong Yu, Shuangyong Song, Xuelong Li ·

    唤醒沉睡的专家:反事实路由以减轻 MoE 幻觉

    arXiv:2604.14246v2 Announce Type: replace-cross Abstract: Sparse Mixture-of-Experts (MoE) models have achieved remarkable scalability, yet they remain vulnerable to hallucinations, particularly when processing long-tail knowledge. We identify that this fragility stems from static…

  3. arXiv cs.CL TIER_1 English(EN) · Cheng Gao, Cheng Huang, Kangyang Luo, Ziqing Qiao, Shuzheng Si, Huimin Chen, Chaojun Xiao, Maosong Sun ·

    KARL:通过知识边界感知强化学习缓解 LLM 中的幻觉

    arXiv:2604.22779v1 Announce Type: cross Abstract: Enabling large language models (LLMs) to appropriately abstain from answering questions beyond their knowledge is crucial for mitigating hallucinations. While existing reinforcement learning methods foster autonomous abstention, t…

  4. arXiv cs.CL TIER_1 English(EN) · Yuhe Wu, Guangyu Wang, Yuran Chen, Jiatong Zhang, Yutong Zhang, Yujie Chen, Jiaming Shang, Guang Zhang, Zhuang Liu ·

    PRISM:探究大型语言模型幻觉中的推理、指令和来源记忆

    arXiv:2604.16909v2 Announce Type: replace Abstract: As large language models (LLMs) evolve from conversational assistants into agents capable of handling complex tasks, they are increasingly deployed in high-risk domains. However, existing benchmarks largely rely on mixed queries…

  5. arXiv cs.AI TIER_1 English(EN) · Byeonggeuk Lim, JungMin Yun, Junehyoung Kwon, Kyeonghyun Kim, YoungBin Kim ·

    用你自己的声音对齐:用于 LVLM 中幻觉缓解的自校正偏好学习

    arXiv:2604.24395v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) frequently suffer from hallucinations. Existing preference learning-based approaches largely rely on proprietary models to construct preference datasets. We identify that this reliance introduces…

  6. arXiv cs.AI TIER_1 English(EN) · YoungBin Kim ·

    与您自己的声音对齐:用于 LVLM 中幻觉缓解的自纠正偏好学习

    Large Vision-Language Models (LVLMs) frequently suffer from hallucinations. Existing preference learning-based approaches largely rely on proprietary models to construct preference datasets. We identify that this reliance introduces a distributional mismatch between the proprieta…

  7. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    KARL:RL框架在不损失准确性的情况下减少LLM幻觉 KARL推出了一种强化学习框架,可动态估算LLM的知识

    KARL: RL Framework Cuts LLM Hallucinations Without Accuracy Loss KARL introduces a reinforcement learning framework that dynamically estimates an LLM's knowledge boundary to reward abstention only when appropriate, achieving a superior accuracy-hallucination trade- https:// genti…

  8. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    日本在羽田机场部署Unitree G1机器人以应对劳动力短缺 日本正在东京羽田机场测试Unitree G1和更高版本的人形机器人以解决劳动力短缺问题

    Japan Deploys Unitree G1 Robots at Haneda Airport Amid Labor Shortage Japan is testing Unitree G1 and taller humanoid robots at Tokyo Haneda Airport to tackle its labor shortage crisis, marking a real-world deployment of AI-driven robotics. https:// gentic.news/article/japan-depl…