PulseAugur
实时 15:02:45

LLM Hallucinations Linked to Commitment Failure, New Quantization Framework Introduced

A new paper proposes that LLM hallucinations stem not from a lack of knowledge, but from a failure in commitment, where models disperse probability mass across alternatives instead of concentrating on the correct answer. This phenomenon is observed to increase with model scale and is exacerbated by instruction tuning. Another paper introduces GAMMA, a framework for mixed-precision quantization that optimizes bit allocation for LLMs, significantly improving accuracy under memory constraints and outperforming existing methods on Llama and Qwen models. Additionally, a benchmark called SciEval has been developed to automatically evaluate K-12 science instructional materials, revealing that current mainstream LLMs perform poorly on this task without domain-specific fine-tuning. AI

影响 New research sheds light on LLM hallucination mechanisms and introduces novel methods for model optimization and evaluation, potentially improving reliability and efficiency.

排序理由 The cluster contains multiple academic papers detailing research findings on LLM behavior and optimization techniques.

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 5 个来源。 我们如何撰写摘要 →

LLM Hallucinations Linked to Commitment Failure, New Quantization Framework Introduced

报道来源 [5]

  1. arXiv cs.CL TIER_1 English(EN) · Jewon Yeom, Jaewon Sok, Heejun Kim, Seonghyeon Park, Jeongjae Park, Taesup Kim ·

    Hallucination as Commitment Failure: Larger LLMs Misfire Despite Knowing the Answer

    arXiv:2605.22007v1 Announce Type: new Abstract: Hallucination is often viewed as a direct consequence of missing knowledge: a model answers incorrectly when the correct answer is absent from its generation-time distribution, and correctly when it is present. We test this assumpti…

  2. arXiv cs.CL TIER_1 English(EN) · Taesup Kim ·

    Hallucination as Commitment Failure: Larger LLMs Misfire Despite Knowing the Answer

    Hallucination is often viewed as a direct consequence of missing knowledge: a model answers incorrectly when the correct answer is absent from its generation-time distribution, and correctly when it is present. We test this assumption by introducing a semantic notion of answer av…

  3. arXiv cs.AI TIER_1 English(EN) · Xu Han ·

    GAMMA: Global Bit Allocation for Mixed-Precision Models under Arbitrary Budgets

    Mixed-precision quantization improves the budget--accuracy trade-off for large language models (LLMs) by allocating more bits to sensitive modules. However, automating this allocation at LLM scale faces a unique combination of constraints: learnable approaches require quantizatio…

  4. Hugging Face Daily Papers TIER_1 English(EN) ·

    GAMMA: Global Bit Allocation for Mixed-Precision Models under Arbitrary Budgets

    Mixed-precision quantization improves the budget--accuracy trade-off for large language models (LLMs) by allocating more bits to sensitive modules. However, automating this allocation at LLM scale faces a unique combination of constraints: learnable approaches require quantizatio…

  5. arXiv cs.AI TIER_1 English(EN) · Jinjun Xiong ·

    SciEval: A Benchmark for Automatic Evaluation of K-12 Science Instructional Materials

    The need to evaluate instructional materials for K-12 science education has become increasingly important, as more educators use generative AI to create instructional materials. However, the review of instructional materials is time-consuming, expertise-intensive, and difficult t…