English(EN) Faithful or Fabricated? A Causal Framework for Rationalization Bias in LLM Judges

新框架揭示LLM裁判存在合理化偏见

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-26 04:00

研究人员开发了一个因果框架，用于分析大型语言模型（LLM）在充当文本评估裁判时的合理化偏见。该研究引入了新的指标和线索干预措施，以测试当非证据性线索改变时，LLM裁判是否保持一致。研究结果表明存在显著的线索锚定合理化，但PROOF-BEFORE-PREFERENCE提示策略显著提高了线索不变性。 AI

影响强调了LLM评估者中潜在的偏见，表明需要改进提示策略以确保公平和一致的AI驱动评估。

排序理由学术论文发表在arXiv上，详细介绍了分析LLM偏见的新框架。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Riya Tapwal, Abhishek Kumar, Carsten Maple · 2026-05-26 04:00

Faithful or Fabricated? A Causal Framework for Rationalization Bias in LLM Judges

arXiv:2605.23970v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used as automatic judges for summarization and dialogue evaluation. Prior work has documented biases such as position, verbosity, and style preferences, but largely focuses on outcomes, …

报道来源 [1]

Faithful or Fabricated? A Causal Framework for Rationalization Bias in LLM Judges

相关实体

相关话题