English(EN) A Systematic Comparison between Extractive Self-Explanations and Human Rationales in Text Classification

LLM自我解释与文本分类中人类解释的比较

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-22 04:00

一篇新的研究论文系统地比较了指令微调LLM生成的自我解释与文本分类任务中人类提供的解释。该研究评估了这些自我解释在情感分类、强迫劳动检测和声明验证方面的合理性和忠实性。研究结果表明，LLM自我解释与人类解释之间的一致性随文本长度和任务复杂度的变化而变化，尽管LLM确实能生成忠实的token级解释。 AI

影响这项研究为理解LLM生成解释的质量和忠实性提供了见解，这对于提高模型的可解释性和用户信任至关重要。

排序理由该集群包含一篇学术论文，详细介绍了LLM生成的解释与人类解释的系统性比较。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Stephanie Brandl, Oliver Eberle · 2026-05-22 04:00

A Systematic Comparison between Extractive Self-Explanations and Human Rationales in Text Classification

arXiv:2410.03296v4 Announce Type: replace-cross Abstract: Instruction-tuned LLMs are able to provide \textit{an} explanation about their output to users by generating self-explanations, without requiring the application of complex interpretability techniques. In this paper, we an…

报道来源 [1]

A Systematic Comparison between Extractive Self-Explanations and Human Rationales in Text Classification

相关实体

相关话题