English(EN) Identifying High-Confidence Social Biases in LLMs for Trustworthy Conversational Tutoring Agents

大型语言模型在辅导角色中表现出高置信度的社会偏见

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-02 04:00

研究人员在用于对话式辅导的大型语言模型（LLMs）中识别出了显著的社会偏见。这些模型在评估时常常表现出高置信度，即使其评估存在偏见，这也可能对提供给学生的反馈产生负面影响。研究发现，与标准基准测试相比，在自然对话式辅导中检测这些偏见更加困难，而当前的大型语言模型对其不正确的判断过于自信。 AI

影响强调了教育领域中过于自信、存在偏见的AI所带来的风险，需要更好的偏见检测和缓解策略来构建值得信赖的AI导师。

排序理由这是一篇详细介绍大型语言模型偏见研究结果的研究论文。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Aitor Arronte Alvarez, Naiyi Xie Fincham · 2026-06-02 04:00

Identifying High-Confidence Social Biases in LLMs for Trustworthy Conversational Tutoring Agents

arXiv:2606.01584v1 Announce Type: cross Abstract: Conversational tutoring agents have been shown to improve learning engagement and student outcomes, and large language models (LLMs) are increasingly used in these systems to provide scalable, personalized feedback. However, LLMs …