English(EN) Are LLMs Bad at Moral Reasoning?

新研究表明大型语言模型展现出更强的道德推理能力

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-11 04:00

一篇新论文认为，大型语言模型（LLMs）比之前认为的更擅长道德推理。该研究重新评估了MoReBench数据集，表明当大型语言模型被要求为道德案例生成评分标准时，其输出比先前的评估更准确、更乐观。这种方法突显了道德问题的巨大维度，并表明大型语言模型比早期研究得出的结论拥有更强的道德能力。 AI

影响表明大型语言模型可能更适合在复杂环境中安全部署，有可能加速其集成到敏感应用中。

排序理由该集群包含一篇评估大型语言模型能力的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Menghang Zhu, Seth Lazar · 2026-06-11 04:00

Are LLMs Bad at Moral Reasoning?

arXiv:2606.11635v1 Announce Type: cross Abstract: For highly capable AI systems to operate safely in dynamic, open-ended environments, they must be able to identify, understand, and respond to moral reasons for action, and constrain their behaviour accordingly. A growing body of …