English(EN) The Collapse of Heterogeneity in Silicon Philosophers

AI模型显示出人为共识，导致哲学异质性崩溃

作者 PulseAugur 编辑部 · [1 个来源] · 2026-04-28 04:00

一篇新发表在arXiv上的研究论文调查了在哲学背景下使用大型语言模型（LLMs）替代人类判断的问题。研究发现，LLMs倾向于过度关联哲学立场，制造人为共识，并导致人类意见的自然异质性崩溃。这种现象在专有和开源模型中都观察到，部分原因是模型假设专家持有统一的观点。这些发现对AI对齐、评估方法以及使用AI系统复制人类决策的可靠性都有影响。 AI

影响强调了LLMs在复制人类判断方面潜在的偏见，影响AI对齐和评估。

排序理由学术论文分析LLM在特定任务上的行为。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Yuanming Shi (Adobe Inc.), Andreas Haupt (Stanford University) · 2026-04-28 04:00

硅哲学家异质性的崩溃

arXiv:2604.23575v1 Announce Type: cross Abstract: Silicon samples are increasingly used as a low-cost substitute for human panels and have been shown to reproduce aggregate human opinion with high fidelity. We show that, in the alignment-relevant domain of philosophy, silicon sam…

报道来源 [1]

硅哲学家异质性的崩溃

相关实体

相关话题