New study maps geopolitical bias in 11 large language models, finds a pro-China tilt when queried in Chinese

By the PulseAugur editorial team · [1 source] · 2026-06-16 04:00

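A research paper newly posted to arXiv analyzes geopolitical bias in 11 large language models, focusing on U.S.-China tensions. The study develops a novel quantitative instrument that adapts survey psychometric techniques to measure each model's stance. The method poses each proposition together with its reversed counterpart so that simple acquiescence cancels out, isolating the model's genuine position. The results indicate that developer origin, query language, and issue domain all significantly shape bias, and that every model, including those developed in the United States, leans pro-China when queried in Chinese.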
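The dual-framing idea can be illustrated with a minimal sketch. The summary does not give the paper's scoring formula, so everything here is an assumption for illustration: the rating scale from -2 to +2, the `PairedResponse` type, and the half-difference and half-sum formulas, which follow the usual treatment of reverse-keyed survey items and are not taken from the authors' method.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class PairedResponse:
    """Agreement ratings for one proposition and its reversed framing.

    Assumed scale (hypothetical): -2 = strongly disagree, +2 = strongly agree.
    """
    agree_original: float
    agree_reversed: float


def stance(r: PairedResponse) -> float:
    """Net position: half the difference between the two framings.

    Agreement that appears under both framings cancels out here.
    """
    return (r.agree_original - r.agree_reversed) / 2


def acquiescence(r: PairedResponse) -> float:
    """Framing-independent agreement: half the sum of the two framings."""
    return (r.agree_original + r.agree_reversed) / 2


def summarize(responses: list[PairedResponse]) -> dict[str, float]:
    """Average both quantities over a set of proposition pairs."""
    return {
        "stance": mean(stance(r) for r in responses),
        "acquiescence": mean(acquiescence(r) for r in responses),
    }


# A model that agrees with everything shows no net stance.
yes_sayer = PairedResponse(agree_original=2, agree_reversed=2)
# A model that agrees with the claim and rejects its reverse shows a full stance.
committed = PairedResponse(agree_original=2, agree_reversed=-2)
```

Under these assumptions, `yes_sayer` has a stance of 0 and an acquiescence of 2, while `committed` has a stance of 2 and an acquiescence of 0, which is the separation the reversed framing is meant to provide.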

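Impact: The study offers a novel, reproducible method for quantifying geopolitical bias in large language models, which could influence future model development and evaluation standards.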

Ranking rationale: This cluster consists of an academic paper on arXiv that details a new method for analyzing bias in large language models.


AI-generated summary · Google Gemini · from 1 source.

Sources [1]

arXiv cs.CL TIER_1 English(EN) · William Guey, Wei Zhang, Pierrick Bougault, Vitor D. de Moura, José O. Gomes · 2026-06-16 04:00

Mapping Geopolitical Bias in 11 Large Language Models: A Bilingual, Dual-Framing Analysis of U.S.-China Tensions

arXiv:2503.23688v2 Announce Type: replace Abstract: Large language models are how hundreds of millions of people now encounter contested political questions, raising a subtle measurement problem: a model that simply agrees with whatever it is told can masquerade as biased, contam…

