English(EN) Disentangling Interaction and Bias Effects in Opinion Dynamics of Large Language Models

新的LLM评估框架应对分歧、说服力和偏见

作者 PulseAugur 编辑部 · [4 个来源] · 2026-05-25 04:00

研究人员正在开发新的方法来评估大型语言模型（LLM）的性能，超越简单的准确性指标。一种方法侧重于模型之间的分歧，将其作为解释复杂性的指标，尤其适用于分析公众评论等任务。另一个框架PMIYC自动化评估LLM的说服力有效性和易感性，揭示了Llama-3.3-70B和GPT-4o等模型之间的性能差异。此外，研究还探讨了使用LLM进行细粒度意见分析，发现它们可作为标注助手，但由于在关系结构方面存在困难，不能完全取代人类标注员。最后，提出了一个贝叶斯框架来解耦LLM意见动态中的交互和偏见效应，强调了微调如何改变模型的吸引子。 AI

影响新的评估方法和框架正在涌现，以更好地理解LLM的行为，解决分歧、说服力和偏见等问题，这对于开发更安全、更可靠的AI系统至关重要。

排序理由该集群包含多篇在arXiv上发表的学术论文，详细介绍了与LLM评估、说服力和意见动态相关的新研究方法和发现。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。我们如何撰写摘要 →

报道来源 [4]

arXiv cs.AI TIER_1 English(EN) · Aisha Najera, Alvin Moon, Vedant Srinivasan, Rajesh Veeraraghavan · 2026-05-29 04:00

当模型意见不合时：重新思考用于公众评论分析的大语言模型评估

arXiv:2605.29025v1 Announce Type: new Abstract: Federal agencies are deploying large language models (LLMs) to categorize public comment corpora, where the model's organization of the record shapes what policymakers see and which arguments register. Standard evaluation, anchored …
arXiv cs.AI TIER_1 English(EN) · Nimet Beyza Bozdag, Shuhaib Mehri, Gokhan Tur, Dilek Hakkani-T\"ur · 2026-05-28 04:00

说服我试试：评估大型语言模型说服有效性和易感性的框架

arXiv:2503.01829v4 Announce Type: replace-cross Abstract: Large Language Models (LLMs) demonstrate persuasive capabilities that rival human-level persuasion. While these capabilities can be used for social good, they also present risks of potential misuse. Beyond the concern of h…
arXiv cs.CL TIER_1 English(EN) · Gaurav Negi, MA Waskow, John McCrae, Omnia Zayed, Paul Buitelaar · 2026-05-28 04:00

大型语言模型作为细粒度意见分析的自动标注器和标注仲裁者

arXiv:2601.16800v3 Announce Type: replace Abstract: Fine-grained opinion analysis of text provides a detailed understanding of expressed sentiments, including the addressed entity. Although this level of detail is valuable, annotating opinions in datasets for model training requi…
arXiv cs.AI TIER_1 English(EN) · Vincent C. Brockers, David A. Ehrlich, Viola Priesemann · 2026-05-25 04:00

解耦大型语言模型意见动态中的交互和偏差效应

arXiv:2509.06858v2 Announce Type: replace-cross Abstract: Large Language Models are increasingly used to simulate human opinion dynamics, yet the effect of genuine interaction is often obscured by systematic biases. We develop a Bayesian framework to disentangle and quantify thre…

报道来源 [4]

当模型意见不合时：重新思考用于公众评论分析的大语言模型评估

说服我试试：评估大型语言模型说服有效性和易感性的框架

大型语言模型作为细粒度意见分析的自动标注器和标注仲裁者

解耦大型语言模型意见动态中的交互和偏差效应

相关实体

相关话题