English(EN) Simulating Hate Speech Cascades with Multi-LLM Agents: Empirical Grounding, Modeling Fidelity, and Intervention Strategies

LLM代理在Bluesky上模拟仇恨言论级联

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-18 04:00

研究人员开发了一个多代理大型语言模型（LLM）系统，用于模拟在线平台上的仇恨言论传播，旨在改进审核策略。该研究分析了来自Bluesky的三个仇恨级联和一个良性级联，发现仇恨级联表现出高比例的敌意转发者和星型扩散拓扑结构。LLM模拟器成功地重现了观察到的立场单一文化和毒性-参与同质性，其中代理异质性被确定为模拟保真度的关键因素。针对密集网络中的放大器显示出减少仇恨内容传播的潜力。 AI

影响这项研究可能有助于开发更有效的AI驱动的审核工具，以打击在线仇恨言论。

排序理由该集群包含一篇学术论文，详细介绍了一种研究在线现象的新模拟方法。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

Bluesky
LLM

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Fan Huang · 2026-06-18 04:00

Simulating Hate Speech Cascades with Multi-LLM Agents: Empirical Grounding, Modeling Fidelity, and Intervention Strategies

arXiv:2606.18264v1 Announce Type: cross Abstract: Faithful modeling of hateful content propagation on online platforms remains an open problem for moderation research. Classical cascade models that do not explicitly represent the profile, community, and content factors associated…

报道来源 [1]

Simulating Hate Speech Cascades with Multi-LLM Agents: Empirical Grounding, Modeling Fidelity, and Intervention Strategies

相关实体

相关话题