English(EN) What Suppresses Nash Equilibrium Play in Large Language Models? Mechanistic Evidence and Causal Control

大型语言模型计算纳什均衡，但通过末层覆盖来抑制它

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-01 04:00

研究人员调查了大型语言模型（LLMs）在战略互动中为何偏离纳什均衡博弈。通过检查 Llama-3 和 Qwen2.5 等开源模型，他们发现虽然对手历史得到了很好的编码，但纳什行动本身的表征却很弱。模型末层的亲社会覆盖似乎抑制了纳什行动，导致了合作行为。有趣的是，思维链推理可以提高大于 70B 参数的大型模型的纳什博弈能力，但会降低小型模型的纳什博弈能力。 AI

影响研究了大型语言模型在战略博弈中的决策制定，可能影响智能体设计和对齐研究。

排序理由学术论文，详细介绍了在战略博弈中大型语言模型行为的机制发现和因果控制实验。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Paraskevas V. Lekeas, Giorgos Stamatopoulos · 2026-05-01 04:00

大型语言模型中的纳什均衡博弈受何抑制？机制证据与因果控制

arXiv:2604.27167v1 Announce Type: cross Abstract: LLM agents are known to deviate from Nash equilibria in strategic interactions, but nobody has looked inside the model to understand why, or asked whether the deviation can be reversed. We do both. Working with four open-source mo…

报道来源 [1]

大型语言模型中的纳什均衡博弈受何抑制？机制证据与因果控制

相关实体

相关话题