A new research paper explores whether Large Language Models (LLMs) truly align with human decision-making mechanisms when faced with risk, using the St. Petersburg game as a testbed. While many LLMs produce human-like finite bids in the original game, this outcome-level resemblance often hides differing underlying reasoning processes. Controlled variants of the game reveal that LLMs frequently shift to conditionally rational behavior rather than maintaining human-consistent mechanisms, even after instruction tuning. AI
影响 Highlights the need for deeper evaluation of LLM decision-making beyond surface-level outcomes to ensure true alignment.
排序理由 Academic paper analyzing LLM behavior on a specific task. [lever_c_demoted from research: ic=1 ai=1.0]
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →