English(EN) HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

新的HER框架通过类人推理增强LLM角色扮演能力

作者 PulseAugur 编辑部 · [1 个来源] · 2026-04-30 04:00

研究人员推出HER框架，旨在通过模拟类人推理来增强大型语言模型（LLM）的角色扮演能力。该方法解决了当前模型在捕捉角色行为背后内心想法方面的局限性。HER采用双层思考过程，并结合了基于Qwen3-32B训练的增强推理数据和人类对齐的奖励模型。该框架在CoSER和Minimax Role-Play等基准测试中表现出显著的改进。 AI

影响通过模拟类人推理增强LLM的角色扮演能力，有望改进陪伴和数字游戏等应用。

排序理由这是一篇详细介绍LLM角色扮演新框架的研究论文。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Chengyu Du, Xintao Wang, Aili Chen, Weiyuan Li, Rui Xu, Junteng Liu, Zishan Huang, Rong Tian, Zijun Sun, Yuhao Li, Liheng Feng, Deming Ding, Pengyu Zhao, Yanghua Xiao · 2026-04-30 04:00

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

arXiv:2601.21459v4 Announce Type: replace-cross Abstract: LLM role-playing, i.e., using LLMs to simulate specific personas, has emerged as a key capability in various applications, such as companionship, content creation and digital games. While current models effectively capture…

报道来源 [1]

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

相关实体

相关话题