English(EN) Retell, Reward, Repeat: Reinforcement Learning for Narrative Theory-Informed Story Retelling

新的强化学习流程利用叙事理论增强大型语言模型的故事讲述能力

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-18 04:00

研究人员开发了一个名为 Retell, Reward, Repeat (RRR) 的新强化学习流程，旨在提高大型语言模型 (LLM) 的故事讲述能力。该方法整合了结构主义叙事学和标量叙事学，以训练 LLM 生成逻辑合理的故事事件，解决了当前 SFT 等训练后技术存在的不足。RRR 利用合成的 TimeTravel 数据集，并通过 d-RLAIF 从文本特征中提取训练信号，无需参考输出。评估表明，经 RRR 训练的 LLM 在逻辑性、合理性和完整性方面优于现有基线，提供了一种提高 LLM 故事讲述能力的经济高效的方法。 AI

影响这项研究提供了一种提高 LLM 叙事连贯性和逻辑性的新颖方法，有望增强创意写作和互动叙事应用。

排序理由该集群包含一篇详细介绍 LLM 新训练方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · David Y. Liu, Xanthe Muston, Dipankar Srirag, Aditya Joshi, Sebastian Sequoiah-Grayson · 2026-06-18 04:00

Retell, Reward, Repeat: Reinforcement Learning for Narrative Theory-Informed Story Retelling

arXiv:2601.17226v2 Announce Type: replace-cross Abstract: Counterfactual story retelling exposes LLM shortcomings in constrained narrative solution spaces where they can no longer rely on recalling memorised training data. Ground-truth-based post-training, such as SFT, fails to t…

报道来源 [1]

Retell, Reward, Repeat: Reinforcement Learning for Narrative Theory-Informed Story Retelling

相关实体

相关话题