English(EN) Better Literary Translation: A Multi-Aspect Data Generation and LLM Training Approach

新的大语言模型框架通过生成数据改进文学翻译

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-04 09:27

研究人员开发了一个新颖的框架，用于生成高质量数据来训练大语言模型进行文学翻译。该方法使用专门的大语言模型创建翻译参考和偏好数据，侧重于不同的质量维度。由此产生的 LitMT-8B 和 LitMT-14B 模型在基准测试中表现出竞争力，并且能够很好地泛化到新的文学作品。 AI

影响这项研究介绍了一种提高大语言模型在文学翻译等细微任务上性能的方法，有可能实现更复杂的跨文化交流工具。

排序理由该集群包含一篇学术论文，详细介绍了大语言模型针对特定任务的训练新方法。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Zhihao Lin, Ziqi Zhu, Hao Huang, Guanghui Wang, Peiyang He · 2026-06-05 04:00

更好的文学翻译：多方面数据生成与LLM训练方法

arXiv:2606.05924v1 Announce Type: new Abstract: Literary translation poses unique challenges due to the scarcity of high-quality annotated data and the need to balance expression fluency with literary effect. We present a multi-aspect iterative refinement framework that generates…
arXiv cs.CL TIER_1 English(EN) · Peiyang He · 2026-06-04 09:27

更好的文学翻译：多方面数据生成与LLM训练方法

Literary translation poses unique challenges due to the scarcity of high-quality annotated data and the need to balance expression fluency with literary effect. We present a multi-aspect iterative refinement framework that generates high-quality translation references and prefere…