English(EN) Creativity Bias: How Machine Evaluation Struggles with Creativity in Literary Translations

AI评估工具未能识别文学翻译中的创造力

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-13 14:30

一项新的研究论文揭示，当前的自动评估指标和LLM-as-a-judge系统在准确评估文学翻译中的创造力方面存在困难。这些工具偏袒机器翻译的文本，并常常惩罚富有创造性、具有文化相关性的解决方案，尤其是在诗歌等体裁中。研究结果强调了现有评估方法的局限性，并指出了开发能够更好地识别细微差别和非标准翻译的新工具的必要性。 AI

影响强调了开发新AI评估工具的必要性，这些工具能够更好地理解文本中的创造性细微差别，尤其是在文学应用中。

排序理由该集群包含一篇学术论文，详细介绍了关于AI评估方法局限性的研究结果。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Ana Guerberof Arenas · 2026-05-13 14:30

Creativity Bias: How Machine Evaluation Struggles with Creativity in Literary Translations

This article investigates the performance of automatic evaluation metrics (AEMs) and LLM-as-a-judge evaluation on literary translation across multiple languages, genres, and translation modalities. The aim is to assess how well these tools align with professionals when evaluating…

报道来源 [1]

Creativity Bias: How Machine Evaluation Struggles with Creativity in Literary Translations

相关实体

相关话题