English(EN) Is Human Annotation Necessary? Iterative MBR Distillation for Error Span Detection in Machine Translation

新AI方法绕过人工标注进行机器翻译错误检测

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-26 04:00

研究人员开发了一种无需人工标注即可检测机器翻译错误的新方法。这种方法称为迭代MBR蒸馏，它使用大型语言模型生成自己的训练数据，有效地创建伪标签。实验表明，使用这种自生成数据训练的模型在识别特定错误跨度方面，其性能优于在人工标注数据集上训练的模型。 AI

影响该方法可以显著降低机器翻译评估模型训练的成本并提高其一致性。

排序理由该集群包含一篇详细介绍机器翻译错误跨度检测新方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Boxuan Lyu, Haiyue Song, Zhi Qu · 2026-05-26 04:00

人工标注是否必要？迭代式MBR蒸馏用于机器翻译错误跨度检测

arXiv:2603.12983v3 Announce Type: replace-cross Abstract: Error Span Detection (ESD) is a crucial subtask in Machine Translation (MT) evaluation, aiming to identify the location and severity of translation errors. While fine-tuning models on human-annotated data improves ESD perf…