PulseAugur
实时 05:29:16
English(EN) Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models

新基准和语料库推动古希腊语到现代希腊语的翻译

研究人员开发了一个新的古希腊语到现代希腊语翻译基准和数据集,这项任务以前因缺乏平行数据而受到阻碍。AG-MG平行语料库包含超过132,000个句子对,是通过一个涉及网络抓取、先进对齐技术以及使用Gemini 2.5 Flash进行的大型语言模型错误纠正的新颖流程创建的。实验表明,微调Llama-Krikri-8B和M2M100等模型可显著提高翻译质量,最佳模型的BLEU得分达到13.16。 AI

影响 推动了低资源语言的翻译,可能为历史语言学和数字人文领域带来新应用。

排序理由 该集群描述了一篇介绍特定机器翻译任务新基准和数据集的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

新基准和语料库推动古希腊语到现代希腊语的翻译

报道来源 [2]

  1. arXiv cs.CL TIER_1 English(EN) · Maria Giagkou ·

    Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models

    Machine Translation (MT) for Ancient Greek (AG) to Modern Greek (MG) is a low-resource task, constrained by the lack of large-scale, high-quality parallel data. We address this gap by introducing the AG-MG Parallel Corpus, a new resource containing 132,481 sentence-aligned pairs …

  2. Hugging Face Daily Papers TIER_1 English(EN) ·

    Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models

    Machine Translation (MT) for Ancient Greek (AG) to Modern Greek (MG) is a low-resource task, constrained by the lack of large-scale, high-quality parallel data. We address this gap by introducing the AG-MG Parallel Corpus, a new resource containing 132,481 sentence-aligned pairs …