PulseAugur
实时 18:15:58
English(EN) Nsanku: Evaluating Zero-Shot Translation Performance of LLMs for Ghanaian Languages

大型语言模型在加纳语种上表现不佳,Nsanku基准测试揭示

一项名为Nsanku的新基准测试被开发出来,用于评估19个大型语言模型在43种加纳语种上的零样本翻译能力。研究发现,虽然Gemini 2.5 Flash在专有模型中表现最佳,Kimi-K2-Instruct-0905在开源模型中领先,但没有一个大型语言模型同时达到高绩效和高一致性。这表明当前模型尚未能可靠地大规模翻译这些低资源语言。 AI

影响 凸显了大型语言模型在低资源非洲语言翻译能力上的显著差距,需要进一步的研究和开发。

排序理由 这是一篇研究论文,提出了一个新的基准测试,用于评估大型语言模型在低资源语言上的翻译性能。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

大型语言模型在加纳语种上表现不佳,Nsanku基准测试揭示

报道来源 [2]

  1. arXiv cs.CL TIER_1 English(EN) · Stephen E. Moore, Mich-Seth Owusu, Akwasi Asare, Lawrence Adu Gyamfi, Paul Azunre, Joel Budu, Jonathan Asiamah, Elias Dzobo, Kelvin Newman, Edmund O. Benefo, Gerhardt Datsomor, Onesimus Addo Appiah, Ama Branoa Banful, Lucas Woedem Kpatah, Saani Mustapha D ·

    Nsanku: Evaluating Zero-Shot Translation Performance of LLMs for Ghanaian Languages

    arXiv:2605.04208v1 Announce Type: new Abstract: Large language models (LLMs) have demonstrated impressive multilingual capabilities for well-resourced languages, yet their performance on low-resource African languages remains poorly understood and largely unevaluated. This paper …

  2. arXiv cs.CL TIER_1 English(EN) · John Ayernor ·

    Nsanku: Evaluating Zero-Shot Translation Performance of LLMs for Ghanaian Languages

    Large language models (LLMs) have demonstrated impressive multilingual capabilities for well-resourced languages, yet their performance on low-resource African languages remains poorly understood and largely unevaluated. This paper presents Nsanku, a systematic benchmark that eva…