Researchers have developed two neural machine translation systems for the low-resource Tangkhul-English language pair. The primary system, utilizing ByT5-large fine-tuned on over 38,000 parallel sentences, achieved a BLEU score of 39.97. A secondary mT5-small system was also trained for comparison. The study highlights challenges related to Tangkhul's orthography and the domain bias of the training data, suggesting future work in data diversification and domain adaptation. AI
IMPACT Advances machine translation capabilities for under-resourced languages, potentially enabling new communication and information access.
RANK_REASON The cluster contains an academic paper detailing a new research finding in machine translation.
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →