Enhancing Scientific Discourse: Machine Translation for the Scientific Domain
Researchers have developed new parallel and monolingual corpora specifically for scientific machine translation. These corpora focus on Spanish-English, French-English, and Portuguese-English language pairs, with specialized subsets for Cancer Research, Energy Research, Neuroscience, and Transportation. The created datasets were used to fine-tune general-purpose neural machine translation systems, and the paper details the corpus creation, fine-tuning methods, and evaluation results. AI
IMPACT Facilitates broader access to scientific research by improving translation quality for specialized terminology.