PulseAugur
EN
LIVE 07:52:36

Readers prefer human literary translations over AI versions, study finds

A new study published on arXiv explores the quality of AI-generated translations for literary texts. Researchers found that while AI translations are generally adequate, human translations are still preferred by readers for their immersiveness and clarity. The study involved avid readers comparing human and AI translations of novels, revealing that readers could not reliably distinguish between the two but favored the human versions when they believed them to be human-made. Automatic evaluation metrics, including LLM-as-a-judge approaches, failed to align with reader preferences, highlighting a gap in current assessment methods. The researchers have released a new dataset, LAIT, to facilitate reader-centered evaluation of literary AI translations. AI

IMPACT Highlights the limitations of current AI translation models for nuanced literary tasks and the need for reader-centric evaluation methods.

RANK_REASON Academic paper on AI translation quality. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

Readers prefer human literary translations over AI versions, study finds

COVERAGE [2]

  1. arXiv cs.CL TIER_1 English(EN) · Yves Ferstler, Adam Podoxin, Ty Brassington, Roman Grundkiewicz, Maite Taboada, Marzena Karpinska ·

    AI translation of literary texts is "fine", but readers still prefer human translations

    arXiv:2606.26040v1 Announce Type: new Abstract: AI translation of literary works is increasingly common. While the content may be rendered adequately, we do not know enough about how readers experience it in terms of immersiveness and literary effect, aspects poorly captured by a…

  2. arXiv cs.CL TIER_1 English(EN) · Marzena Karpinska ·

    AI translation of literary texts is "fine", but readers still prefer human translations

    AI translation of literary works is increasingly common. While the content may be rendered adequately, we do not know enough about how readers experience it in terms of immersiveness and literary effect, aspects poorly captured by automatic machine translation metrics or human ev…