Researchers have developed a new framework to evaluate large language models (LLMs) on literary translation, assessing both comprehension and creativity. Their paired-task approach uses human annotations and automatic scoring of creative units to benchmark 23 models. The study found that even models with strong comprehension struggled with creative translation, often producing literal or contextually inappropriate renderings, especially between distant language pairs like English and Chinese. While creativity-focused prompts offered minor improvements, only Mistral-Large approached human-level creativity.
Summary written by gemini-2.5-flash-lite from 1 source.
Academic paper introducing a new framework for evaluating LLM capabilities in literary translation.