PulseAugur

New benchmark G-IdiomAlign tackles cross-lingual idiom translation in LLMs

Researchers have introduced G-IdiomAlign, a benchmark for evaluating how well large language models align idioms across languages. Each idiom is anchored by an English gloss from Wiktionary, which serves as a pivot between languages and sidesteps the unreliable literal mappings caused by idioms' non-compositionality and weak surface-form grounding. Initial tests show that LLMs are often biased toward literal translation, particularly for low-resource languages. Providing glosses improves performance in controlled generation tasks, though significant room for improvement remains.
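To make the gloss-pivot idea concrete, here is a minimal sketch of how idioms from different languages might be aligned through a shared English gloss. It is only an illustration, not the paper's method or data format: the toy entries, the hand-written glosses, the helper names `align_by_gloss` and `counterpart`, and the use of exact gloss matching are all assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical toy entries: (language code, idiom, English gloss).
# The glosses are hand-written for illustration; they are not taken from
# Wiktionary or from the G-IdiomAlign data.
ENTRIES = [
    ("en", "kick the bucket", "to die"),
    ("fr", "casser sa pipe", "to die"),
    ("de", "den Löffel abgeben", "to die"),
    ("en", "break the ice", "to ease initial social tension"),
    ("fr", "briser la glace", "to ease initial social tension"),
]


def align_by_gloss(entries):
    """Group idioms from different languages under their shared English gloss."""
    groups = defaultdict(dict)
    for lang, idiom, gloss in entries:
        # Assumption: two idioms align only if their normalized glosses match exactly.
        groups[gloss.strip().lower()][lang] = idiom
    return dict(groups)


def counterpart(entries, idiom, target_lang):
    """Return the target-language idiom that shares a gloss with `idiom`, or None."""
    for by_lang in align_by_gloss(entries).values():
        if idiom in by_lang.values():
            return by_lang.get(target_lang)
    return None


if __name__ == "__main__":
    print(counterpart(ENTRIES, "kick the bucket", "fr"))
    print(counterpart(ENTRIES, "break the ice", "de"))
```

Because the lookup goes through the gloss rather than the idiom's surface words, "kick the bucket" maps to an idiom meaning "to die" instead of a word-for-word rendering, which is the literal-translation failure the summary describes.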

IMPACT This benchmark could drive improvements in LLMs' ability to handle nuanced linguistic phenomena like idioms, enhancing cross-lingual communication.

RANK_REASON The cluster describes a new academic benchmark for evaluating LLM capabilities, presented in a research paper.

AI-generated summary · Google Gemini · from 3 sources.

COVERAGE [3]

  1. arXiv cs.AI TIER_1 English(EN) · Fengying Ye, Yanming Sun, Runzhe Zhan, Zheqi Zhang, Lidia S. Chao, Derek F. Wong ·

    G-IdiomAlign: A Gloss-Pivoted Benchmark for Cross-Lingual Idiom Alignment

    arXiv:2606.18989v1 · Idioms are difficult to transfer across languages due to their non-compositionality and weak surface-form grounding, making literal mappings unreliable. We present G-IdiomAlign, a gloss-pivoted benchmark where each idiom is anchor…

  2. arXiv cs.AI TIER_1 English(EN) · Derek F. Wong ·

    G-IdiomAlign: A Gloss-Pivoted Benchmark for Cross-Lingual Idiom Alignment

    Idioms are difficult to transfer across languages due to their non-compositionality and weak surface-form grounding, making literal mappings unreliable. We present G-IdiomAlign, a gloss-pivoted benchmark where each idiom is anchored by an English gloss from Wiktionary. We further…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    G-IdiomAlign: A Gloss-Pivoted Benchmark for Cross-Lingual Idiom Alignment

    Idioms are difficult to transfer across languages due to their non-compositionality and weak surface-form grounding, making literal mappings unreliable. We present G-IdiomAlign, a gloss-pivoted benchmark where each idiom is anchored by an English gloss from Wiktionary. We further…