PulseAugur
EN
LIVE 10:43:54

LLMs struggle with cultural translation in math problems

A new study analyzed how large language models like Claude Opus 4, GPT-4.1, and Gemini 2.5 Pro translate math word problems across various languages and cultures. The research found that while models often agree on the type of transformation, they frequently substitute specific cultural elements like names and foods, leading to a significant divergence in the cultural context presented to students. Furthermore, all tested language-model combinations exhibited "entropy collapse," meaning the adaptation process compressed rather than expanded cultural diversity, and models often misattributed regional contexts or introduced cross-cultural contamination, such as equating egg hunts with Eid activities. AI

IMPACT Reveals significant limitations in LLMs' ability to perform nuanced cultural translation, impacting educational applications.

RANK_REASON The cluster contains an academic paper detailing research findings on LLM capabilities.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

  1. arXiv cs.CL TIER_1 English(EN) · Parisa Suchdev, Juniper Lovato ·

    Who Brought Easter Eggs to Eid? Auditing Cultural Translation of Math Word Problems Across Diverse Languages and Regions

    arXiv:2606.11009v1 Announce Type: new Abstract: Large language models are increasingly used to adapt math word problems for personalized learning at scale, but it remains an open question whether those adaptations are consistent across models, preserve cultural diversity at scale…

  2. arXiv cs.CL TIER_1 English(EN) · Juniper Lovato ·

    Who Brought Easter Eggs to Eid? Auditing Cultural Translation of Math Word Problems Across Diverse Languages and Regions

    Large language models are increasingly used to adapt math word problems for personalized learning at scale, but it remains an open question whether those adaptations are consistent across models, preserve cultural diversity at scale, and reveal which cultural entities models trea…