Cross-lingual LLM explanations may lack faithfulness, study finds

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

A new research paper examines the trade-off between plausibility and faithfulness in cross-lingual explanations from large language models. The study found that explanations generated in English for non-English inputs can be less faithful to the model's actual reasoning process, even when they appear plausible. The loss in faithfulness, measured by comprehensiveness and sufficiency, can be large: comprehensiveness drops by a factor of up to 5.7 relative to native-language explanations. The research suggests auditing explanations in the input language and using more than one faithfulness metric for accurate model evaluation.
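For readers unfamiliar with the two metrics, the sketch below shows how comprehensiveness and sufficiency are commonly computed for extractive rationales (the erasure-based definitions popularized by the ERASER benchmark). It is an illustration under stated assumptions, not the paper's own implementation: the summary does not say which exact variant the authors use, and `toy_predict_proba`, `CUE_WORDS`, and the example sentence are hypothetical stand-ins for a real model and input.

```python
from typing import Callable, Sequence, Set

# Hypothetical stand-in for a classifier: confidence in the "positive"
# label rises with each cue word present in the input.
CUE_WORDS = {"love", "great"}

def toy_predict_proba(tokens: Sequence[str]) -> float:
    n_cues = sum(1 for t in tokens if t in CUE_WORDS)
    return min(1.0, 0.5 + 0.2 * n_cues)

def comprehensiveness(predict: Callable[[Sequence[str]], float],
                      tokens: Sequence[str],
                      rationale: Set[int]) -> float:
    """Confidence lost when the rationale tokens are removed.
    Higher means the rationale mattered more to the prediction."""
    kept = [t for i, t in enumerate(tokens) if i not in rationale]
    return predict(tokens) - predict(kept)

def sufficiency(predict: Callable[[Sequence[str]], float],
                tokens: Sequence[str],
                rationale: Set[int]) -> float:
    """Confidence lost when only the rationale tokens are kept.
    Lower means the rationale alone supports the prediction."""
    only = [t for i, t in enumerate(tokens) if i in rationale]
    return predict(tokens) - predict(only)

tokens = ["i", "love", "this", "great", "film"]
faithful = {1, 3}      # points at the words the toy model actually uses
unfaithful = {0, 2}    # plausible-looking span the toy model ignores

for name, span in [("faithful", faithful), ("unfaithful", unfaithful)]:
    comp = comprehensiveness(toy_predict_proba, tokens, span)
    suff = sufficiency(toy_predict_proba, tokens, span)
    print(f"{name}: comprehensiveness={comp:.2f}, sufficiency={suff:.2f}")
```

In this toy setting the faithful rationale scores high on comprehensiveness and near zero on sufficiency, while the unfaithful one shows the opposite pattern. Because the two numbers capture different failure modes, reporting both gives a fuller picture than either alone.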


IMPACT Highlights potential inaccuracies in cross-lingual LLM auditing, emphasizing the need for native-language explanations and robust faithfulness metrics.

RANK_REASON The cluster contains an academic paper detailing research findings on LLM explanations.


paper
safety

COVERAGE [1]

arXiv cs.CL TIER_1 · Animesh Mukherjee · 2026-05-19 02:44

Lost in Interpretation: The Plausibility-Faithfulness Trade-off in Cross-Lingual Explanations

LLMs deployed multilingually are often audited via English explanations for non-English inputs. We evaluate extractive explanations (where the model identifies input token spans as evidence alongside a generated rationale) and uncover a systematic trade-off: English-pivot expla…

