New Benchmark Tests LLMs on Arabic-Hebrew Cognate Ambiguity

By PulseAugur Editorial · [2 sources] · 2026-06-11 11:33

Researchers have developed SemCog Bench, a new benchmark designed to evaluate how well large language models (LLMs) handle cognates between Arabic and Hebrew. The benchmark includes 1,858 word pairs and sentence-level annotations to test identification and semantic disambiguation. Evaluations revealed that LLMs perform well on true cognates but struggle significantly with false friends and loanwords, indicating a reliance on surface-level similarity rather than deep semantic understanding. Even with contextual cues, performance gains were modest, highlighting a fundamental limitation in current LLMs for resolving cross-lingual meaning conflicts. AI

IMPACT Highlights limitations in LLM cross-lingual understanding, potentially guiding future model development for nuanced semantic reasoning.

RANK_REASON The cluster describes a new academic paper introducing a benchmark for evaluating LLMs on linguistic tasks.

Read on arXiv cs.CL →

paper
other

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

arXiv cs.CL TIER_1 English(EN) · Junhong Liang, Noor Abo Mokh, Bashar Alhafni · 2026-06-12 04:00

When Similar Means Different: Evaluating LLMs on Arabic--Hebrew Cognates

arXiv:2606.13218v1 Announce Type: new Abstract: Arabic and Hebrew, as closely related Semitic languages, share a substantial lexicon of true cognates, misleading false friends, and modern loanwords. This overlap poses a challenge for cross-lingual semantic understanding in large …
arXiv cs.CL TIER_1 English(EN) · Bashar Alhafni · 2026-06-11 11:33

When Similar Means Different: Evaluating LLMs on Arabic--Hebrew Cognates

Arabic and Hebrew, as closely related Semitic languages, share a substantial lexicon of true cognates, misleading false friends, and modern loanwords. This overlap poses a challenge for cross-lingual semantic understanding in large language models (LLMs). To evaluate this capabil…

COVERAGE [2]

When Similar Means Different: Evaluating LLMs on Arabic--Hebrew Cognates

When Similar Means Different: Evaluating LLMs on Arabic--Hebrew Cognates

RELATED ENTITIES

RELATED TOPICS