New SomaliBench benchmark reveals large refusal gaps in open-weight LLMs

By PulseAugur Editorial · [3 sources] · 2026-05-25 04:45

A new benchmark, SomaliBench v0, has been developed to evaluate the safety refusal capabilities of open-weight language models in Somali, a low-resource language. The study found significant gaps in refusal rates between English and Somali for models like Llama-3.1-8B-Instruct, Aya-23-8B, Qwen-2.5-7B-Instruct, and Gemma-2-9B-Instruct. For many models, non-refusal in Somali often resulted in unclear or incoherent outputs rather than direct harmful compliance. AI

IMPACT Highlights the need for more robust safety evaluations in low-resource languages, potentially influencing future model development and testing.

RANK_REASON The cluster describes a new academic benchmark and evaluation of existing models, fitting the research bucket.

Read on Hugging Face Daily Papers →

paper
safety

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

New SomaliBench benchmark reveals large refusal gaps in open-weight LLMs

COVERAGE [3]

arXiv cs.AI TIER_1 English(EN) · Khalid Yusuf Dahir · 2026-05-26 04:00

SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models

arXiv:2605.25420v1 Announce Type: cross Abstract: Large language model safety evaluation remains heavily English-centered, leaving low-resource languages under-measured even when models are deployed globally. We evaluate four open-weight instruction-tuned models on SomaliBench v0…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-25 04:45

SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models

Large language model safety evaluation remains heavily English-centered, leaving low-resource languages under-measured even when models are deployed globally. We evaluate four open-weight instruction-tuned models on SomaliBench v0, a native-author-verified benchmark of 100 harmfu…
arXiv cs.CL TIER_1 English(EN) · Khalid Yusuf Dahir · 2026-05-25 04:45

SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models

Large language model safety evaluation remains heavily English-centered, leaving low-resource languages under-measured even when models are deployed globally. We evaluate four open-weight instruction-tuned models on SomaliBench v0, a native-author-verified benchmark of 100 harmfu…

COVERAGE [3]

SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models

SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models

SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models

RELATED ENTITIES

RELATED TOPICS