MoE LLM vulnerability exploited by 'RepetitionCurse' DoS attack

By PulseAugur Editorial · [1 sources] · 2026-05-26 04:00

Researchers have identified a vulnerability in Mixture-of-Experts (MoE) Large Language Models that can be exploited as a denial-of-service attack. Adversarial inputs can cause the model's router to concentrate all processing on a small subset of experts, creating bottlenecks and increasing inference latency. The proposed 'RepetitionCurse' method uses simple repetitive token patterns to trigger this imbalance, significantly degrading model performance and availability. AI

IMPACT This research highlights a critical security vulnerability in MoE architectures, potentially impacting the reliability and availability of deployed LLM services.

RANK_REASON The cluster contains a research paper detailing a new attack method against MoE LLMs. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.LG TIER_1 English(EN) · Ruixuan Huang, Qingyue Wang, Hantao Huang, Yudong Gao, Dong Chen, Shuai Wang, Wei Wang · 2026-05-26 04:00

RepetitionCurse: Measuring and Understanding Router Imbalance in Mixture-of-Experts LLMs under DoS Stress

arXiv:2512.23995v2 Announce Type: replace-cross Abstract: Mixture-of-Experts architectures have become the standard for scaling large language models due to their superior parameter efficiency. To accommodate the growing number of experts in practice, modern inference systems com…

COVERAGE [1]

RepetitionCurse: Measuring and Understanding Router Imbalance in Mixture-of-Experts LLMs under DoS Stress

RELATED ENTITIES

RELATED TOPICS