PulseAugur
EN
LIVE 23:48:23

MoE LLM vulnerability exploited by 'RepetitionCurse' DoS attack

Researchers have identified a vulnerability in Mixture-of-Experts (MoE) Large Language Models that can be exploited as a denial-of-service attack. Adversarial inputs can cause the model's router to concentrate all processing on a small subset of experts, creating bottlenecks and increasing inference latency. The proposed 'RepetitionCurse' method uses simple repetitive token patterns to trigger this imbalance, significantly degrading model performance and availability. AI

IMPACT This research highlights a critical security vulnerability in MoE architectures, potentially impacting the reliability and availability of deployed LLM services.

RANK_REASON The cluster contains a research paper detailing a new attack method against MoE LLMs. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Ruixuan Huang, Qingyue Wang, Hantao Huang, Yudong Gao, Dong Chen, Shuai Wang, Wei Wang ·

    RepetitionCurse: Measuring and Understanding Router Imbalance in Mixture-of-Experts LLMs under DoS Stress

    arXiv:2512.23995v2 Announce Type: replace-cross Abstract: Mixture-of-Experts architectures have become the standard for scaling large language models due to their superior parameter efficiency. To accommodate the growing number of experts in practice, modern inference systems com…