SGLang's MI355x boosts DeepSeekv4 Pro throughput over 10x per GPU

By PulseAugur Editorial · [1 sources] · 2026-05-06 13:00

DeepSeekv4 Pro has seen a significant performance increase, achieving over tenfold improvement in throughput per GPU. This advancement was realized through the integration of MI355x on the SGLang framework. The gains represent a substantial leap in efficiency since the model's initial release. AI

IMPACT Demonstrates substantial efficiency gains in LLM inference, potentially lowering operational costs and increasing deployment feasibility.

RANK_REASON The cluster reports on a performance improvement for an existing model, not a new release or a significant industry event. [lever_c_demoted from research: ic=1 ai=1.0]

Read on X — SemiAnalysis →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

SGLang's MI355x boosts DeepSeekv4 Pro throughput over 10x per GPU

COVERAGE [1]

X — SemiAnalysis TIER_1 English(EN) · SemiAnalysis_ · 2026-05-06 13:00

Canyon Overlook, @ZionNPS - MI355x on SGLang has achieved >10x improvement on throughput PER GPU since day-0 release for DeepSeekv4 Pro. HUGE W to the 10x en

Canyon Overlook, @ZionNPS - MI355x on SGLang has achieved >10x improvement on throughput PER GPU since day-0 release for DeepSeekv4 Pro. HUGE W to the 10x engineers at Hai's team from @amd and @sgl_project! @EmadBarsoumPi @AnushElangovan https://t.co/O8DL9a1VD0

COVERAGE [1]

Canyon Overlook, @ZionNPS - MI355x on SGLang has achieved &gt;10x improvement on throughput PER GPU since day-0 release for DeepSeekv4 Pro. HUGE W to the 10x en

RELATED ENTITIES

RELATED TOPICS

Canyon Overlook, @ZionNPS - MI355x on SGLang has achieved >10x improvement on throughput PER GPU since day-0 release for DeepSeekv4 Pro. HUGE W to the 10x en