MiniMax readies M3 LLM with custom sparse attention

By PulseAugur Editorial · [1 source] · 2026-05-28 08:12

Chinese AI company MiniMax is preparing to release its M3 large language model, which incorporates a custom sparse attention mechanism. The company claims prefilling speeds up to 9.7 times faster than previous versions.

IMPACT Introduces a novel sparse attention mechanism that could significantly speed up LLM inference.

RANK_REASON Frontier-lab model release with system card.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

Pandaily TIER_1 English (EN) · 2026-05-28 08:12

MiniMax Prepares to Launch Next-Generation M3 Large Language Model

Chinese AI unicorn MiniMax is preparing to launch its M3 large language model, which features a custom sparse attention mechanism, and claims a 9.7x improvement in prefilling speed.
