PulseAugur
EN
LIVE 22:13:09

Fireworks AI launches MiniMax M3 with 512K context and multimodal capabilities

Fireworks AI has launched the MiniMax M3 model, offering it as the fastest endpoint for the MiniMax series. This new model boasts a 512K context window and supports native image and video input, along with significant speed improvements in prefill and decode through MSA sparse attention. The MiniMax M3 is priced comparably to its predecessor, the M2.7, and is recognized as a top open-weight model on the Artificial Analysis index. AI

IMPACT Enhances inference speed and multimodal capabilities for AI applications, potentially lowering costs for users.

RANK_REASON This is a product launch for an inference infrastructure provider, not a frontier model release from a core AI lab.

Read on X — Fireworks (inference infra) →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

Fireworks AI launches MiniMax M3 with 512K context and multimodal capabilities

COVERAGE [2]

  1. X — Fireworks (inference infra) TIER_1 English(EN) · FireworksAI_HQ ·

    MiniMax M3 is live on Fireworks. Day-0, fastest endpoint for the MiniMax series.

    MiniMax M3 is live on Fireworks. Day-0, fastest endpoint for the MiniMax series. → Top open-weight model on the Artificial Analysis index → 512K context, native image + video input → MSA sparse attention: 9× faster prefill, 15× faster decode → Priced at parity with M2.7 https…

  2. X — Fireworks (inference infra) TIER_1 English(EN) · FireworksAI_HQ ·

    MiniMax M3 is live on Fireworks. Day-0, fastest endpoint for the MiniMax series.

    MiniMax M3 is live on Fireworks. Day-0, fastest endpoint for the MiniMax series. → Top open-weight model on the Artificial Analysis index → 512K context, native image + video input → MSA sparse attention: 9× faster prefill, 15× faster decode → Priced at parity with M2.7 https…