Fireworks AI launches Qwen 3.7 Plus and Max models

By PulseAugur Editorial · [5 sources] · 2026-06-12 00:00

Fireworks AI has announced the availability of Qwen 3.7 Plus and Qwen 3.7 Max models on its inference infrastructure. These models are designed for long-horizon agent loops and offer features like preserved reasoning history and native image/text input. Fireworks emphasizes that users' requests execute end-to-end on their hardware using licensed weights, ensuring control over latency, throughput, and data paths with zero data retention and a 99.9% SLA. AI

IMPACT Enhances agent loop capabilities and offers direct hardware execution for Qwen models.

RANK_REASON Product launch of new AI models by an inference provider.

Read on Fireworks AI blog →

AI-generated summary · Google Gemini · from 5 sources. How we write summaries →

Fireworks AI launches Qwen 3.7 Plus and Max models

COVERAGE [5]

X — Fireworks (inference infra) TIER_1 English(EN) · FireworksAI_HQ · 2026-06-13 04:15

Drop it into what you already use.

Drop it into what you already use. OpenAI + Anthropic compatible endpoints. Works with Claude Code, Cursor, LangChain, etc. Run it serverless today, and get in touch if you’re interested in early-access for Qwen 3.7 Max or bespoke workloads. Get started: https://t.co/PaZEwfjY5…
X — Fireworks (inference infra) TIER_1 English(EN) · FireworksAI_HQ · 2026-06-13 04:15

Fireworks serves Qwen 3.7 Plus as a true inference provider.

Fireworks serves Qwen 3.7 Plus as a true inference provider. Your requests execute end-to-end on our hardware from the licensed weights. No forwarding. Qwen 3.7 Plus (thinking) matches Max on AIME 2025 and delivers 3.55× higher end-to-end throughput than 3.6 Plus.
X — Fireworks (inference infra) TIER_1 English(EN) · FireworksAI_HQ · 2026-06-13 04:15

Built for long-horizon agent loops.

Built for long-horizon agent loops. Observe → reason → code → act (GUI/CLI) → verify, repeat. Qwen’s own demo ran 11 hours with 10k+ lines of code and 1k+ calls. On Fireworks you get: → reasoning_history="preserved" to persist reasoning context across turns →
X — Fireworks (inference infra) TIER_1 English(EN) · FireworksAI_HQ · 2026-06-13 04:15

Qwen 3.7 Plus is now live on Fireworks.

Qwen 3.7 Plus is now live on Fireworks. You get the official weights running on our stack. That means full control of latency, throughput, and data path end-to-end, with zero data retention and our 99.9% SLA. Let’s dig in ↓ https://t.co/4JAmGyj9PE
Fireworks AI blog TIER_1 English(EN) · 2026-06-12 00:00

Qwen 3.7 Plus on Fireworks: Run it today.

Use state-of-the-art, open-source LLMs and image models at blazing fast speed, or fine-tune and deploy your own at no additional cost with Fireworks AI!

COVERAGE [5]

Drop it into what you already use.

Fireworks serves Qwen 3.7 Plus as a true inference provider.

Built for long-horizon agent loops.

Qwen 3.7 Plus is now live on Fireworks.

Qwen 3.7 Plus on Fireworks: Run it today.

RELATED ENTITIES

RELATED TOPICS