Fireworks AI has announced the availability of Qwen 3.7 Plus and Qwen 3.7 Max models on its inference infrastructure. These models are designed for long-horizon agent loops and offer features like preserved reasoning history and native image/text input. Fireworks emphasizes that users' requests execute end-to-end on their hardware using licensed weights, ensuring control over latency, throughput, and data paths with zero data retention and a 99.9% SLA. AI
IMPACT Enhances agent loop capabilities and offers direct hardware execution for Qwen models.
RANK_REASON Product launch of new AI models by an inference provider.
AI-generated summary · Google Gemini · from 5 sources. How we write summaries →