Chinese AI labs have significantly reduced LLM API prices in the first half of 2026, with DeepSeek, Xiaomi, and Moonshot making these cuts permanent. DeepSeek V4-Pro now offers the lowest cost per output token at $0.87 per million, while Xiaomi MiMo V2.5 provides a flat rate for long contexts at $3 per million output tokens. Other notable models include Alibaba's Qwen3 Max for general production balance and Moonshot's Kimi K2.6 for efficient handling of stable prompts. AI
IMPACT Accelerates adoption of LLM APIs by making them significantly more affordable, especially for long-context and high-output workloads.
RANK_REASON Multiple frontier labs announcing permanent price cuts on their leading LLM APIs.
- Alibaba
- Claude Opus 4.7
- DeepSeek
- DeepSeek V4-Pro
- GLM-5
- GPT-5.5
- Kimi K2.6
- Moonshot
- Qwen3 Max
- Xiaomi
- Xiaomi MiMo V2.5
AI-generated summary · Google Gemini · from 3 sources. How we write summaries →