DeepSeek V4, an open-weight model family, has been released with a 1.6-trillion-parameter Mixture-of-Experts architecture that activates only 49 billion parameters per token. This new model boasts a 1-million-token context window and significantly reduced inference costs, achieving up to 73% lower costs than its predecessor due to innovations like Hybrid Attention. The V4 family, available on Hugging Face, offers comparable quality to leading models like GPT-5.4 and Claude Opus 4.6 at a fraction of the price, with optimized hardware performance for NVIDIA Blackwell. AI
Summary written by gemini-2.5-flash-lite from 4 sources. How we write summaries →
IMPACT Sets a new standard for efficiency in large MoE models, making advanced AI capabilities more accessible and affordable for developers.
RANK_REASON New model release from DeepSeek, a significant AI lab, with detailed technical specifications and benchmark comparisons.