DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
DeepSeekV4, a 1.6 trillion parameter model, has shown significant performance gains in the 43 days since its release. Early benchmarks indicate it is competitive with or surpasses established models like GPT-4 and Claude 3 Opus, particularly in areas such as reasoning and coding. The model's development was supported by Huawei's advanced computing infrastructure, including their GB300 NVL72 and MI355X accelerators, and NVIDIA's B200 GPUs, suggesting a strong hardware-software synergy. AI
IMPACT DeepSeekV4's rapid performance improvement challenges existing frontier models and highlights the impact of advanced hardware on AI capabilities.