DeepSeek has released two new text-generation models: DeepSeek-V4-Flash-DSpark and DeepSeek-V4-Pro-DSpark. The Flash model features 284 billion parameters with 13 billion active and a 1 million token context window. The Pro model is significantly larger, boasting 1.6 trillion parameters with 49 billion active, also supporting a 1 million token context. AI
IMPACT These models offer large parameter counts and extensive context windows, potentially advancing capabilities in complex text generation tasks.
RANK_REASON Frontier-lab model release with system card [lever_c_demoted from frontier_release: ic=2 ai=1.0]
Read on Mastodon — mastodon.social →
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →