PulseAugur
LIVE 12:47:52
significant · [1 source] ·
74
significant

DeepSeek V4 models launch, tying Claude Opus 4.6 on agent benchmarks

DeepSeek has released its V4 models, including V4-Pro and V4-Flash, both featuring a 1 million token context window and released under the MIT license. These models achieve performance on par with Claude Opus 4.6 on public agent benchmarks and surpass GPT-5.4 on coding challenges, positioning them as strong open-weight alternatives for AI agents. Key improvements include a new XML-based schema for tool calls that reduces parsing errors and enhanced reasoning persistence across multiple tool interactions. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Sets new SOTA on agentic benchmarks for open-weight models, offering a cost-effective alternative to proprietary models for tool-driven agents.

RANK_REASON Frontier-lab model release with system card. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on dev.to — MCP tag →

COVERAGE [1]

  1. dev.to — MCP tag TIER_1 · Rupa Tiwari ·

    Connect Your MCP Server With DeepSeek V4 — Step-by-Step Guide (2026)

    <h2> TL;DR </h2> <ul> <li> <strong>DeepSeek V4 shipped April 24, 2026</strong> in two flavours: <strong>V4-Pro</strong> (1.6T MoE / 49B active) and <strong>V4-Flash</strong> (284B MoE / 13B active). Both expose a <strong>1M-token context</strong> and ship under the MIT license.</…