DeepSeek V4
PulseAugur coverage of DeepSeek V4 — every cluster mentioning DeepSeek V4 across labs, papers, and developer communities, ranked by signal.
- developed by DeepSeek (100%)
- subsidiary of DeepSeek (100%)
- instance of DeepSeek (90%)
- used by Huawei Ascend (90%)
- instance of mixture of experts (90%)
- used by 36Kr (90%)
- developed mixture of experts (90%)
- developed by DeepSeek V3.2 (90%)
- instance of DeepSeek-V4 Flash (90%)
- founded Liang Wenfeng (90%)
- used by Moore Threads (90%)
- instance of DeepSeek V3.2 (90%)
- 2026-06-28 research_milestone DeepSeek V4's DSpark update significantly improves inference speed.
- 2026-06-28 research_milestone DeepSeek V4 achieved an 80% increase in inference speed with the integration of DSpark.
- 2026-06-27 product_launch DeepSeek AI has released a preview of its DeepSeek-V4 series of MoE language models, featuring a 1 million token context length.
- 2026-06-27 product_launch DeepSeek released preview versions of its DeepSeek-V4 series, including DeepSeek-V4-Pro and DeepSeek-V4-Flash, both supporting a one million token context length.
- 2026-06-19 research_milestone DeepSeek released a preview of its DeepSeek-V4 series of language models, featuring one million token context length and Mixture-of-Experts architecture.
- 2026-06-18 research_milestone A new technique called Lookahead Sparse Attention was introduced, significantly reducing the KV cache size for the DeepSeek-V4 model.
- 2026-06-15 research_milestone Analysis reveals co-design of DeepSeek V4 and Huawei Ascend 950DT significantly cut AI inference costs.
- 2026-06-15 research_milestone DeepSeek V4 and Huawei Ascend 950DT co-design resulted in a 75% reduction in AI inference costs, according to SemiAnalysis.
- 2026-06-08 product_launch DeepSeek's V4 model launch has led to significant price cuts by Chinese cloud providers and competitors like Xiaomi.
- 2026-06-01 research_milestone DeepSeek V4, co-designed with Huawei hardware, shows significant performance gains, indicating a shift in global AI leadership.
- 2026-05-31 research_milestone DeepSeek V4 demonstrates strong performance in Chinese cultural contexts and legal accuracy, despite mixed global benchmark results.
- 2026-05-25 research_milestone DeepSeek V4 completes full adaptation to Huawei Ascend chips, marking a milestone for China's domestic AI stack.
- 2026-05-24 research_milestone DeepSeek V4 is presented as a new AI model challenging OpenAI.
- 2026-05-23 product_launch DeepSeek released its V4 model with a 1 million token context window.
- DeepSeek V4 boosts inference speed by 80% after funding
DeepSeek has announced significant performance improvements for its V4 model, with updates to DSpark reportedly increasing inference speed by 80%. Following its initial funding round, the company has also claimed an 85%…
- iOS 27 to Deepen Apple Intelligence Integration, Boosting iPhone DRAM
Analyst Ming-Chi Kuo reports that iOS 27 will feature deeper system-level integration with Apple Intelligence. To support AI workloads, lower-end iPhone models in the first half of 2027, equipped with the A20 processor,…
- DeepSeek V4 with DSpark boosts inference speed by 80%
DeepSeek has released an update to its DeepSeek V4 model, now featuring DSpark. This enhancement reportedly boosts inference speed by 80%. This development follows DeepSeek's initial funding round and signifies a signif…
- AI and energy sectors drive industrial profits; institutional interest high in over 120 stocks
Over 120 stocks received institutional research attention in the past week, with Guangshengtang attracting the most inquiries from 53 institutions. Several other stocks, including Shenghong Co., Ltd. and United Chemical…
- DeepSeek V4 boosts inference speed by 80% with DSpark integration
DeepSeek has announced significant updates to its V4 model, including an 80% increase in inference speed with the integration of DSpark. This advancement follows DeepSeek's initial funding round and aims to bolster the …
- MiniMax M3: Open-weight 1M-context model released, but commercial use restricted
MiniMax has released MiniMax M3, an open-weight Mixture-of-Experts model featuring a 1 million token context window and native multimodality. The model boasts 428 billion total parameters, with only 23 billion active pe…
- DeepSeek unveils V4 models with 1M token context and MoE architecture · 3 sources tracked
DeepSeek has released preview versions of its DeepSeek-V4 series, featuring two Mixture-of-Experts (MoE) language models: DeepSeek-V4-Pro and DeepSeek-V4-Flash. Both models support an impressive one million token contex…
- Non-linear quality degradation observed under repeated LLM context compaction; no benchmarks yet
A user observed that the output quality of LLMs like DeepSeek V4 and Claude Code does not degrade linearly with repeated context compaction. Instead, there appears to be a temporary improvement after the second compacti…
- Zhipu AI's GLM-5.2 challenges top closed-source models with open release · 1 source tracked
Zhipu AI has released its flagship open-source model, GLM-5.2, which supports a 1 million token context window and has demonstrated top performance in coding and long-range tasks. This release follows Anthropic's tempor…
- Baidu Cloud launches enterprise AI subscription service with GLM-5.2 integration
Baidu Cloud has launched the Baidu Qianfan Token Plan Enterprise Edition, a subscription service designed to streamline AI resource management for businesses. This new offering allows companies to procure, manage, and o…
- Baidu Smart Cloud launches enterprise AI subscription plan with GLM-5.2 and DeepSeek-V4
Baidu Smart Cloud has launched Baidu Qianfan Token Plan Enterprise Edition, a subscription service designed for businesses to manage and utilize AI capabilities. This new offering provides a unified approach to procurin…
- AI chip demand surges, driving GPU prices and sparking funding rounds
The AI chip industry is experiencing significant shifts, with major internet companies directly procuring thousands of NVIDIA B300 GPUs, bypassing traditional channels. This surge in demand is driving up prices for high…
- Mimo 2.5 excels at large context tasks on consumer GPUs
The Mimo 2.5 large language model demonstrates impressive speed and performance with large context windows, particularly on dual RTX Pro 6000 GPUs. This is attributed to its efficient 5-to-1 local/global sliding-window …
- Unified API routes tasks to cheapest LLM, saving 65% on costs · 1 source tracked
A developer has created a unified API that routes requests to multiple large language models, including GLM-5.2, DeepSeek V4, MiniMax M3, and Kimi K2.6. This approach allows users to optimize costs by directing tasks to…
- Multi-tier MoE caching discussed as future of LLM inference
A discussion on Reddit explores the concept of multi-tier Mixture of Experts (MoE) caching as a potential future direction for MoE model inference. The idea involves strategically distributing model experts across CPU a…
- Fireworks AI offers frontier RL infrastructure as a managed service
Fireworks AI is launching a new managed service that provides specialized infrastructure for reinforcement learning on frontier models. This service addresses the complex challenge of ensuring numerical consistency betw…
- New speculative decoding methods boost LLM inference speed and safety
Researchers are developing advanced speculative decoding techniques to accelerate large language model inference. HyperDFlash optimizes decoding for DeepSeek-V4's multi-hyper-connection architecture, improving draft acc…
- Microsoft explores DeepSeek V4 for Copilot Cowork while shifting to usage-based AI pricing
Microsoft is evolving its Copilot Cowork AI Agent system by introducing usage-based pricing and exploring the integration of DeepSeek V4 as a cost-effective model option. This strategic shift addresses the escalating co…
- Microsoft explores DeepSeek V4 for cost-effective AI as Copilot pricing shifts
Microsoft is reportedly shifting its Copilot Cowork offering to a usage-based pricing model due to escalating costs. The company is exploring open-source alternatives, including DeepSeek V4, to manage expenses more effe…
- Hugging Face highlights AI advancements: Transformers v5, DeepSeek V4, and more · 7 sources tracked
Hugging Face is highlighting several recent advancements across the AI ecosystem. These include the release of Transformers v5 for defining AI models, the OpenEnv framework for evaluating tool-using agents in real-world…