Brief

last 24h

[23/23] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

RESEARCH · dev.to — LLM tag English(EN) · 23h

Qwen 3.6 Has Four Tiers. Here's How to Route Without Burning Cash.

Alibaba has released four tiers of its Qwen 3.6 model, with pricing varying by a factor of 41x between the cheapest and most expensive options. The article provides guidance on how to route requests to the appropriate tier to optimize costs and performance, suggesting that a dynamic routing strategy can significantly reduce monthly expenses without sacrificing quality for most tasks. It also highlights the risks associated with the 'Max-Preview' tier, recommending fallback mechanisms for production environments. AI

IMPACT Optimizing LLM costs through intelligent routing can significantly reduce operational expenses for AI applications.
SIGNIFICANT · Mastodon — fosstodon.org English(EN) · 7h

DeepSeek V4 Pro Could Reshape the Global Tech Industry The AI industry may have just entered its “budget airline moment.” Chinese AI lab DeepSeek has reportedly

Chinese AI lab DeepSeek has reportedly launched its V4 Pro model, potentially ushering in a new era of accessibility in the AI industry. This development is being compared to the "budget airline moment" for AI, suggesting a significant shift in the market. AI

IMPACT This release could democratize access to advanced AI models, potentially lowering costs and increasing adoption across industries.
- DeepSeek
- DeepSeek V4 Pro
TOOL · dev.to — LLM tag Tiếng Việt(VI) · 17h

Setting Up DeepSeek-V4-Pro Reasoning Proxy with Cursor (2026) Guide

A technical guide details how to integrate the DeepSeek V4-Pro model with the Cursor IDE, addressing a common HTTP 400 error. The issue arises because Cursor, adhering to the OpenAI schema, omits the `reasoning_content` field returned by DeepSeek V4-Pro, which the DeepSeek API requires for subsequent tool calls. To resolve this, the guide recommends using an open-source proxy, `deepseek-cursor-proxy`, which intercepts requests, stores the `reasoning_content`, and re-injects it before forwarding to DeepSeek. AI

IMPACT Provides a workaround for integrating a specific LLM with an IDE, improving developer workflow for users of these tools.
RESEARCH · Mastodon — mastodon.social 한국어(KO) · 11h

Counterpoint Research (@CounterpointTR) reports that the price of DeepSeek V4-Pro has been reduced by 75%, significantly shaking the price competitiveness of Western frontier AI models. API/model inference costs are rapidly decreasing, impacting companies' model selection, multi-model strategies, and inference optimization.

DeepSeek V4-Pro has significantly reduced its pricing by 75%, challenging the cost-effectiveness of Western frontier AI models. This substantial decrease in API and model inference expenses could directly influence how businesses select and optimize their AI models, potentially leading to wider adoption of multi-model strategies. AI

IMPACT This price cut by DeepSeek V4-Pro may force Western AI labs to re-evaluate their pricing strategies and could accelerate enterprise adoption of more cost-effective models.
- Counterpoint Research
- DeepSeek V4-Pro
SIGNIFICANT · The Decoder English(EN) · 2d

Alibaba's latest AI model ran autonomously for 35 hours to optimize code for its own custom chip

Alibaba's Qwen team has released Qwen3.7-Max, a new proprietary AI model designed for extended autonomous agent tasks. This model has demonstrated its capabilities by running for 35 hours to optimize code for Alibaba's custom chip. In benchmarks, Qwen3.7-Max performs comparably to Anthropic's Claude Opus 4.6 and surpasses other Chinese models such as DeepSeek V4 Pro and Kimi K2.6. AI

IMPACT Sets a new benchmark for autonomous agent execution duration and performance against leading models.
TOOL · 36氪 (36Kr) 中文(ZH) · 4d

Behind 900 Million Clicks, The Real World of AI Applications | 2026 China AI Application Panorama Report

A new report from Quantum Bit Think Tank analyzes the evolving landscape of AI applications in China, shifting from simple chatbots to task-oriented agents. The report highlights a significant increase in AI application usage, with web traffic exceeding 900 million monthly visits and app downloads surpassing 240 million. Key trends include the rise of agents, the democratization of AI models, AI assistants becoming primary interfaces, the initial success of paid AI models, and the deepening penetration of AI in vertical business sectors. AI

IMPACT Highlights China's leading role in AI application adoption and the shift towards task-oriented AI, influencing global development priorities.
- Baidu
- GPT-5.5
- Alibaba
- DeepSeek V4-Pro
- China
- Tencent
- Zhipu AI
- Kimi K2.5
- ByteDance
- Seedance 2.0
- Doubao
- AI applications
- Quantum Bit Think Tank
TOOL · dev.to — LLM tag English(EN) · 5d

Which LLM is the best stock picker? I built a benchmark to find out.

A new benchmark, dubbed 1rok, has been launched to evaluate the stock-picking capabilities of frontier large language models. The benchmark assigns each participating LLM a virtual portfolio of $100,000 and tasks them with selecting stocks weekly, with performance tracked against market outcomes. This initiative aims to provide a more practical, downstream evaluation of LLMs beyond traditional coding and reasoning benchmarks, focusing on decision-making under uncertainty. AI

IMPACT Provides a novel benchmark for evaluating LLM decision-making under uncertainty, moving beyond traditional coding and reasoning tasks.
- Google
- OpenAI
- xAI
- GPT-5.5
- Gemini 3.1 Pro Preview
- Kimi K2.6
- GLM-5.1
- DeepSeek V4 Pro
- Moonshot
- Grok 4.3
- MiniMax M2.7
- 1rok
SIGNIFICANT · Mastodon — mastodon.social Deutsch(DE) · 23h

RT @DataChaz: Are you aware of what is happening here? 🚨 DEEPSEEK V4 PRO IS NOW FOREVER 75% CHEAPER. The new king in cost per token. Input:

DeepSeek V4 Pro is now available at a significantly reduced price, making it 75% cheaper than before. This new pricing strategy positions the model as a highly cost-effective option for users, particularly in terms of cost per token. AI

IMPACT New pricing models for frontier models can accelerate adoption and shift competitive dynamics.
- Mastodon
- DeepSeek V4 Pro
TOOL · Mastodon — fosstodon.org English(EN) · 5d

One command to install the entire AI design stack. Ollama + Hermes Agent + DeepSeek V4 Pro. Here's how to set it up: https:// youtu.be/lQHyLYXlunI # AI # design

A user has shared instructions for a one-command installation of an AI design stack. This stack includes Ollama, Hermes Agent, and DeepSeek V4 Pro, with a YouTube video tutorial provided for setup. The setup aims to streamline the process of deploying these AI tools for design purposes. AI

IMPACT Simplifies deployment of AI tools for design workflows.
TOOL · dev.to — LLM tag English(EN) · 4d

How I Adapted Self-Critique Loops for a One-Person Builder Stack. The MINDCHANGE Axis Result Was Negative.

A solo developer adapted existing self-critique methods for large language models to fit within a single-agent, single-session framework suitable for a one-person operation. The new MINDCHANGE pattern includes three stages: negative-self, self-audit, and mind-change, aiming to differentiate genuine weaknesses from superficial critiques. This approach was tested with five different models, including Claude Opus 4.7 and Gemini 3.5 Flash, and is designed to be cost-effective for frequent, automated use. AI

IMPACT Enables more efficient and cost-effective self-improvement for LLMs in constrained environments.
COMMENTARY · 36氪 (36Kr) 中文(ZH) · 2d

The movie "Letters to Grandma" box office exceeds 900 million

DeepSeek announced that the API pricing for its DeepSeek-V4-Pro model will revert to its original price after May 31, 2026, following a 75% discount period. The article also mentions the box office success of the film "Letters to Grandma" surpassing 900 million yuan and a report from The Lancet on global mental health. AI

IMPACT DeepSeek-V4-Pro API pricing will return to original levels after a promotional discount ends in mid-2026.
SIGNIFICANT · 量子位 (QbitAI) 中文(ZH) via roundup · 4d · [2 sources]

热门文章

DeepSeek is reportedly in advanced talks for a substantial funding round, potentially reaching 70 billion RMB (approximately $10 billion USD) at a $45 billion valuation. The company aims to prioritize AGI research and maintain its commitment to open-source models, with founder Liang Feng emphasizing technological advancement over immediate commercialization. This funding round is attracting significant interest from major players, including battery giant CATL, which sees DeepSeek as a key energy consumer for its data center expansion, and potentially JD.com and NetEase. AI

IMPACT This substantial funding could accelerate DeepSeek's pursuit of AGI and bolster its infrastructure, potentially influencing the broader AI hardware and energy sectors.
SIGNIFICANT · dev.to — LLM tag 中文(ZH) · 5d · [2 sources]

Alibaba Qwen3.7-Max Released: 35 Hours of Autonomous Evolution, The Road to the Top for Domestic Large Models

Alibaba has unveiled its new flagship large language model, Qwen3.7-Max, at the Cloud Summit. This model demonstrates a remarkable ability to autonomously evolve and optimize itself over 35 hours, a key feature that has propelled it to the top of the Arena leaderboard for Chinese AI models. Qwen3.7-Max also shows significant improvements in coding, multimodal understanding, and reasoning capabilities, approaching GPT-4o levels. AI

IMPACT Sets a new benchmark for Chinese LLMs and showcases advanced autonomous agent capabilities, potentially accelerating development in agentic AI.
- 真武M890
- Kimi-K2.6
- GLM-5.1
- DeepSeek-v4-pro
- Alibaba Cloud
- Qwen3.7-Max
- Alibaba
- GPT-4o
- Arena
FRONTIER RELEASE · Hugging Face Trending Models English(EN) · 2w · [6 sources]

tencent/Hy-MT2-30B-A3B

Tencent has released its Hy-MT2 family of multilingual translation models, available in 1.8B, 7B, and 30B-A3B sizes. These models support translation across 33 languages and are designed for complex, real-world scenarios, including instruction-following. The 1.8B model features extreme quantization for on-device deployment, reducing its size to 440MB while improving inference speed. The Hy-MT2 models demonstrate strong performance, with the 7B and 30B-A3B versions outperforming open-source competitors like DeepSeek-V4-Pro and Kimi K2.6, and the 1.8B model competing with mainstream commercial APIs. AI

IMPACT Sets a new benchmark for multilingual translation models, particularly in fast-thinking and instruction-following capabilities.
- Hugging Face
- Microsoft
- Kimi K2.6
- DeepSeek-V4-Pro
- Tencent
- Doubao
- AngelSlim
- Hy-MT2
- IFMTBench
TOOL · Together AI blog English(EN) · 1w · [2 sources]

Violin: An open-source video translation skill that breaks language barriers

Together AI has launched Violin, an open-source video translation tool designed to make online video content accessible across language barriers. The system utilizes advanced AI, including speech recognition, large language models, and speech synthesis, to provide high-quality translations. Violin also features interactive capabilities like a content-aware chat assistant and personalized voice selection, aiming to broaden the reach of video content globally. AI

IMPACT Enhances accessibility of video content globally by leveraging multiple AI models for translation and interaction.
SIGNIFICANT · The Verge — AI English(EN) · 3w · [34 sources]

Microsoft starts canceling Claude Code licenses

Major tech companies like Microsoft, Meta, and Amazon are reportedly pulling back on internal AI usage due to escalating costs, primarily driven by the increased consumption of tokens by agentic AI tools. This phenomenon, dubbed 'tokenmaxxing,' where employees use AI extensively to meet productivity targets, is proving more expensive than human labor in some cases. Microsoft's decision to discontinue Claude Code licenses in favor of its own GitHub Copilot CLI exemplifies this trend, driven by both cost-cutting and a strategic move to control internal development workflows. AI

IMPACT Rising AI token costs and 'tokenmaxxing' are forcing companies to re-evaluate AI adoption, potentially slowing enterprise-wide integration.
- Jensen Huang
- Copilot CLI
- Meta
- Microsoft
- Amazon
- Nvidia
- OpenClaw
- Claude Code
- Peter Steinberger
- agentic AI
- tokenmaxxing
- DeepSeek
- DeepSeek v4 Pro
- Jevons Paradox
- GitHub Copilot CLI
- Anthropic
- Goldman Sachs
- Qwen3.6-27B
- Rajesh Jha
FRONTIER RELEASE · Qwen tech blog English(EN) · 1mo · [17 sources]

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

Qwen has released Qwen3.6-27B, a dense 27-billion-parameter multimodal model designed for advanced coding tasks. This model aims to provide flagship-level agentic coding performance, surpassing previous open-source models in this category. Various community members have already made different quantized versions of Qwen3.6-27B available on Hugging Face, facilitating its use across different platforms and libraries. AI

IMPACT Sets a new benchmark for dense coding models, potentially influencing future development in agentic AI and code generation.
COMMENTARY · Mastodon — fosstodon.org English(EN) · 6d · [2 sources]

AI procurement is shifting from "one model fits all" to task-specific routing. Claude Opus remains costly but competitive for high-judgment work. Cheaper altern

A developer found DeepSeek's v4 Pro model to be a capable and cost-effective alternative to Anthropic's Claude Opus for real-world coding tasks. Over a month, the developer used DeepSeek for building an MVP and indexing a marketplace, noting that the model handled long-running agentic sessions and high-volume classification tasks without issues. While Claude Opus was previously used for its high-judgment capabilities, DeepSeek proved competitive and significantly cheaper, prompting a shift towards task-specific AI model routing. AI

IMPACT Developers are exploring cost-effective, task-specific AI models, potentially reducing reliance on premium options for routine tasks.
- Anthropic
- Gemini
- Claude Opus
- DeepSeek
- Grok
- DeepSeek v4 Pro
COMMENTARY · dev.to — LLM tag English(EN) · 1w · [3 sources]

How much does it really cost to use AI models for coding?

A developer detailed their experience using open-weight AI models for a coding project, incurring a cost of only $5 for over 400 million tokens via a subscription service. This contrasts sharply with the estimated $138.70 per month if using traditional inference providers like OpenRouter, and a staggering $690.77 per month for a model like GPT-5.4. The analysis raises questions about the sustainability of current AI subscription models and whether companies are subsidizing usage to gain market share. AI

IMPACT Highlights the significant cost savings and potential economic models behind AI inference, impacting developer choices and company strategies.
- GPT-5.4
- Xiaomi
- DeepSeek
- Kimi K2.6
- OpenRouter
- DeepSeek V4 Pro
- MiMo-V2.5-Pro
- Opencode Go
- MoonshotAI
TOOL · X — Fireworks (inference infra) English(EN) · 1w

RT @Azure: Kimi K2.6 and DeepSeek V4 Pro are now GA on @FireworksAI_HQ on Foundry + PTU support in the US Data Zone—predictable performance…

Fireworks AI has announced that Kimi K2.6 and DeepSeek V4 Pro models are now generally available on its platform. These models are accessible via Azure Foundry and include PTU support within the US Data Zone, promising predictable performance for users. AI

IMPACT Makes existing frontier models more accessible via cloud infrastructure, potentially increasing adoption.
RESEARCH · Together AI blog English(EN) · 3w

DeepSeek-V4 Pro now available on Together AI

DeepSeek-V4 Pro, a large Mixture-of-Experts model with 1.6 trillion parameters, is now accessible on the Together AI platform. This model is designed for long-context reasoning, supporting up to a 512K-token context window in its initial Together AI deployment, with plans for a 1M-token context window. It features controllable reasoning modes to optimize for speed or depth and offers specialized pricing for cached input tokens to reduce costs on repeated queries. AI

IMPACT Enables new applications requiring reasoning over extensive datasets, potentially lowering costs for repeated long-context queries.
SIGNIFICANT · Fireworks AI blog English(EN) · 4w

DeepSeek V4 Pro: Validating Frontier Models for Production

Fireworks AI has released DeepSeek V4 Pro, an open-source model notable for its advancements in long-context reasoning, agentic performance, and inference efficiency. The model features a mixture-of-experts architecture and a 1M-token context window, designed for cost-effective handling of extensive state and complex agentic workflows. Fireworks AI delayed the public release to address critical serving-path correctness issues that caused reasoning degradation and output corruption, ensuring production readiness before launch. AI

IMPACT Sets a new standard for open-source models in long-context reasoning and agentic tasks, potentially influencing future model development and deployment strategies.
- DeepSeek
- DeepSeek V4 Pro
- SGLang
- vLLM
- Fireworks AI
SIGNIFICANT · arXiv cs.CL English(EN) · 20mo · [280 sources]

Asking For An Old Friend: Diagnosing and Mitigating Temporal Failure Modes in LLM-based Statutory Question Answering

Researchers have developed a benchmark to test Large Language Models' ability to handle temporal changes in legal statutes, identifying issues like outdated information and recency bias. Meanwhile, the AI industry is seeing a significant shift as model labs increasingly focus on building agent-based products rather than just foundational models. This strategic pivot is exemplified by companies like AI21 and DeepSeek, and is further underscored by DeepSeek's aggressive pricing strategy for its V4-Pro model, making advanced AI more accessible. AI

IMPACT The industry's focus is shifting from foundational models to agent-based products, with aggressive pricing making advanced AI more accessible and competitive.
- Anthropic
- OpenAI
- Claude
- Andrej Karpathy
- Tesla
- Nick Joseph
- LangSmith
- DeepSeek
- AI21
- Google
- Cursor
- Qwen
- Alibaba
- Gemini
- Codex
- Devin
- Gemini 3.1 Pro Preview
- Qwen3.7 Preview
- DeepSeek-V4-Pro
- Cursor Composer 2.5
- Gemini Flash
- GPT-5.5
- Claude Opus 4.7