Brief

last 24h

[5/5] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

SIGNIFICANT · Artificial Intelligence News English(EN) · 5d · [3 sources]

Nvidia’s Vera chip is the US$200 billion bet Jensen Huang doesn’t want you to overlook

Nvidia CEO Jensen Huang has introduced the Vera chip, a new CPU designed specifically for agentic AI, targeting a substantial $200 billion market segment. This initiative aims to diversify Nvidia's revenue beyond its dominant AI GPU offerings, with Huang projecting Vera to become the company's second-largest sales contributor. The chip is positioned to address the growing demand for efficient inference workloads, a space where custom silicon from hyperscalers presents increasing competition. AI

IMPACT Nvidia's new Vera chip could shift inference workload dynamics and create a new competitive front against hyperscaler custom silicon.
- Vera
- Jensen Huang
- Nvidia
- Intel
- Microsoft
- Google
- Amazon
- AMD
- Groq
- Blackwell
SIGNIFICANT · SCMP — Tech English(EN) · 4d · [3 sources]

What the China-US stability pact means for Southeast Asia

Taiwan has initiated its first formal crackdown on the illicit export of AI chips, raiding 12 locations and seeking three fugitives accused of document forgery and fraudulent declarations. This action is part of a broader effort to prevent restricted NVIDIA hardware, particularly from Super Micro Computer Inc. servers, from reaching China and other restricted regions, in direct violation of US trade restrictions. The crackdown signifies a major policy shift by Taiwan's government under President Lai Ching-te, aimed at securing the global AI supply chain and responding to pressure from Washington. AI

IMPACT Tightens restrictions on AI chip exports, potentially impacting supply chains and increasing costs for restricted markets.
- NVIDIA
- China
- Taiwan
- Blackwell
- Lai Ching-te
- Super Micro Computer Inc.
- US
- Hopper
- United States
- Alibaba
- Macau
- Hong Kong
RESEARCH · Lobsters — AI tag English(EN) · 3d · [3 sources]

Dissecting ThunderKittens, anatomy of a compact DSL for high-performance AI kernels

A new article details ThunderKittens, a compact domain-specific language (DSL) developed at Stanford's Hazy Research Lab for creating high-performance AI kernels. The DSL aims to strike a balance between research productivity and hardware efficiency by abstracting repetitive GPU programming tasks like tile layouts and memory allocation. This allows developers to maintain close reasoning about data movement and scheduling while still enabling performance optimization for modern AI workloads on hardware like NVIDIA's Hopper and Blackwell architectures. AI

IMPACT Enables more efficient AI model training and inference by optimizing low-level GPU kernel performance.
- NVIDIA
- AI
- Stanford
- FlashAttention-2
- Hopper
- PyTorch
- CUDA
- GPU
- Blackwell
- Triton
- Hazy Research Lab
- ThunderKittens
SIGNIFICANT · 36氪 (36Kr) 中文(ZH) · 2w · [43 sources]

Nvidia: This year's CPU revenue is expected to reach $20 billion

Google has launched its Gemini 3.5 series of models, including updates to its large context window capabilities. Separately, Nvidia's CFO expressed confidence in significant revenue from their Blackwell and Vera Rubin chips, projecting substantial income between 2025 and 2027. Airbnb is expanding its offerings to include grocery delivery, car rentals, and AI-powered tools for trip planning and property comparison. AI

IMPACT Major AI model updates and hardware revenue projections signal continued industry growth and innovation.
- Google
- Gemini 3.5
- Eddie Wu
- DeepSeek
- Alibaba
- Vera CPU
- Nvidia
- Joe Tsai
- Airbnb
- Vera Rubin
- Blackwell
- AI
RESEARCH · X — Perplexity English(EN) · 1w · [4 sources]

We published new research on how we serve post-trained Qwen3 235B models on NVIDIA GB200 NVL72 Blackwell racks.

Perplexity has published research detailing how they serve large language models, specifically Qwen3 235B, on NVIDIA's GB200 NVL72 Blackwell racks. The findings indicate that the GB200 platform offers significant improvements over previous NVIDIA hardware for large-model inference, boasting reduced latency and higher throughput. This research highlights the GB200's capabilities for both training and high-throughput inference, particularly for Mixture-of-Experts (MoE) models. AI

IMPACT NVIDIA's GB200 Blackwell platform shows significant gains in LLM inference speed and cost-efficiency, potentially accelerating deployment of large models.
- NVIDIA
- Perplexity
- Hopper
- H200
- Blackwell
- Qwen3 235B
- GB200 NVL72

Brief

Nvidia’s Vera chip is the US$200 billion bet Jensen Huang doesn’t want you to overlook

What the China-US stability pact means for Southeast Asia

Dissecting ThunderKittens, anatomy of a compact DSL for high-performance AI kernels

Nvidia: This year's CPU revenue is expected to reach $20 billion

We published new research on how we serve post-trained Qwen3 235B models on NVIDIA GB200 NVL72 Blackwell racks.