AI accelerator
PulseAugur coverage of AI accelerator — every cluster mentioning AI accelerator across labs, papers, and developer communities, ranked by signal.
1 day with sentiment data
-
Alphabet Poised to Overtake Nvidia as World's Largest Company on AI Dominance
Alphabet Inc. is on the verge of surpassing Nvidia Corp. to become the world's largest company, driven by its dominant and diversified presence across the AI ecosystem. The tech giant's market capitalization has surged…
-
Google DeepMind's AlphaEvolve AI optimizes TPUs, boosts commercial AI training and simulations
Google DeepMind has announced AlphaEvolve, an AI-powered coding agent that has been integrated into its infrastructure to optimize hardware and software. The system has already improved the efficiency of Google's…
-
New NPU-aware denoising model achieves high fidelity on mobile devices
Researchers have developed a novel approach for real image denoising specifically optimized for mobile Neural Processing Units (NPUs). This method uses a lightweight student network trained via knowledge distillation…
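The distillation setup described — a small student trained to reproduce a frozen teacher's outputs — can be sketched in a few lines. Everything below (the toy teacher, the single-weight student, the plain MSE objective) is an illustrative stand-in, not the paper's NPU-targeted architecture.

```python
# Minimal knowledge-distillation sketch: the student learns to match the
# frozen teacher's outputs on the same inputs. Models here are toys.

def teacher_denoise(x):
    # Stand-in for a large teacher denoiser: here it simply halves the signal.
    return [v * 0.5 for v in x]

def student_denoise(x, w):
    # Tiny student: a single learned scale factor (hardware-friendly in spirit).
    return [v * w for v in x]

def distill_step(batch, w, lr=0.1):
    # One gradient step on the MSE between student and teacher outputs.
    grad, n = 0.0, 0
    for x in batch:
        t = teacher_denoise(x)
        s = student_denoise(x, w)
        for ti, si, xi in zip(t, s, x):
            grad += 2 * (si - ti) * xi   # d/dw of (x*w - t)^2
            n += 1
    return w - lr * grad / n

batch = [[1.0, -2.0, 0.5], [0.3, 1.2, -0.7]]
w = 0.0
for _ in range(200):
    w = distill_step(batch, w)
# w converges toward the teacher's effective scale of 0.5
```

The student never sees clean ground truth here — only the teacher's predictions — which is the core of the distillation idea.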
-
Google's Gemma 4 models achieve 3x speed boost with speculative decoding
Google has released Multi-Token Prediction (MTP) drafters for its Gemma 4 open models, which can increase inference speed by up to three times. This advancement utilizes a speculative decoding architecture, allowing…
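The speculative-decoding loop behind such speedups can be sketched with toy models: a cheap drafter proposes several tokens ahead, and the expensive target model verifies them, keeping the matching prefix and substituting its own token at the first disagreement. The draft/target functions below are illustrative stand-ins, not Gemma's MTP drafters.

```python
# Toy greedy speculative decoding over integer "tokens".

def draft_next(seq):
    # Hypothetical cheap drafter: guesses the next token from the last one.
    return (seq[-1] + 1) % 100

def target_next(seq):
    # Hypothetical expensive target model; it agrees with the drafter except
    # when the last token is a multiple of 7.
    return (seq[-1] + 1) % 100 if seq[-1] % 7 else (seq[-1] + 2) % 100

def speculative_step(seq, k=4):
    # 1) Drafter proposes k tokens autoregressively (cheap calls).
    proposal = list(seq)
    for _ in range(k):
        proposal.append(draft_next(proposal))
    # 2) Target verifies each proposed position; in a real system these
    #    verifications are batched into one forward pass of the big model.
    accepted = list(seq)
    for i in range(len(seq), len(proposal)):
        t = target_next(proposal[:i])
        accepted.append(t)
        if t != proposal[i]:
            break                       # first mismatch: keep target's token
    return accepted

seq = [1]
while len(seq) < 12:
    seq = speculative_step(seq)
```

When the drafter agrees with the target, each step emits several tokens for one expensive pass — the source of the claimed speedup — and the output is identical to decoding with the target alone.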
-
GPT-5.5 Super App, Nebius NVIDIA Cloud, and Google TPU Sales Highlight AI Advancements
A new claim suggests that GPT-5.5, combined with Codex, can function as a "super app" with seven distinct capabilities. These features reportedly include app building, debugging, web browsing, and image generation…
-
Next-gen chips promise data centers greater efficiency and AI power
Next-generation chip designs, including those optimized for AI, energy efficiency, and heat tolerance, have the potential to significantly alter data center infrastructure. Innovations in packaging, memory, and offload…
-
AI-accelerated CFD simulations adapted for IPU platform show performance gains
Researchers have adapted AI-accelerated computational fluid dynamics (CFD) simulations to run on Graphcore's Intelligence Processing Units (IPUs). The study focused on training machine learning models to predict…
-
Researchers benchmark object detection models for edge devices
Researchers have benchmarked several deep learning object detection models, including YOLOv8, EfficientDet Lite, and SSD variants, on various edge computing devices like Raspberry Pi and Jetson Orin Nano. The study…
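An edge-inference benchmark of this kind boils down to warm-up runs followed by timed inference over a batch of frames. The sketch below uses a dummy detector in place of a real model call (e.g. a YOLOv8 or SSD inference); the harness structure, not the model, is the point.

```python
import time

def dummy_detector(frame):
    # Stand-in inference call: burns a little CPU proportional to frame size.
    return sum(frame) % 255

def benchmark(model, frames, warmup=5):
    # Warm-up runs let caches/JITs settle and are excluded from timing.
    for f in frames[:warmup]:
        model(f)
    start = time.perf_counter()
    for f in frames:
        model(f)
    elapsed = time.perf_counter() - start
    return {
        "mean_latency_ms": 1000 * elapsed / len(frames),
        "fps": len(frames) / elapsed,
    }

frames = [list(range(256)) for _ in range(50)]
stats = benchmark(dummy_detector, frames)
```

Reporting both mean latency and throughput matters on edge devices, since batching and thermal throttling can make the two diverge in practice.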
-
Study finds switchless networks more cost-effective for MoE LLM serving
A new paper analyzes network topologies for Mixture-of-Experts (MoE) Large Language Model (LLM) serving, finding that lower-cost, switchless networks can be more cost-effective than expensive scale-up infrastructures…
-
Google to sell its TPUs to some customers, who also fancy big-G GPUs
Alphabet announced a significant increase in its 2026 capital expenditure guidance, raising it to $180-$190 billion, driven by unprecedented demand for AI computing resources. The company's CFO highlighted strong growth…
-
Tenstorrent launches Galaxy Blackhole AI servers with 32 accelerators
Tenstorrent has announced the general availability of its Galaxy Blackhole AI compute platform, featuring 32 Blackhole accelerators in a 6U chassis for $110,000. The system offers 23 petaFLOPS of FP8 performance and can…
-
Google Cloud's AI compute market share rises with surging TPU demand
Google Cloud's market share is projected to increase significantly by 2026, driven by a massive surge in demand for Tensor Processing Units (TPUs). The company is expected to control a quarter of the global AI computing…
-
AHASD architecture boosts LLM speculative decoding on mobile devices
Researchers have developed AHASD, a novel asynchronous heterogeneous architecture designed to optimize large language model (LLM) inference on mobile devices. This architecture employs task-level decoupling for parallel…
-
Google unveils TPU V8 with two chips for training and inference at massive scale
Google has unveiled its eighth-generation Tensor Processing Units (TPUs), marking a significant shift by introducing two distinct chip designs for the first time. These new TPUs are engineered for specific, crucial task…
-
Tessera offers secure, near-line-rate weight streaming for edge AI accelerators
Researchers have developed Tessera, a new architecture designed to securely stream model weights to edge accelerators in Unified Memory Architecture (UMA) systems. This approach addresses the challenge of protecting…
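One ingredient of secure weight streaming — integrity protection per chunk, so the receiver can reject tampered data mid-stream — can be illustrated with standard-library HMAC. This is a toy sketch only, not Tessera's protocol (which also addresses confidentiality and UMA-specific concerns); the key, chunk size, and helper names are hypothetical.

```python
import hashlib
import hmac

KEY = b"shared-secret"              # illustrative pre-shared key, not real key management

def stream_chunks(weights_blob, chunk_size=8):
    # Sender side: split the serialized weights and tag each chunk.
    for i in range(0, len(weights_blob), chunk_size):
        chunk = weights_blob[i:i + chunk_size]
        tag = hmac.new(KEY, chunk, hashlib.sha256).digest()
        yield chunk, tag

def receive(chunks):
    # Receiver side: verify each chunk's tag before accepting it.
    out = b""
    for chunk, tag in chunks:
        expected = hmac.new(KEY, chunk, hashlib.sha256).digest()
        if not hmac.compare_digest(tag, expected):
            raise ValueError("chunk failed integrity check")
        out += chunk
    return out

blob = bytes(range(32))             # stand-in for serialized model weights
received = receive(stream_chunks(blob))
```

Because verification happens per chunk, streaming can proceed at line rate without buffering the whole model before checking it.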
-
New research explores efficient LLM inference through sparse caching, batching, and secure computation
Multiple research papers are exploring novel techniques to enhance the efficiency and performance of Large Language Model (LLM) inference and training. These advancements include queueing-theoretic frameworks…
-
New techniques ZipCCL and FlashOverlap accelerate LLM training by optimizing communication
Researchers have developed ZipCCL, a lossless compression library designed to accelerate the distributed training of large language models by addressing communication bottlenecks. The library utilizes novel techniques…
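The core idea of lossless communication compression — shrink gradient bytes before sending, restore them bit-exactly on arrival — can be shown with stdlib `zlib` as a stand-in codec. ZipCCL's actual techniques are not reproduced here; this only demonstrates the round-trip.

```python
import struct
import zlib

def compress_grads(grads):
    # Pack float32 gradients to bytes, then compress losslessly before "sending".
    raw = struct.pack(f"{len(grads)}f", *grads)
    return zlib.compress(raw)

def decompress_grads(blob, n):
    # Receiving side: decompress and unpack; values must be bit-identical.
    raw = zlib.decompress(blob)
    return list(struct.unpack(f"{n}f", raw))

# Gradients with many repeated values (e.g. zeros from sparsity) compress well
# even losslessly; all values here are exactly representable in float32.
grads = [0.0] * 900 + [0.25, -1.5] * 50
blob = compress_grads(grads)
restored = decompress_grads(blob, len(grads))
```

Lossless schemes like this trade CPU cycles for network bytes, which pays off exactly when inter-node bandwidth, not compute, is the training bottleneck.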
-
Google Cloud Next unveils new TPUs, Gemini Enterprise Agent Platform
Google Cloud has announced new AI innovations, including their eighth-generation Tensor Processing Units (TPUs) designed for both inference and reasoning. The company also unveiled the Gemini Enterprise Agent Platform…
-
Meta, NVIDIA, and AWS advance agentic AI with new models and ARM-based infrastructure
Meta has entered into a multi-year agreement with Amazon Web Services (AWS) to utilize tens of millions of AWS's Graviton 5 CPU cores. This collaboration aims to diversify Meta's compute infrastructure and will support…
-
Photonic processors offer energy-efficient alternative for deep learning computations
The future of deep learning may involve photonic processors that use light instead of electrons to perform calculations. This approach aims to reduce the significant energy demands of current neural networks, which rely…