ENTITY graphics processing unit

graphics processing unit

PulseAugur coverage of graphics processing unit — every cluster mentioning graphics processing unit across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

223

223 over 90d

Releases · 30d

0 over 90d

Papers · 30d

69 over 90d

TIER MIX · 90D

significant 13
research 49
tool 114
commentary 39
meme 8

TOPICS

RELATIONSHIPS

SENTIMENT · 30D

29 day(s) with sentiment data

RECENT · PAGE 9/10 · 200 TOTAL

TOOL · CL_18041 · May 5 · 22:01

GPU hardware analysis reveals memory bandwidth, not FLOPS, is key for LLMs

This article explains the fundamental architecture of GPUs, focusing on how their design prioritizes memory bandwidth over raw computational power for machine learning tasks. It details how GPUs manage thousands of thre…
SIGNIFICANT · CL_17945 · May 5 · 21:00

Datacenter AI clusters rely on Indium Phosphide for laser chips

Indium Phosphide (InP) is a critical semiconductor material used in datacenter laser chips and optical transceivers that connect GPUs in AI clusters. Its unique crystal lattice allows for the growth of alloys that emit …
RESEARCH · CL_17304 · May 5 · 20:05

Astera Labs launches new fabric switch to boost AI workload efficiency

Astera Labs has introduced a new smart fabric switch, the Scorpio X-Series, designed to address inefficiencies in AI infrastructure. This new hardware aims to reduce coordination overhead and improve accelerator utiliza…
SIGNIFICANT · CL_16724 · May 5 · 13:42

India's Krutrim pivots to cloud amid GPU woes; new tech tackles RAG hallucinations

India's first generative AI unicorn, Krutrim, is shifting its focus from developing sovereign AI models to offering cloud services by 2026. This pivot is driven by the economic realities and significant GPU shortages im…
TOOL · CL_16219 · May 5 · 04:00

Graph Neural Networks accelerate VLSI design with faster capacitance modeling

Researchers have developed GNN-Ceff, a novel method utilizing Graph Neural Networks for post-layout effective capacitance modeling in VLSI design. This approach aims to improve the accuracy and speed of static timing an…
TOOL · CL_16179 · May 5 · 04:00

SwiftChannel framework co-designs AI hardware for faster 5G channel estimation

Researchers have developed SwiftChannel, a novel algorithm-hardware co-design framework for deep learning-based 5G channel estimation. This framework integrates a hardware-friendly convolutional neural network with a de…
TOOL · CL_16155 · May 5 · 04:00

SURGE system optimizes GPU encoding for large-scale text embedding generation

Researchers have developed SURGE, a new system designed to improve the efficiency of generating text embeddings on GPUs. SURGE addresses the bottleneck of processing numerous small data partitions by employing a streami…
TOOL · CL_16004 · May 5 · 04:00

New CUDA implementation speeds up optimal transport calculations on GPUs

Researchers have developed FastSinkhorn, a new CUDA implementation for the Sinkhorn algorithm used in optimal transport computations. This method operates entirely in the log-domain, ensuring numerical stability even wi…
TOOL · CL_15971 · May 5 · 04:00

New SPES framework enables memory-efficient decentralized LLM pretraining on fewer GPUs

Researchers have developed a novel decentralized framework called SPES for pretraining large language models, specifically Mixture-of-Experts (MoE) architectures. This method significantly reduces memory requirements by…
RESEARCH · CL_15670 · May 5 · 04:00

New HERMES and DSCache methods improve streaming video understanding with KV cache

Researchers have developed new methods to improve the efficiency of multimodal large language models (MLLMs) for understanding streaming video. One approach, HERMES, conceptualizes the KV cache as a hierarchical memory …
RESEARCH · CL_15158 · May 4 · 23:15

Zyphra's TSP strategy boosts LLM training throughput by 2.6x

Zyphra has developed a new technique called Tensor and Sequence Parallelism (TSP) designed to optimize the training and inference of large transformer models. This hardware-aware strategy combines aspects of Tensor Para…
RESEARCH · CL_14976 · May 4 · 21:05

NVIDIA cuOpt and OpenAI achieve breakthroughs in supply chain and voice AI

NVIDIA is enhancing supply chain decision systems with its cuOpt technology, which combines agentic AI with GPU acceleration for real-time, large-scale planning. Separately, OpenAI has achieved low-latency voice AI, del…
TOOL · CL_14833 · May 4 · 16:05

AWS SageMaker adds automatic instance fallback for AI endpoints

Amazon SageMaker has introduced a new feature called capacity-aware instance pools for AI inference endpoints. This enhancement allows users to define a prioritized list of instance types, enabling SageMaker to automati…
RESEARCH · CL_16299 · May 4 · 13:49

Coral and CoRAL systems optimize LLM serving and robotic control

Researchers have developed two distinct systems named Coral and CoRAL. Coral is an adaptive system designed for cost-efficient serving of multiple large language models across heterogeneous cloud GPUs, aiming to optimiz…
MEME · CL_14555 · May 4 · 08:04

Mastodon users criticize energy consumption of AI hardware

The user is expressing frustration about the energy consumption associated with specialized hardware, drawing a parallel to the cryptocurrency industry. They note that ASICs have largely replaced GPUs in certain applica…
SIGNIFICANT · CL_13762 · May 3 · 15:36

ODMs transition from manufacturing to AI infrastructure partners for complex racks

Original Design Manufacturers (ODMs) are transitioning from traditional hardware production to becoming key partners in AI infrastructure. This evolution is spurred by the increasing complexity of AI hardware, particula…
TOOL · CL_13684 · May 3 · 13:06

GitHub tool measures GPU 'useful' work amid AI and security buzz

A new GitHub tool called Utilyze has been released, designed to monitor GPU performance for "useful" work. The tool aims to track computational tasks beyond entertainment, incorporating buzzwords like AI, workflow autom…
RESEARCH · CL_13590 · May 3 · 09:58

Sasha Rush releases Autodiff Puzzles to teach automatic differentiation

Sasha Rush has released "Autodiff Puzzles," an interactive Google Colab notebook designed to teach automatic differentiation. Similar to his previous puzzle series on Tensors and GPUs, these challenges guide users throu…
TOOL · CL_17313 · May 1 · 09:00

Next-gen chips promise data centers greater efficiency and AI power

Next-generation chip designs, including those optimized for AI, energy efficiency, and heat tolerance, have the potential to significantly alter data center infrastructure. Innovations in packaging, memory, and offload …
SIGNIFICANT · CL_11581 · May 1 · 04:07

Datavault AI raises $120M to build nationwide GPU network for AI compute

Datavault AI has secured $120 million in funding from Scilex Holding to establish a nationwide GPU network. This initiative aims to provide increased computing power for companies engaged in artificial intelligence deve…

GPU hardware analysis reveals memory bandwidth, not FLOPS, is key for LLMs

Datacenter AI clusters rely on Indium Phosphide for laser chips

Astera Labs launches new fabric switch to boost AI workload efficiency

India's Krutrim pivots to cloud amid GPU woes; new tech tackles RAG hallucinations

Graph Neural Networks accelerate VLSI design with faster capacitance modeling

SwiftChannel framework co-designs AI hardware for faster 5G channel estimation

SURGE system optimizes GPU encoding for large-scale text embedding generation

New CUDA implementation speeds up optimal transport calculations on GPUs

New SPES framework enables memory-efficient decentralized LLM pretraining on fewer GPUs

New HERMES and DSCache methods improve streaming video understanding with KV cache

Zyphra's TSP strategy boosts LLM training throughput by 2.6x

NVIDIA cuOpt and OpenAI achieve breakthroughs in supply chain and voice AI

AWS SageMaker adds automatic instance fallback for AI endpoints

Coral and CoRAL systems optimize LLM serving and robotic control

Mastodon users criticize energy consumption of AI hardware

ODMs transition from manufacturing to AI infrastructure partners for complex racks

GitHub tool measures GPU 'useful' work amid AI and security buzz

Sasha Rush releases Autodiff Puzzles to teach automatic differentiation

Next-gen chips promise data centers greater efficiency and AI power

Datavault AI raises $120M to build nationwide GPU network for AI compute