实体 graphics processing unit

graphics processing unit

PulseAugur coverage of graphics processing unit — every cluster mentioning graphics processing unit across labs, papers, and developer communities, ranked by signal.

Show in brief

总计 · 30天

134

90 天内 134

发布 · 30天

90 天内 0

论文 · 30天

90 天内 48

层级分布 · 90 天

significant 9
research 35
tool 64
commentary 23
meme 3

关系

情绪 · 30 天

18 天有情绪数据

最近 · 第 2/7 页 · 共 134 条

COMMENTARY · CL_40510 · May 20 · 10:24

AI development costs rise, potentially slowing industry boom

The escalating costs of AI development, particularly for advanced hardware like GPUs, are beginning to strain the rapid expansion of the AI industry. This price surge, driven by high demand and limited supply, could pot…
SIGNIFICANT · CL_40296 · May 20 · 08:09

NYSE owner plans futures market for AI computing power as AI reshapes jobs

Intercontinental Exchange, the parent company of the New York Stock Exchange, is planning to launch futures contracts for computing power, specifically focusing on GPUs. This initiative, in partnership with Ornn, aims t…
SIGNIFICANT · CL_40095 · May 20 · 05:38

Alibaba Cloud pivots to high-margin AI token revenue amid investor scrutiny

Alibaba's cloud division is facing scrutiny over its AI strategy, with investors closely monitoring its token revenue growth as a key indicator of future profitability. While AI compute sales offer high revenue, they yi…
RESEARCH · CL_42791 · May 20 · 00:33

Mahjong RL simulator Mahjax achieves 2M steps/sec on GPUs

Researchers have developed Mahjax, a new GPU-accelerated simulator for the complex game of Riichi Mahjong, implemented in JAX. This tool is designed to facilitate reinforcement learning research, particularly for agents…
RESEARCH · CL_41740 · May 19 · 18:57

New framework models pump deterioration for targeted infrastructure management

Researchers have developed a new framework for causal discovery in infrastructure management, focusing on pump equipment deterioration. This method combines Bayesian hierarchical hazard modeling with causal discovery to…
TOOL · CL_40774 · May 19 · 15:01

GEM framework optimizes MoE AI model GPU mapping for faster inference

Researchers have developed GEM, a framework designed to optimize the mapping of experts to GPUs in Mixture-of-Expert (MoE) AI models. This new approach accounts for variability in GPU performance, aiming to reduce infer…
COMMENTARY · CL_38548 · May 19 · 06:52

Baidu CFO: AI infrastructure too hard to build, cloud providers to profit

Baidu's CFO stated that building AI infrastructure is prohibitively difficult, leading to cloud providers capitalizing on the situation. This difficulty stems from the high costs and complexity associated with AI hardwa…
TOOL · CL_38376 · May 19 · 05:48

AI adoption faces infrastructure, legal, and automation hurdles globally

Manus has launched Scheduled Tasks 2.0, transforming basic reminders into intelligent agents capable of maintaining context and autonomously updating web applications. Meanwhile, the United Arab Emirates aims to generat…
RESEARCH · CL_40163 · May 18 · 22:35

KV Cache Optimization Solves LLM GPU Memory Bottleneck

Large language models (LLMs) face a significant bottleneck in serving efficiency due to the memory demands of KV cache, which stores intermediate attention calculations. This KV cache, essential for enabling faster resp…
RESEARCH · CL_37752 · May 18 · 21:01

US eyes non-GPU hardware for supercomputers amid AI security concerns

The US government is exploring alternative hardware for its next major supercomputer, potentially moving beyond traditional GPUs. This exploration is driven by the accelerating adoption of AI and the associated security…
SIGNIFICANT · CL_37589 · May 18 · 17:33

AI investment shifts from GPU training to inference infrastructure

The AI industry's investment focus is shifting from GPU manufacturing for model training to the infrastructure required for inference. As AI tools become more integrated into daily operations, the demand for continuous …
RESEARCH · CL_37002 · May 18 · 12:30

Nvidia releases open Ising quantum AI models for qubit calibration

Nvidia has released open-source Ising quantum AI models designed to automate and improve the calibration of quantum processors. These models, which include a vision-language model for proposing calibration actions and C…
COMMENTARY · CL_35994 · May 18 · 01:00

Orphaned AI tasks continue to consume resources post-disconnection

AI systems can continue to consume resources like tokens and GPU time even after a user has disconnected from the service. This occurs due to orphaned asynchronous tasks that were initiated before the user session ended…
MEME · CL_35700 · May 17 · 16:37

Meme humorously frames AI war with hardware components

This cluster consists of two identical Mastodon posts discussing a meme titled "Big E offering advice in the WAR AGAINST ABOMINABLE INTELLIGENCE." The meme appears to be a humorous take on AI, with the posts noting that…
TOOL · CL_35014 · May 16 · 21:49

Developer creates embcache to prevent stale vector matches

A developer has designed and documented a new GPU-native, two-tier cache called embcache, specifically for handling vector embeddings and KV states. This cache addresses the critical issue of stale vector matches that c…
TOOL · CL_34412 · May 16 · 10:44

SalesCloser deploys GPU cluster for custom AI model fine-tuning

SalesCloser has enhanced its conversational AI capabilities by deploying a dedicated GPU inference cluster. This infrastructure upgrade allows for the fine-tuning of custom models and supports agentic workflows. The mov…
RESEARCH · CL_33185 · May 15 · 12:50

AI infrastructure focus shifts from GPU quantity to operational efficiency

The AI infrastructure landscape is shifting focus from acquiring more GPUs to optimizing the efficiency of existing systems. As AI workloads move into production, concerns about power grid strain, complex cluster manage…
SIGNIFICANT · CL_32503 · May 15 · 02:25

Datavault AI deploys 48k GPUs; XMax pivots to AI; MindBio detects intoxication

Datavault AI is preparing to deploy a massive fleet of 48,000 GPUs across over 100 U.S. markets, positioning itself ahead of potential Senate digital asset legislation. Separately, XMax is pivoting from furniture to AI,…
SIGNIFICANT · CL_31808 · May 14 · 15:16

OpenAI's massive GPU network faces copyright lawsuits; Mozilla opposes Google's Prompt API

OpenAI has reportedly built a massive 131,000-GPU training cluster using unconventional networking strategies that challenge industry norms. This infrastructure development coincides with a new lawsuit from authors alle…
COMMENTARY · CL_30985 · May 14 · 04:40

Tencent: GPUs profitable only for personalized ads

Tencent has stated that the high cost of GPUs is only justified when they are used to power personalized advertising. The company's Chief Strategy Officer noted that deploying GPUs for ad tech leads to improved targetin…

AI development costs rise, potentially slowing industry boom

NYSE owner plans futures market for AI computing power as AI reshapes jobs

Alibaba Cloud pivots to high-margin AI token revenue amid investor scrutiny

Mahjong RL simulator Mahjax achieves 2M steps/sec on GPUs

New framework models pump deterioration for targeted infrastructure management

GEM framework optimizes MoE AI model GPU mapping for faster inference

Baidu CFO: AI infrastructure too hard to build, cloud providers to profit

AI adoption faces infrastructure, legal, and automation hurdles globally

KV Cache Optimization Solves LLM GPU Memory Bottleneck

US eyes non-GPU hardware for supercomputers amid AI security concerns

AI investment shifts from GPU training to inference infrastructure

Nvidia releases open Ising quantum AI models for qubit calibration

Orphaned AI tasks continue to consume resources post-disconnection

Meme humorously frames AI war with hardware components

Developer creates embcache to prevent stale vector matches

SalesCloser deploys GPU cluster for custom AI model fine-tuning

AI infrastructure focus shifts from GPU quantity to operational efficiency

Datavault AI deploys 48k GPUs; XMax pivots to AI; MindBio detects intoxication

OpenAI's massive GPU network faces copyright lawsuits; Mozilla opposes Google's Prompt API

Tencent: GPUs profitable only for personalized ads