graphics processing unit
PulseAugur coverage of graphics processing unit — every cluster mentioning graphics processing unit across labs, papers, and developer communities, ranked by signal.
- used by Vulkan 90%
- used by Triton 90%
- used by central processing unit 70%
- competes with Tensor Processing Unit 70%
- competes with application-specific integrated circuit 70%
- competes with Apple Neural Engine 70%
- instance of high-performance computing 70%
- used by AI inference 70%
- used by H.1000 Gnome 70%
- used by Innu-aimun 70%
- competes with Cerebras Systems 70%
- used by SemiAnalysis 70%
29 day(s) with sentiment data
-
Orphaned AI tasks continue to consume resources post-disconnection
AI systems can continue to consume resources like tokens and GPU time even after a user has disconnected from the service. This occurs due to orphaned asynchronous tasks that were initiated before the user session ended…
-
Meme humorously frames AI war with hardware components
This cluster consists of two identical Mastodon posts discussing a meme titled "Big E offering advice in the WAR AGAINST ABOMINABLE INTELLIGENCE." The meme appears to be a humorous take on AI, with the posts noting that…
-
Developer creates embcache to prevent stale vector matches
A developer has designed and documented a new GPU-native, two-tier cache called embcache, specifically for handling vector embeddings and KV states. This cache addresses the critical issue of stale vector matches that c…
-
SalesCloser deploys GPU cluster for custom AI model fine-tuning
SalesCloser has enhanced its conversational AI capabilities by deploying a dedicated GPU inference cluster. This infrastructure upgrade allows for the fine-tuning of custom models and supports agentic workflows. The mov…
-
New research tackles attention mechanism limitations in transformers
Researchers are exploring novel approaches to enhance the efficiency and effectiveness of attention mechanisms in transformers. Several papers introduce methods to mitigate issues like over-smoothing and computational b…
-
AI infrastructure focus shifts from GPU quantity to operational efficiency
The AI infrastructure landscape is shifting focus from acquiring more GPUs to optimizing the efficiency of existing systems. As AI workloads move into production, concerns about power grid strain, complex cluster manage…
-
Datavault AI deploys 48k GPUs; XMax pivots to AI; MindBio detects intoxication
Datavault AI is preparing to deploy a massive fleet of 48,000 GPUs across over 100 U.S. markets, positioning itself ahead of potential Senate digital asset legislation. Separately, XMax is pivoting from furniture to AI,…
-
OpenAI's massive GPU network faces copyright lawsuits; Mozilla opposes Google's Prompt API
OpenAI has reportedly built a massive 131,000-GPU training cluster using unconventional networking strategies that challenge industry norms. This infrastructure development coincides with a new lawsuit from authors alle…
-
Tencent: GPUs profitable only for personalized ads
Tencent has stated that the high cost of GPUs is only justified when they are used to power personalized advertising. The company's Chief Strategy Officer noted that deploying GPUs for ad tech leads to improved targetin…
-
Hierarchical Transformer Preconditioner speeds up physics simulations
Researchers have developed a Hierarchical Transformer Preconditioner designed to improve the efficiency of real-time physics simulations. This new method utilizes a multiscale structural prior derived from an H-matrix p…
-
Godot Engine Tech Lead Discusses AI's Practical Role and GPU Needs
An interview with Clay John, technical lead at the Godot Engine, focused on the open-source philosophy and rapid development cycles of the game engine. John highlighted Godot's emphasis on practical rendering features a…
-
Nscale secures $790M for Norway AI data center amid energy scramble
Nscale, an AI infrastructure developer, has secured $790 million in financing for its data center campus in Narvik, Norway. The deal, backed by several Nordic and European banks, signals a shift towards treating AI infr…
-
LoKA framework enables low-precision FP8 for large recommendation models
Researchers have developed LoKA, a framework designed to make low-precision arithmetic, specifically FP8, practical for large recommendation models (LRMs). Unlike previous attempts that often degraded model quality, LoK…
-
CPUs are sufficient for most AI tasks, offering cost savings
Most AI applications do not require a GPU and can perform optimally using CPU infrastructure. This approach can be more cost-effective for businesses. The article provides guidance on how to integrate AI into applicatio…
-
Cerebras Systems boosts IPO on AI compute demand
Cerebras Systems is significantly increasing its IPO price and share count due to high demand driven by the AI industry's need for compute power. While GPUs, particularly from Nvidia, have dominated AI workloads like tr…
-
Sakana AI, NVIDIA unveil TwELL for faster LLM training and inference
Researchers from Sakana AI and NVIDIA have developed TwELL, a novel method that significantly speeds up large language model (LLM) operations. By targeting the feedforward layers, which are computationally intensive, Tw…
-
AI models increasingly run on-device, reducing service reliance
The shift towards running AI models locally on devices is a positive development, moving away from a reliance on "LLM as a Service" models. While the necessary hardware, such as GPUs, remains costly, there is an expecta…
-
China's CPI Rises 1.2% in April; Stock Market Opens Higher
China's National Bureau of Statistics reported that the Consumer Price Index (CPI) rose by 1.2% year-on-year in April, with a 0.3% increase month-on-month. Concurrently, the Producer Price Index (PPI) saw a 2.8% year-on…
-
A-share Market Surges, Shanghai Index Breaks 4200 Amidst Tech Gains
The A-share market saw a broad increase, with the Shanghai Composite Index surpassing the 4200-point mark. Key sectors leading the gains included engineering machinery and semiconductors, while precious metals and spiri…
-
Companies Lay Off Staff to Fund AI Investments Amid Rising Costs
Companies are continuing to lay off employees to fund their investments in AI, according to a report. The primary driver for these job cuts is not automation replacing workers, but rather the escalating costs associated…