graphics processing unit
PulseAugur coverage of graphics processing unit — every cluster mentioning graphics processing unit across labs, papers, and developer communities, ranked by signal.
- used by Vulkan 90%
- used by central processing unit 70%
- competes with Tensor Processing Unit 70%
- competes with application-specific integrated circuit 70%
- competes with Cerebras Systems 70%
- used by H.1000 Gnome 70%
- used by SemiAnalysis 70%
- uses data processing unit 70%
- used by Fp8 70%
- used by Dohuk Polytechnic University 70%
- competes with central processing unit 50%
- uses central processing unit 50%
29 day(s) with sentiment data
-
Kubernetes GPU Node Setup Crucial for LLM Deployment
This article details the complex process of preparing GPU nodes for large language models (LLMs) within a Kubernetes environment. It emphasizes that simply adding GPUs to a node is insufficient, as Kubernetes needs spec…
-
iPhone LLM benchmark: Neural Engine beats GPU in sustained performance
On-device LLM performance on the iPhone 17 Pro reveals that while GPUs offer superior initial generation speeds, they quickly overheat and throttle. Apple's Neural Engine, though slower to start, maintains a more consis…
-
4-8 GPUs sufficient for most AI inference, Leaseweb advises
For most AI inference workloads, 4 to 8 dedicated GPUs are sufficient, offering better performance and cost-effectiveness than over-provisioned cloud resources. This setup is ideal for AI-based search platforms and medi…
-
Modular data centers cut costs and timelines for AI infrastructure
Modular data center construction offers significant cost and timeline advantages over traditional methods, with costs ranging from $4.5-6.5M per MW compared to $11.3M for traditional builds. The most substantial benefit…
-
Cooler Master releases GPU accessory to improve PC cooling
Cooler Master has released a new accessory designed to improve PC cooling by redirecting GPU heat away from the CPU. This device attaches to the graphics card and, according to the company, can lower temperatures by 4-6…
-
Data center hardware obsolescence may create a used market for consumers
The rapid obsolescence of high-end GPUs and RAM in data centers, with a typical lifespan of 3-4 years, may create a future consumer market for slightly older, but still powerful, hardware. This could offer a more afford…
-
UpCloud offers cost-effective Nvidia GPUs for self-hosted AI models
UpCloud is offering a viable and cost-effective solution for individuals and businesses looking to run their own AI models on rented hardware. The service provides Nvidia GPUs, which are particularly beneficial for batc…
-
New LipFit package enables GPU-accelerated data approximation with constraints
Researchers have developed a new method for multivariate scattered data interpolation and approximation that ensures Lipschitz continuity and can enforce monotonicity constraints. This approach, which does not require a…
-
Co-packaged optics emerge as key solution for AI data center GPU interconnects
The increasing demand for AI data centers, driven by large language models and AI agents, has created a significant bottleneck in communication links between GPUs. This bottleneck, where GPUs spend more time waiting for…
-
Digital workers enable 24/7 operations, reshaping jobs and companies
Digital workers, powered by AI and automation, are beginning to operate around the clock, fundamentally altering traditional work structures and company productivity. This shift introduces the concept of non-stop operat…
-
Vatican may acquire advanced GPUs for AI and data processing
The Vatican may be acquiring advanced GPUs, aligning with a global trend of institutions leveraging powerful hardware for data processing and artificial intelligence. While the specific technological needs of the Vatica…
-
LLM inference speed bottlenecked by GPU memory bandwidth, not compute
This article explains that the primary bottleneck for LLM inference in production is often the model's raw speed on the GPU, rather than serving logic or network overhead. It details how LLM inference, particularly duri…
-
Reddit user enjoys listening to GPU's working sounds
A Reddit user on the r/StableDiffusion subreddit shared a peculiar habit of enjoying the sounds their GPU makes while working. The user described the noise as a blend of 1980s cassette-loading software and electronic mu…
-
AI Supply Chain Vulnerable to Six Critical Mineral Chokepoints
A detailed analysis highlights six critical chokepoints in the AI supply chain, focusing on the minerals and components essential for GPUs, HBM chips, and data center cooling systems. China's dominant role in processing…
-
Hyperscalers diversify server hardware with new GPU, XPU, and CPU chips
Hyperscalers are introducing a wider array of specialized chips, including GPUs, XPUs, and CPUs, which is driving innovation in server rack and board designs. This diversification aims to cater to a broader spectrum of …
-
AgentSwarms launches interactive LLM-GPU matching tool
AgentSwarms has launched an interactive blog post designed to help users match open-source LLMs with appropriate GPUs. The tool allows users to select model size and quantization levels, with the interface calculating a…
-
Perpetual futures markets emerge for GPU compute and memory hedging
A new type of financial market, known as perpetual futures, is emerging to allow businesses to hedge against price volatility in crucial inputs like GPU compute and memory. These markets, which trade continuously and ne…
-
AI inference verification achieved with bit-exact precision
Researchers have developed a method to verify AI inference results with bit-exact precision, overcoming the challenge posed by non-deterministic GPU arithmetic. Their approach analyzes accumulated rounding errors as an …
-
SPARROW platform uses AI and solar for remote biodiversity monitoring
Researchers have developed SPARROW, an open-source platform that uses solar power, edge AI, and satellite communication for continuous biodiversity monitoring in remote areas. The system integrates low-power GPUs with v…
-
Tsinghua AIR releases UniLab for 10x faster robot training
Researchers from Tsinghua University's AIR DISCOVER Lab have introduced UniLab, an open-source framework for robot reinforcement learning training. This new architecture utilizes a heterogeneous approach, offloading phy…