PCI Express
PulseAugur coverage of PCI Express — every cluster mentioning PCI Express across labs, papers, and developer communities, ranked by signal.
7 day(s) with sentiment data
-
Rosewill M.2 SSD Cloner and Eraser Hits Record Low Price of $47
Rosewill's M.2 SSD Cloner and Eraser is currently available at its lowest price of $47, offering a convenient solution for IT professionals and home users alike. This device supports cloning and erasing NVMe drives both…
-
Meta pauses employee-tracking AI program after internal data leak
Meta has temporarily halted an internal AI training program that tracked employee computer activity, including mouse movements and keystrokes, following a significant data leak. The program, known as the Model Compatibi…
-
AI Sector Poised for Growth: Computing Power, PCIe, and Drug Discovery Lead the Charge
Multiple reports from 36Kr highlight a bullish outlook on the AI sector, particularly focusing on the demand for computing power and related infrastructure. Analysts from CITIC Securities and CITIC Construction Investme…
-
Chinese firm creates compact V100 GPU for AI
A Chinese company called "GPU god" has developed a single-slot, half-height PCIe version of the NVIDIA V100 GPU. This custom-designed card retains the full performance of the V100 core and is intended for passive coolin…
-
User asks about dual-GPU performance for local LLMs
A user on Reddit's r/LocalLLaMA subreddit is seeking advice on optimizing hardware for running large language models locally. They are currently able to run a 16 billion parameter model with Q4 quantization on a single …
-
AMD B650 chipset cards add M.2 slots and USB ports to PCs
New expansion cards featuring AMD's Promontory 21 chipset are now available, offering users the ability to add significant I/O capabilities to their PCs. These cards can provide up to four M.2 slots, multiple USB ports,…
-
User doubles LLM inference speed by fixing PCIe slot bottleneck
A user building a multi-GPU setup for local LLM inference discovered a significant performance bottleneck caused by a misconfigured PCIe slot. One of the four RTX 3090 GPUs was incorrectly placed in a slot that only sup…
-
Broadcom, FuriosaAI partner on Ethernet AI inference platform
Broadcom and FuriosaAI have partnered to develop a rack-scale inference platform that aims to move AI infrastructure away from GPU-centric designs. This collaboration integrates FuriosaAI's processor architecture with B…
-
MacBook Air gets desktop GPU via Linux VM for AI tasks
A recent project explored connecting a high-end NVIDIA RTX 5090 GPU to an M4 MacBook Air via a Thunderbolt eGPU setup. While macOS lacks native drivers for NVIDIA GPUs on Apple Silicon, the author successfully passed th…
-
Modded Nvidia V100 server GPU runs LLMs efficiently for $200
A YouTuber successfully adapted an Nvidia Tesla V100 server GPU, originally designed for specialized sockets, into a standard PCIe card for consumer motherboards. This modification, costing around $200, allows the older…
-
Proprietary GPU to PCIe adapter enables cheaper local LLMs
A recent Hackaday article details a method for integrating proprietary-bus GPUs into standard PCIe slots, making them usable for local LLM deployment. This approach offers a more budget-friendly option for individuals i…
-
RoundPipe enables efficient LLM fine-tuning on consumer GPUs
Researchers have developed RoundPipe, a new pipeline scheduling method designed to make fine-tuning large language models on consumer-grade GPUs more efficient. This approach addresses the limitations of existing method…
-
InnoGrit's Wu Zining discusses AI SSDs transforming idle compute into effective power.
In the AI era, storage is shifting from merely holding data to actively influencing computational speed. Yingren Technology's Chairman Wu Zining highlights that AI SSDs are crucial for transforming idle computing power …
-
New architectures and frameworks target LLM serving bottlenecks for long contexts
Researchers have developed novel architectures and techniques to address the escalating latency and energy consumption challenges in serving large language models (LLMs) with long contexts. One approach, AMMA, proposes …