PulseAugur

User seeks advice on dual GPU VRAM upgrade for LLMs amid PCIe concerns

A user on Reddit's r/LocalLLaMA subreddit is asking for advice on adding a second AMD 7900XTX to their system to increase the VRAM available for local large language model (LLM) inference. Their main concern is a performance bottleneck from the motherboard's PCIe lane configuration: the secondary GPU would sit in a PCIe 2.0 slot, even though the CPU supports PCIe 4.0. The user is weighing the cost of upgrading to a PCIe 4.0-compatible motherboard against the benefit, and also asks how well tensor parallelism works with these GPUs and whether PCIe 2.0 would significantly hurt layer splitting.

IMPACT Users considering multi-GPU setups for local LLM inference need to carefully evaluate PCIe bandwidth limitations.
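As a rough aid for the bandwidth question above, the following Python sketch computes theoretical one-direction PCIe throughput from the per-lane transfer rate and line encoding of each generation. The lane counts and the payload size are hypothetical values chosen for illustration; the original post does not state slot widths or which model is being run, and real-world throughput is lower than these ceilings.

```python
# Theoretical per-direction PCIe throughput, ignoring packet and protocol overhead.
# Each entry: (transfer rate in GT/s per lane, line-encoding efficiency).
PCIE_GENERATIONS = {
    "2.0": (5.0, 8 / 10),      # 8b/10b encoding
    "3.0": (8.0, 128 / 130),   # 128b/130b encoding
    "4.0": (16.0, 128 / 130),  # 128b/130b encoding
}


def pcie_bandwidth_gb_s(generation: str, lanes: int) -> float:
    """Return theoretical one-direction bandwidth in GB/s (1 GB = 1e9 bytes)."""
    gt_per_s, efficiency = PCIE_GENERATIONS[generation]
    return gt_per_s * efficiency * lanes / 8  # divide by 8: bits to bytes


def transfer_time_us(payload_bytes: int, generation: str, lanes: int) -> float:
    """Return the ideal time in microseconds to move payload_bytes over the link."""
    bandwidth_bytes_per_s = pcie_bandwidth_gb_s(generation, lanes) * 1e9
    return payload_bytes / bandwidth_bytes_per_s * 1e6


if __name__ == "__main__":
    # Hypothetical payload: one fp16 activation vector of 8192 values (assumption).
    payload = 8192 * 2
    # Hypothetical slot configurations (assumptions, not taken from the post).
    for generation, lanes in [("2.0", 4), ("3.0", 16), ("4.0", 16)]:
        bandwidth = pcie_bandwidth_gb_s(generation, lanes)
        micros = transfer_time_us(payload, generation, lanes)
        print(f"PCIe {generation} x{lanes}: {bandwidth:.2f} GB/s, {micros:.2f} us per transfer")
```

Under these assumptions the ceiling differs by a large factor between the slowest and fastest configuration. How much that matters depends on the split strategy: layer splitting generally passes only boundary activations between GPUs, while tensor parallelism synchronizes partial results far more often, so it tends to be more sensitive to link speed. Model loading time is also affected by the slower link.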

RANK_REASON User query about hardware configuration for AI inference.


AI-generated summary · Google Gemini · from 1 source.


COVERAGE [1]

  1. r/LocalLLaMA · TIER_1 · English (EN) · /u/itch-

    I want to add a second 7900XTX, question about pcie2/3/4

    I've got a 7900XTX in my old gaming PC, now I want more VRAM and if I stay on one GPU I can only reasonably get 32GB and that just doesn't sound good enough. Using two slots, 48GB sounds way better and is much cheaper. I think 48GB is the minimum…