PulseAugur
EN
LIVE 18:34:32

Local LLM user weighs RTX 30 series vs P40 GPUs for VRAM

A user on the r/LocalLLaMA subreddit is seeking advice on whether to trade their RTX 3080 and RTX 3070 GPUs for two NVIDIA P40 cards. The primary concern is optimizing performance for running local large language models, particularly for models exceeding 10GB VRAM, where the P40's 24GB capacity might offer an advantage despite potential speed differences with smaller models. The user also notes potential issues with mixing GPU architectures and is looking for guidance on the best hardware configuration for their specific use case, which includes running a Hermes agent and testing new large models. AI

RANK_REASON This is a user query on a consumer hardware subreddit about optimizing for local LLM inference, not a significant industry event or release.

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/Macestudios32 ·

    Should I exchange my rtx 30 for P40?

    <!-- SC_OFF --><div class="md"><p>Hello everyone,</p> <p>The question is simple. With a limited budget.</p> <p>Would it be worth exchanging a RTX3080(10gb) and a RTX3070(8gb) for two p40(24gb) GPUs?</p> <p>I imagine that in models of less than 10gb the 3080 would be faster, but f…