Brief · PulseAugur

MEME · r/LocalLLaMA English(EN) · 5h

Buy recommendations on a thight Budget to aid my RX 6800

A user on the r/LocalLLaMA subreddit is seeking advice on purchasing hardware for running large language models on a limited budget. They are considering either a Radeon VII with 32GB VRAM or two P100 GPUs offering a combined 48GB VRAM, both at a similar price point. The user is weighing the trade-offs between more VRAM and faster inference speeds, specifically asking about the utility of higher VRAM for Mixture-of-Experts (MoE) models at Q8 quantization and seeking recommendations for other suitable MoE models. AI

Qwen
Gemma
Radeon VII
MoE Models
RX 6800