Buy recommendations on a thight Budget to aid my RX 6800
A user on the r/LocalLLaMA subreddit is seeking advice on purchasing hardware for running large language models on a limited budget. They are considering either a Radeon VII with 32GB VRAM or two P100 GPUs offering a combined 48GB VRAM, both at a similar price point. The user is weighing the trade-offs between more VRAM and faster inference speeds, specifically asking about the utility of higher VRAM for Mixture-of-Experts (MoE) models at Q8 quantization and seeking recommendations for other suitable MoE models. AI