Looking for Suggestions — Single 5090 & 64gb DDR5
A user on the r/LocalLLaMA subreddit is seeking advice on optimizing their hardware setup for running large language models. They have a single NVIDIA RTX 5090 GPU with 64GB of DDR5 RAM and are debating between using Qwen 3.6 27b NVFP4 via vLLM or a 35b a3b model at Q8 on Llama for agentic coding tasks. The user is primarily concerned with effectively utilizing their system's memory for better performance. AI
IMPACT Users are exploring hardware configurations to optimize local LLM performance for specific tasks like agentic coding.