Users on the r/LocalLLaMA subreddit are expressing a strong need for new large language models (LLMs) in the 80-160 billion parameter range. Current models are either too small for users with high-capacity but slower unified memory systems (like Apple devices or AMD Ryzen AI 395) or too large for those with limited VRAM. The community is requesting models that can effectively utilize systems with 80-128GB of RAM or 64GB of VRAM, as existing options are either outdated or poorly suited for their hardware configurations. AI
IMPACT The demand highlights a gap in the market for LLMs optimized for high-capacity, lower-bandwidth memory systems, potentially influencing future model development priorities.
RANK_REASON User discussion and requests for specific model sizes and capabilities, not a direct release or announcement.
- AMD 9700 AI Pro
- Deepseek V4 Pro
- DGX Spark
- Gemma
- Gemma 4 26B
- Glm 4.5 Air
- GPT OSS 120B
- Kimi 2.7
- MiniMax M3
- Nemotron 3 Super 120B
- Qwen
- Qwen 3.5 122B
- Qwen 3.6 35B
- Qwen 3 Coder Next 80B
- Rtx 3090
- RTX 6000 Pros
- unified memory
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →