User seeks largest AI model for 64GB VRAM distillation

By PulseAugur Editorial · [1 source] · 2026-06-27 19:48

A user on the r/LocalLLaMA subreddit is looking for the largest capable model that fits within 64 GB of VRAM to use for distillation, preferably a reasoning model. They report that a model of roughly 72 billion parameters fits on their dual R9700 setup, and they prioritize memory capacity over speed, saying they are happy with about 12 tokens per second.
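To make the sizing question concrete, here is a rough back-of-the-envelope sketch of how much memory the weights of a 72-billion-parameter model would need at different precisions. The post does not say which precision or quantization the user runs, so the bit widths below are assumptions, as are the function names and the 15% headroom reserved for the KV cache and runtime overhead. The estimate covers weights only and uses decimal gigabytes (1 GB = 1e9 bytes).

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in decimal GB (1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


def max_params_billions(vram_gb: float, bits_per_weight: float,
                        headroom: float = 0.85) -> float:
    """Rough upper bound on parameter count (in billions) for a VRAM budget.

    `headroom` is an assumed fraction of VRAM left for weights after
    reserving space for the KV cache, activations, and runtime overhead.
    """
    usable_bytes = vram_gb * 1e9 * headroom
    return usable_bytes * 8 / bits_per_weight / 1e9


if __name__ == "__main__":
    # Assumed precisions: 16-bit (unquantized), 8-bit, and 4-bit quantization.
    for bits in (16, 8, 4):
        gb = weight_memory_gb(72e9, bits)
        verdict = "fits" if gb <= 64 else "does not fit"
        print(f"72B at {bits}-bit: {gb:.0f} GB of weights, {verdict} in 64 GB")
```

Under these assumptions, a 72B model only fits in 64 GB when quantized well below 8 bits per weight, which suggests the user is referring to a quantized build. Real quantization formats carry extra per-block metadata, so actual file sizes tend to run somewhat higher than this estimate.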


AI-generated summary · Google Gemini · from 1 source.


COVERAGE [1]

r/LocalLLaMA TIER_1 English(EN) · /u/AppropriatePush6262 · 2026-06-27 19:48

Biggest model that is capable which can fit under 64 gb vram for the purpose of distillation

hi all, I have 64 gb VRAM, and I am looking for biggest model that I can use to distill prefer a reasoning model.

even with 12 tokens per second I am happy, a 72 b model can fit in my machine, I have dual r9700, dont have speed but got the…

