Gemma 4 QAT MLX model size puzzles local LLM users

By PulseAugur Editorial · [1 sources] · 2026-06-08 16:26

A user on the r/LocalLLaMA subreddit is inquiring about the unusually large file size of the MLX version of the Gemma 4 QAT model. They noted that this version is approximately 27GB, significantly larger than the non-QAT version (17GB) and the regular 4-bit MLX version (also 17GB). The user is seeking an explanation for this discrepancy in file size. AI

RANK_REASON User-generated question about a technical detail of a specific model version, lacking broader industry impact or official announcement.

Read on r/LocalLLaMA →

other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Gemma 4 QAT MLX model size puzzles local LLM users

COVERAGE [1]

r/LocalLLaMA TIER_1 English(EN) · /u/mjsxi__ · 2026-06-08 16:26

Why is the MLX version of the Gemma 4 QAT so big??

<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1u0cu9e/why_is_the_mlx_version_of_the_gemma_4_qat_so_big/"> <img alt="Why is the MLX version of the Gemma 4 QAT so big??" src="https://preview.redd.it/4pgsjbpc436h1.png?width=640&crop=smart&auto=webp&a…

COVERAGE [1]

Why is the MLX version of the Gemma 4 QAT so big??

RELATED ENTITIES

RELATED TOPICS