PulseAugur

Gemma 4 QAT MLX model size puzzles local LLM users

A user on the r/LocalLLaMA subreddit is asking why the MLX version of the Gemma 4 QAT model is so large. They report that this version is approximately 27GB, roughly 10GB more than both the non-QAT version (17GB) and the regular 4-bit MLX version (also 17GB), and they are looking for an explanation of the gap.
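
To put the reported sizes in perspective, the short Python sketch below computes the size ratio and the average bits per weight that ratio would imply. Only the 27GB and 17GB figures come from the post. The 4-bit reference value, the function names, and the simplification that both files hold the same parameters with weights dominating the file size are assumptions made for illustration. The sketch does not explain the cause of the difference.

```python
def size_ratio(size_a_gb: float, size_b_gb: float) -> float:
    """Return how many times larger file A is than file B."""
    return size_a_gb / size_b_gb


def implied_bits_per_weight(size_gb: float, ref_size_gb: float, ref_bits: float) -> float:
    """Scale a reference bits-per-weight figure by the file-size ratio.

    Assumption: both files store the same number of parameters and the
    weights dominate the file size, so size scales with bits per weight.
    """
    return ref_bits * size_ratio(size_gb, ref_size_gb)


QAT_MLX_GB = 27.0        # reported in the post
REGULAR_MLX_GB = 17.0    # reported in the post
ASSUMED_REF_BITS = 4.0   # assumption: the 17GB file averages about 4 bits per weight

ratio = size_ratio(QAT_MLX_GB, REGULAR_MLX_GB)
bits = implied_bits_per_weight(QAT_MLX_GB, REGULAR_MLX_GB, ASSUMED_REF_BITS)

print(f"size ratio: {ratio:.2f}x")
print(f"implied average bits per weight: {bits:.2f}")
```

Under these assumptions the 27GB file is about 1.59 times the size of the 17GB one, which would correspond to noticeably more than 4 bits per weight on average. The real figure depends on details the post does not give, such as the parameter count and how each tensor is stored.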

RANK_REASON User-generated question about a technical detail of a specific model version, lacking broader industry impact or official announcement.


AI-generated summary · Google Gemini · from 1 source.


COVERAGE [1]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/mjsxi__

    Why is the MLX version of the Gemma 4 QAT so big??

    https://www.reddit.com/r/LocalLLaMA/comments/1u0cu9e/why_is_the_mlx_version_of_the_gemma_4_qat_so_big/