English(EN) Why is the MLX version of the Gemma 4 QAT so big??

Gemma 4 QAT MLX 模型大小让本地 LLM 用户感到困惑

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-08 16:26

一位 Reddit r/LocalLLaMA 子版块的用户正在询问 Gemma 4 QAT 的 MLX 版本异常大的文件大小。他们注意到该版本约为 27GB，远大于非 QAT 版本（17GB）和常规 4 位 MLX 版本（也为 17GB）。用户正在寻求对文件大小差异的解释。 AI

排序理由用户生成的技术细节问题，关于特定模型版本，缺乏更广泛的行业影响或官方公告。

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/LocalLLaMA TIER_1 English(EN) · /u/mjsxi__ · 2026-06-08 16:26

为什么 MLX 版本的 Gemma 4 QAT 如此之大？

<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1u0cu9e/why_is_the_mlx_version_of_the_gemma_4_qat_so_big/"> <img alt="Why is the MLX version of the Gemma 4 QAT so big??" src="https://preview.redd.it/4pgsjbpc436h1.png?width=640&crop=smart&auto=webp&a…