Gemma 4 31B QAT version excels with long context

By PulseAugur Editorial · [1 sources] · 2026-06-06 05:11

A user on r/LocalLLaMA compared three versions of the Gemma 4 31B model: the standard UD version, a "heretic" version, and a QAT version. The standard version struggled with long contexts and complex tool chains, while the "heretic" version was more error-prone. The QAT version, however, handled 32k context with full reasoning effectively and performed all tasks correctly. AI

IMPACT The QAT version of Gemma 4 31B demonstrates improved performance with long contexts, suggesting potential for more robust local LLM deployments.

RANK_REASON User comparison of different model quantizations and versions. [lever_c_demoted from research: ic=1 ai=1.0]

Read on r/LocalLLaMA →

model release

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

r/LocalLLaMA TIER_1 English(EN) · /u/Some-Cauliflower4902 · 2026-06-06 05:11

A quick Gemma4 31B comparison (Q4_k_M, QAT, heretic)

<div class="md"><p>No numbers. Not sure if anybody cares…</p> <p>I’ve run the UD version of Q4_k_m for a month. I talk to this model nicely, because it’s a functional nervous wreck. And initially I thought that might be an alignment thing, so I also have the hereti…

COVERAGE [1]

A quick Gemma4 31B comparison (Q4_k_M, QAT, heretic)

RELATED ENTITIES

RELATED TOPICS