PulseAugur
EN
LIVE 04:42:36

LLaMA users debate Qwen 3.6 27B vs 35B for coding

A user on the r/LocalLLaMA subreddit is seeking advice on optimizing their use of the Qwen 3.6 large language model. They are comparing the 27B and 35B parameter versions, specifically inquiring about the best quantization methods for coding tasks. The discussion includes options like Q4KM with full KV quantization versus Q6K with Q8_0 KV quantization, with one user suggesting the 35B Q8_0 version is superior. AI

IMPACT Users are discussing optimal configurations for local LLM deployment, impacting performance and accessibility for developers.

RANK_REASON User discussion about model versions and quantization, not a new release or benchmark.

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/siegevjorn ·

    Qwen 3.6 coding choice–27B vs 35B quants

    <!-- SC_OFF --><div class="md"><p>I've been using Qwen 3.6 35BA3B for a while in Q8_0 quant, KV Q8_0 as well. I'm trying to explore Qwen 2.6 27B. Any tips on which quant to use?</p> <p>Context size is 262144</p> <ol> <li><p>Q4KM with full KV quant (fp16)</p></li> <li><p>Q6K with …