English(EN) What is your experience between Qwen3.6 27B at IQ3 and 35B-A3B at Q4?

LLaMA 用户讨论 Qwen3.6 27B 与 35B-A3B 量化质量

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-04 13:11

r/LocalLLaMA 子版块的用户正在讨论他们对 Qwen3.6 模型不同量化版本的体验。具体来说，他们正在将 27B 参数模型的 IQ3 量化与 35B-A3B 变体的 Q4 量化进行比较。对话侧重于哪个版本在特定用例（尤其是在智能体应用中）提供了更好的能力，而不是原始生成速度。 AI

影响用户正在评估模型大小与量化级别之间在本地部署方面的权衡，这会影响实际 AI 应用的性能。

排序理由用户讨论模型量化质量，而非主要发布或重大行业事件。

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/LocalLLaMA TIER_1 English(EN) · /u/CodProfessional3712 · 2026-06-04 13:11

Qwen3.6 27B 在 IQ3 和 35B-A3B 在 Q4 之间的体验如何？

<div class="md"><p>If you’ve had the opportunity to compare these two together with your own benchmarks and use cases, which would you say edges out in capability (not raw throughput in token generation speed)? Asking because I know the quality generally drops shar…