club-3090 adds FP8 support for Qwen3.6-27B model

By PulseAugur Editorial · [1 sources] · 2026-06-07 22:07

The club-3090 project has introduced experimental FP8 quantization support for the Qwen3.6-27B model. This new feature is particularly relevant for users operating dual RTX 3090 graphics card setups. The performance of the FP8 quantized model is reported to be nearly identical to the original unquantized BF16 version. AI

IMPACT Enables more efficient local inference for a specific large language model on consumer hardware.

RANK_REASON This is a release of an optimized version of an existing open-source model, not a new frontier model release. [lever_c_demoted from research: ic=1 ai=0.7]

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

r/LocalLLaMA TIER_1 English(EN) · /u/xspider2000 · 2026-06-07 22:07

club-3090 adds experimental FP8 support for Qwen3.6-27B!

<div class="md">It’s finally here! Something many of us running dual RTX 3090 rigs have been anticipating. club-3090 has rolled out experimental support for Qwen3.6-27B with FP8 quantization. The official Qwen/Qwen3.6-27B…

COVERAGE [1]

club-3090 adds experimental FP8 support for Qwen3.6-27B!

RELATED ENTITIES

RELATED TOPICS