Brief · PulseAugur

TOOL · r/MachineLearning · English (EN) · 3h

Gemma 4 12B local setup thread — what's your hardware, quant, and use case? [D]

Google's Gemma 4 12B multimodal model is now available, and the community has quickly released quantized versions for running it locally. A thread on r/MachineLearning is collecting user reports on hardware, quantization method, and throughput in tokens per second. Contributors list their chip, RAM, GPU, runtime, and use case, with the aim of establishing the model's practical performance floor on consumer hardware.

IMPACT Community-driven data collection will help users assess Gemma 4 12B's viability on local hardware.
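
For readers comparing setups in the thread, a rough back-of-the-envelope sketch of the two numbers being collected may help. It assumes a nominal 12 billion parameters (read off the "12B" name, not a measured count) and illustrative bit widths of 16, 8, and 4 bits per weight. It counts weights only, so the KV cache, activations, and runtime overhead come on top, and real quantized files usually carry extra bits for scales and metadata. The function names are hypothetical and do not belong to any runtime's API.

```python
# Nominal parameter count implied by the "12B" name (an assumption, not a measured figure).
N_PARAMS = 12e9


def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GiB (excludes KV cache and overhead)."""
    return n_params * bits_per_weight / 8 / 2**30


def tokens_per_second(n_tokens: int, elapsed_seconds: float) -> float:
    """Throughput as generated tokens divided by wall-clock seconds."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return n_tokens / elapsed_seconds


if __name__ == "__main__":
    # Illustrative bit widths only; actual quant formats differ in effective bits per weight.
    for bits in (16, 8, 4):
        print(f"{bits:>2} bits/weight: ~{weight_memory_gib(N_PARAMS, bits):.1f} GiB")
```

When reporting tokens per second, it is worth stating whether the figure covers prompt processing or generation, since the two can differ substantially on the same hardware.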

Google
transformers
llama.cpp
ollama
GGUF
Apache 2.0
vllm
lm studio
r/MachineLearning
MLX
Gemma 4 12B
mlx-lm