Google DeepMind releases Gemma 4 multimodal open-weight models

By PulseAugur Editorial · [2 sources] · 2026-06-03 15:47

Google DeepMind has released Gemma 4, a new family of open-weight multimodal models. The models accept text and image inputs, and some variants also handle audio and video. They offer a context window of up to 256K tokens and come in several sizes and in both Dense and Mixture-of-Experts architectures, making them deployable across a range of devices.

IMPACT Expands access to multimodal AI capabilities, enabling new applications in text, image, and audio processing across various devices.

RANK_REASON This is a release of new open-weight models from a frontier lab (Google DeepMind).

AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

r/LocalLLaMA TIER_1 Nederlands(NL) · /u/jacek2023 · 2026-06-03 15:57

google/gemma-4-12B · Hugging Face

https://www.reddit.com/r/LocalLLaMA/comments/1tvtn6m/googlegemma412b_hugging_face/

r/LocalLLaMA TIER_1 Dansk(DA) · /u/jacek2023 · 2026-06-03 15:47

ggml-org/gemma-4-12b-it-GGUF · Hugging Face

https://www.reddit.com/r/LocalLLaMA/comments/1tvtc54/ggmlorggemma412bitgguf_hugging_face/
