Google DeepMind releases DiffusionGemma for faster local text generation

By PulseAugur Editorial · [1 source] · 2026-06-10 16:15

Google DeepMind has released DiffusionGemma, an experimental open model designed for fast text generation. Unlike conventional models that produce text one token at a time, DiffusionGemma generates multiple tokens in parallel, which speeds up output. NVIDIA has optimized the model to run efficiently on its hardware, including GeForce RTX GPUs, the RTX PRO platform, and DGX Spark systems, enabling faster local AI applications.

IMPACT Enables faster local AI applications and interactive agentic workflows by improving text generation latency.

RANK_REASON Google DeepMind released a new experimental model, DiffusionGemma, with details on its architecture and performance.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

NVIDIA Blog TIER_1 English(EN) · Michael Fukuyama · 2026-06-10 16:15

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

Today, Google DeepMind released DiffusionGemma — an experimental open model built for exceptionally fast text generation. NVIDIA has optimized DiffusionGemma to run even faster across NVIDIA GeForce RTX GPUs, the NVIDIA RTX PRO platform and NVIDIA DGX Spark systems, from local PC…
