NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
Google DeepMind has released DiffusionGemma, an experimental open-source model designed for rapid text generation. Unlike traditional models that produce text token by token, DiffusionGemma generates multiple tokens in parallel, significantly speeding up output. NVIDIA has optimized this model to run efficiently on its GPUs, including GeForce RTX, RTX PRO, and DGX Spark systems, enabling faster local AI applications. AI
IMPACT Enables faster local AI applications and interactive agentic workflows by improving text generation latency.