PulseAugur

Google releases DiffusionGemma, a 4x faster text generation model

Google has released DiffusionGemma, a new 26B-parameter mixture-of-experts (MoE) model that generates text with diffusion instead of autoregressive decoding, reaching speeds up to four times faster than traditional autoregressive models. Like diffusion-based image generation, it processes tokens in parallel rather than one at a time, which speeds up inference and reduces memory requirements, making local execution feasible on consumer hardware such as an RTX 4090 GPU. Its bidirectional attention also gives it self-correction capabilities. However, DiffusionGemma currently lags behind standard Gemma models in quality, so it is positioned as an experimental model for speed-sensitive applications.
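The difference between token-by-token and parallel decoding can be sketched with a toy example. This is not DiffusionGemma's actual algorithm: `toy_model`, `tokens_per_step`, and the random confidence scores are all hypothetical stand-ins, and the step that would revise already-committed tokens (self-correction) is left out. The sketch only counts forward passes under the assumption that a diffusion-style decoder can commit several masked positions per pass.

```python
import random

MASK = "<mask>"

def toy_model(tokens, target, rng):
    """Hypothetical stand-in for one forward pass.

    Returns {position: (proposed_token, confidence)} for every masked position.
    A real model would predict tokens; here we simply read them from `target`.
    """
    return {i: (target[i], rng.random()) for i, t in enumerate(tokens) if t == MASK}

def autoregressive_decode(target):
    """One forward pass per generated token, left to right."""
    tokens, passes = [], 0
    for tok in target:
        tokens.append(tok)
        passes += 1
    return tokens, passes

def diffusion_decode(target, tokens_per_step, seed=0):
    """Start fully masked; each pass commits the most confident proposals."""
    rng = random.Random(seed)
    tokens = [MASK] * len(target)
    passes = 0
    while MASK in tokens:
        proposals = toy_model(tokens, target, rng)
        passes += 1
        # Keep only the `tokens_per_step` positions with the highest confidence.
        best = sorted(proposals, key=lambda i: proposals[i][1], reverse=True)
        for i in best[:tokens_per_step]:
            tokens[i] = proposals[i][0]
    return tokens, passes

target = "diffusion models fill in many tokens per pass".split()
ar_tokens, ar_passes = autoregressive_decode(target)
df_tokens, df_passes = diffusion_decode(target, tokens_per_step=4)
print(ar_passes, df_passes)
```

With eight tokens and four positions committed per pass, the toy decoder needs two passes where the autoregressive loop needs eight. That ratio follows directly from the chosen `tokens_per_step` and is not a measurement of any real model.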

IMPACT Speeds up text generation and makes local LLM deployment more practical, potentially shifting inference away from token-by-token autoregressive decoding.

RANK_REASON Model release from a major frontier lab (Google) with a novel architecture.

Read on 量子位 (QbitAI) →

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. 量子位 (QbitAI) TIER_1 Chinese (ZH) · 一水

    In the shadow of Mythos, Google quietly releases a model with a 4x speed boost

    Generating text with diffusion models