PulseAugur
EN
LIVE 14:34:27
日本語(JA) ノートPCで動くGoogle製「Gemma 4 12B」がエンコーダー不要で画像&音声を処理する仕組みとは? – GIGAZINE https://www. yayafa.com/2815917/ # AgenticAi # AI # ArtificialGeneralIntelligence # Artificial

Google's Gemma 4 12B processes images and audio without encoders

Google has released Gemma 4 12B, a lightweight, multimodal AI model designed to run on consumer hardware with as little as 16GB of VRAM. This model uniquely processes images and audio without traditional encoders, reducing memory usage and latency. For images, it uses a 35 million parameter embedding module to convert pixel data into a format usable by the LLM, while audio is processed by tokenizing 40-millisecond segments directly. AI

IMPACT Enables more efficient multimodal AI processing on consumer hardware, potentially lowering barriers to entry for complex AI applications.

RANK_REASON New model release from a frontier lab (Google DeepMind) with technical details provided. [lever_c_demoted from frontier_release: ic=2 ai=1.0]

Read on Mastodon — fosstodon.org →

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

Google's Gemma 4 12B processes images and audio without encoders

COVERAGE [3]

  1. Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] ·

    Anthropic warns of the risks of "AI creating AI" self-improvement loops, discusses the possibility of AI itself accelerating AI development – GIGAZINE https://www.yayafa.com/2815919/ # AgenticAi # AI # Anthropic # AnthropicClaude # Artificial

    Anthropicが「AIがAIを作る」自己改善ループのリスクを警告、AI開発をAI自身が加速する可能性を論じる – GIGAZINE https://www. yayafa.com/2815919/ # AgenticAi # AI # Anthropic # AnthropicClaude # ArtificialGeneralIntelligence # ArtificialIntelligence # claude # エージェント型AI # 人工知能 # 汎用人工知能

  2. Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] ·

    How does Google's "Gemma 4 12B" run on a laptop and process images & audio without an encoder? – GIGAZINE https://www.yayafa.com/2815917/ # AgenticAi # AI # ArtificialGeneralIntelligence # Artificial

    ノートPCで動くGoogle製「Gemma 4 12B」がエンコーダー不要で画像&音声を処理する仕組みとは? – GIGAZINE https://www. yayafa.com/2815917/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligence # DeepMind # Gemini # Google # GoogleAI # GoogleDeepMind # GoogleGemini # エージェント型AI # 人工知能 # 汎用人工知能

  3. Mastodon — mastodon.social TIER_1 日本語(JA) · [email protected] ·

    How does Google's "Gemma 4 12B" that runs on a laptop process images and audio without an encoder? https://fed.brid.gy/r/https://gigazine.net/news/20260604-gemma-4-12b-encoder-free/

    ノートPCで動くGoogle製「Gemma 4 12B」がエンコーダー不要で画像&音声を処理する仕組みとは? https:// fed.brid.gy/r/https://gigazine .net/news/20260604-gemma-4-12b-encoder-free/