Google DeepMind has fully released Gemma 3n, a mobile-first multimodal model designed for on-device applications. The new architecture accepts image, audio, video, and text inputs and produces text outputs. It is optimized for efficiency: the two versions, E2B and E4B, have effective parameter counts of 2B and 4B and run with a memory footprint comparable to traditional 2B and 4B models. Gemma 3n introduces novel components such as MatFormer for flexibility and Per Layer Embeddings for memory efficiency, and it performs strongly in multilinguality, math, coding, and reasoning, with the E4B version scoring above 1300 on LMArena. The model is available through popular developer tools and integrates with the growing Gemmaverse ecosystem.
Ranking rationale: Google DeepMind released Gemma 3n, a new multimodal model with a novel architecture and strong benchmark performance for on-device applications.
- Gemini 2.0
- Gemma
- Gemma 3n
- Gemmaverse
- Google AI Edge
- Google DeepMind
- Hugging Face Transformers
- Institute of Science Tokyo
- LAuReL
- llama.cpp
- MatFormer
- MobileNet-v5
- Ollama
- Per Layer Embeddings
- Roboflow
- ShieldGemma 2
- MLX
AI-generated summary · Google Gemini · from 4 sources.