google/diffusiongemma-26B-A4B-it
Google has released DiffusionGemma, an open-weight model based on its Gemini architecture, designed for multimodal tasks. This model, available under an Apache 2.0 license, can process both text and images, enabling it to generate descriptions or respond to image-based queries. It is accessible through various platforms, including NVIDIA's NIM cloud API and Hugging Face, with integrations for popular tools like llama.cpp and vLLM. AI
IMPACT Accelerates multimodal AI development with an accessible, open-weight model for diverse applications.