Ollama 0.31.1 boosts Gemma 4 speed on Apple Silicon

By PulseAugur Editorial · [1 sources] · 2026-07-01 00:36

Ollama has released version 0.31.1, which significantly improves the performance of Gemma 4 on Apple Silicon. The update leverages multi-token prediction (MTP) to achieve nearly 90% faster token generation on average, particularly noted in a coding-agent benchmark. This optimization aims to enhance the user experience for running AI models locally. AI

IMPACT This update enhances the local execution speed of AI models on Apple hardware, potentially improving developer workflows and accessibility.

RANK_REASON Software release for an AI model runner, not a frontier model release.

Read on Mastodon — fosstodon.org →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Ollama 0.31.1 boosts Gemma 4 speed on Apple Silicon

COVERAGE [1]

Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-07-01 00:36

⚙️ New Ollama Release! ⚙️ Version: v0.31.1 Release Notes: ## Faster Gemma 4 on Apple Silicon <img width="1037" height="485" alt="Screenshot 2026-06-30 at 5 25 2

⚙️ New Ollama Release! ⚙️ Version: v0.31.1 Release Notes: ## Faster Gemma 4 on Apple Silicon <img width="1037" height="485" alt="Screenshot 2026-06-30 at 5 25 29 PM" src=" https:// github.com/user-attachments/as sets/547d5076-090f-43c4-a661-938e11abc955 " /> Gemma 4 is now signif…

COVERAGE [1]

⚙️ New Ollama Release! ⚙️ Version: v0.31.1 Release Notes: ## Faster Gemma 4 on Apple Silicon <img width="1037" height="485" alt="Screenshot 2026-06-30 at 5 25 2

RELATED ENTITIES

RELATED TOPICS