Ollama releases v0.30.8 with improved caching and stability

By PulseAugur Editorial · [1 source] · 2026-06-12 20:37

Ollama has released version 0.30.8 with several fixes and improvements. The release fixes a bug in which `ollama launch` selected the wrong provider in some cases, and it decouples prompt caching from context shift to improve KV cache reuse. It also makes MLX inference more stable through hardened layers and improves support for recurrent models.
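For readers unfamiliar with the term, KV cache reuse generally means skipping recomputation for the part of a prompt that matches a previously processed one. The sketch below illustrates that general idea only; it is not Ollama's implementation, which the release notes do not describe. The function names `reusable_prefix_len` and `plan_prefill` and the token IDs are hypothetical.

```python
def reusable_prefix_len(cached_tokens, new_tokens):
    """Count the leading tokens shared by a cached prompt and a new prompt."""
    n = 0
    for a, b in zip(cached_tokens, new_tokens):
        if a != b:
            break
        n += 1
    return n


def plan_prefill(cached_tokens, new_tokens):
    """Split a new prompt into a part served from cache and a part to compute.

    Hypothetical helper: a real runtime would keep per-token key/value
    tensors, not just token IDs.
    """
    keep = reusable_prefix_len(cached_tokens, new_tokens)
    return {"reused": keep, "to_compute": len(new_tokens) - keep}


cached = [1, 15, 27, 99, 4]       # made-up token IDs from an earlier request
incoming = [1, 15, 27, 42, 8, 3]  # next request shares a 3-token prefix
plan = plan_prefill(cached, incoming)
print(plan)
```

The longer the shared prefix, the fewer tokens need a fresh forward pass, which is why changes that preserve more of the cache tend to reduce prompt processing time.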

IMPACT Enhances the usability and stability of local LLM deployment tools.

RANK_REASON This is a software release for a tool that facilitates running LLMs locally, not a new frontier model release or significant industry event.

MLX
Ollama

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-06-12 20:37

⚙️ New Ollama Release! ⚙️
Version: v0.30.8
Release Notes:

## What's Changed
* Fixed `ollama launch` selecting the wrong provider in some cases
* Improved prompt caching by decoupling it from context shift for better KV cache reuse
* More stable MLX inference with hardened linear …
