Ollama v0.30.0-rc32 improves multi-GPU support and embeddings API

By PulseAugur Editorial · [2 sources] · 2026-05-31 19:21

Ollama has published release candidate v0.30.0-rc32, which bundles follow-up fixes for its llama-server functionality. The updates restore a dropped ROCm build flag for multi-GPU support on Windows, fix version detection for the AMD HIP library (amdhip64_*.dll), and make the embeddings API behave consistently. The release also optimizes batch sizes for constrained VRAM, fixes a loading bug for v3 models in Imagegen, and improves model reloading for embeddings.
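As a rough illustration of what "consistent behavior" for embeddings could look like from a client's point of view, the sketch below builds a request for Ollama's `/api/embed` endpoint and compares two batches of returned vectors. The local address `http://localhost:11434`, the model name `all-minilm` in the usage note, the tolerance, and all helper names are assumptions for illustration; the release notes do not describe how the fix was verified.

```python
import json
import math
import urllib.request

# Assumed default address of a locally running Ollama server.
OLLAMA_URL = "http://localhost:11434/api/embed"


def build_embed_request(model, texts):
    """Build the JSON body for an /api/embed call."""
    return json.dumps({"model": model, "input": texts}).encode("utf-8")


def fetch_embeddings(model, texts, url=OLLAMA_URL):
    """POST to a running Ollama server and return its list of embedding vectors."""
    req = urllib.request.Request(
        url,
        data=build_embed_request(model, texts),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["embeddings"]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def embeddings_consistent(first, second, tol=1e-4):
    """True if two batches of vectors match pairwise within a cosine tolerance."""
    if len(first) != len(second):
        return False
    return all(1.0 - cosine_similarity(a, b) <= tol for a, b in zip(first, second))
```

With a server running and a model pulled, calling `fetch_embeddings("all-minilm", ["hello"])` twice and passing both results to `embeddings_consistent` gives a quick check that repeated requests agree.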

IMPACT Enhances local LLM management tools with improved multi-GPU support and API consistency.

RANK_REASON This is a release candidate for a tool that manages LLM instances, not a new frontier model release.

AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

Ollama — Releases TIER_1 (CA) · dhiltgen · 2026-06-01 17:44

v0.30.0-rc32: llama-server followups (#16353)

llama-server followups

Misc fixes for #16031 (https://github.com/ollama/ollama/pull/16031):

- Add back dropped ROCm build flag for multi-GPU support on windows
- Fix amdhip64_*.dll version detect…

r/LocalLLaMA TIER_1 (SL) · /u/m94301 · 2026-05-31 19:21

Llama Studio v0.2.0

https://www.reddit.com/r/LocalLLaMA/comments/1tt4pag/llama_studio_v020/

