PulseAugur
EN
LIVE 18:41:15

Ollama v0.30.0-rc32 improves multi-GPU support and embeddings API

Ollama has released a release candidate version v0.30.0-rc32, which includes several follow-up fixes and improvements for its llama-server functionality. These updates address issues with ROCm build flags for multi-GPU support on Windows, improve version detection for AMD HIP, and ensure consistent behavior for the embeddings API. Additionally, the release optimizes batch sizes for constrained VRAM and fixes a loading bug for v3 models in Imagegen, while also enhancing the model reloading process for embeddings. AI

IMPACT Enhances local LLM management tools with improved multi-GPU support and API consistency.

RANK_REASON This is a release candidate for a tool that manages LLM instances, not a new frontier model release.

Read on Ollama — Releases →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

Ollama v0.30.0-rc32 improves multi-GPU support and embeddings API

COVERAGE [2]

  1. Ollama — Releases TIER_1 (CA) · dhiltgen ·

    v0.30.0-rc32: llama-server followups (#16353)

    <ul> <li>llama-server followups</li> </ul> <p>Misc fixes for <a class="issue-link js-issue-link" href="https://github.com/ollama/ollama/pull/16031">#16031</a></p> <ul> <li>Add back dropped ROCm build flag for multi-GPU support on windows</li> <li>Fix amdhip64_*.dll version detect…

  2. r/LocalLLaMA TIER_1 (SL) · /u/m94301 ·

    Llama Studio v0.2.0

    <table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1tt4pag/llama_studio_v020/"> <img alt="Llama Studio v0.2.0" src="https://preview.redd.it/nbasscdzwi4h1.png?width=640&amp;crop=smart&amp;auto=webp&amp;s=bb1e756b21ae5bbb4df943d6de083c54859f2022" title="Llama St…