Ollama has released version 0.6.8, which improves performance for Qwen 3 MoE models on both NVIDIA and AMD hardware. The update also fixes several issues, including GGML assertion failures, memory leaks with image inputs, context cancellation errors, and out-of-memory handling. Additionally, the release includes improvements for file transfer tools and streaming progress indicators for platforms like Discord and Slack.
Impact: Improves the performance and stability of local AI model execution, benefiting developers and users who run models such as Qwen 3.
Ranking rationale: This is a software update for a tool that runs AI models locally, not a release of a new frontier model or significant research.
Read on Mastodon (fosstodon.org)
AI-generated summary · Google Gemini · from 1 source.