The llama.cpp project has released version b9802, offering pre-compiled binaries for a wide range of operating systems and hardware architectures. This release includes support for macOS, Linux, Android, and Windows, with various CPU and GPU acceleration options such as Vulkan, ROCm, OpenVINO, SYCL, CUDA, and HIP. Some features, like KleidiAI enablement on macOS and openEuler support, are currently disabled. AI
IMPACT Provides broader accessibility and performance options for running large language models on diverse hardware.
RANK_REASON This is a software release for a tool that facilitates running LLMs locally, not a frontier model release.
Read on llama.cpp — Releases →
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →