PulseAugur

Llama.cpp adds MTP for Mac, improves offline builds

The llama.cpp project has introduced Multi-Token Prediction (MTP) support on Metal for Mac hardware, with potential gains in token generation speed. Initial tests on an M2 Ultra indicate that prompt processing speed remains consistent, while token generation speed becomes more variable with MTP enabled, especially at longer context lengths. Additionally, the project has addressed issues with building llama.cpp on air-gapped Macs, where specific flags are needed to disable UI downloads during the build process.

Impact: Improves performance and usability for local LLM inference on Mac hardware.

Ranking rationale: The article discusses improvements and features for an existing open-source software project, rather than a new model release or a significant industry-wide event.

Read on dev.to (LLM tag) →

AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

  1. dev.to — LLM tag TIER_1 English(EN) · SomeOddCodeGuy ·

    Llama.cpp's New MTP on MacOS

    So I decided to test out the new MTP in llama.cpp on Metal using my M2 Ultra, and figured I'd toss the results up here. This isn't meant to show the maximum tps you can get on Mac hardware; I'd have run it on the M5 Max or M3 Ultra if that were the case. My goal…

  2. dev.to — LLM tag TIER_1 English(EN) · SomeOddCodeGuy ·

    Building and Running Llama.cpp on an Air-Gapped Mac

    If you ever tried to run Llama.cpp on a MacOS device that doesn't have internet on it, you've probably hit the annoying GateKeeper errors that it's downloaded from the internet and you should delete it. Generally I just build from source to avoid that, but I ran into something…