English(EN) iPhone on-device LLM: the GPU wins the sprint, the Neural Engine wins the marathon

iPhone 大模型基准测试：Neural Engine 在持续性能上胜过 GPU

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-04 09:38

iPhone 17 Pro 上的端侧大模型性能测试表明，虽然 GPU 在初始生成速度上占优，但它们会迅速过热并降频。苹果的 Neural Engine 虽然启动较慢，但由于功耗显著降低，在长时间内能保持更一致的解码速率。这表明，对于需要持续大模型运行的应用，Neural Engine 是更高效且最终更快的选择，而 GPU 更适合快速、爆发式的交互。 AI

影响 Neural Engine 的持续性能优势表明，为需要长时间运行任务的移动应用优化大模型部署。

排序理由移动硬件上大模型性能的基准分析。[lever_c_demoted from research: ic=1 ai=0.7]

在 dev.to — LLM tag 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

iPhone 大模型基准测试：Neural Engine 在持续性能上胜过 GPU

报道来源 [1]

dev.to — LLM tag TIER_1 English(EN) · Daisuke Majima · 2026-06-04 09:38

iPhone 端侧大模型：GPU 赢下冲刺，Neural Engine 跑完马拉松

The follow-up to my <a href="https://rockyshikoku.medium.com/local-llm-on-iphone-which-runtime-is-actually-fastest-58096685481e" rel="noopener noreferrer">on-device runtime speed benchmark</a> — because burst tok/s only tells half the story. I benchmark on-devi…

报道来源 [1]

iPhone 端侧大模型：GPU 赢下冲刺，Neural Engine 跑完马拉松

相关实体

相关话题