768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU: local Kimi K2.5 install achieved roughly 4 tokens per second

Reddit user runs a 1-trillion-parameter LLM on 768GB of second-hand Optane memory

By the PulseAugur editorial team · [3 sources] · 2026-05-23 11:20

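A Reddit user ran a 1-trillion-parameter LLM, specifically Kimi K2.5, locally on a single-GPU workstation by using 768GB of second-hand Intel Optane persistent memory modules as RAM. The setup reached roughly 4 tokens per second, which is considered impressive given the budget constraints of the hardware. The use of discontinued Optane DIMMs points to a potential market gap for affordable, high-capacity memory for large language model inference, particularly while DRAM prices remain volatile.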
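As a rough sanity check on the figures above, the sketch below works out how many bits per parameter are available if all 1 trillion parameters had to fit in the 768GB of Optane memory. It assumes decimal gigabytes, ignores GPU memory, the KV cache, and runtime overhead, and says nothing about the quantization the Redditor actually used; the function name is hypothetical.

```python
def bits_per_parameter_budget(memory_bytes: float, n_params: float) -> float:
    """Upper bound on average bits per parameter if every weight lives in this memory."""
    return memory_bytes * 8 / n_params


# Figures taken from the summary; decimal GB is an assumption.
MEMORY_BYTES = 768e9   # 768 GB of Optane PMem used as RAM
N_PARAMS = 1e12        # 1 trillion parameters

budget = bits_per_parameter_budget(MEMORY_BYTES, N_PARAMS)
fits_at_16_bit = 16 <= budget   # would unquantized 16-bit weights fit?
fits_at_4_bit = 4 <= budget     # would 4-bit weights fit?

print(f"budget: {budget:.2f} bits per parameter")
print(f"16-bit weights fit: {fits_at_16_bit}, 4-bit weights fit: {fits_at_4_bit}")
```

Under these assumptions the budget is a little over 6 bits per parameter, so the weights would need to be stored at reduced precision to fit entirely in the Optane pool.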

Impact: Demonstrates a cost-effective way to run large LLMs locally, which may influence future hardware configurations for AI inference.

Ranking rationale: A user-driven application of existing hardware to a specific AI task.

Read at Tom's Hardware →

AI-generated summary · Google Gemini · from 3 sources.

Sources [3]

Tom's Hardware TIER_1 English(EN) · Mark Tyson · 2026-05-23 11:20

768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU: local Kimi K2.5 install achieved roughly 4 tokens per second

A Redditor has caused a stir by coaxing a workstation build using Optane PMem DIMMs as RAM to run a 1-trillion parameter LLM.
Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-05-23 14:48

768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second A Redditor has caused a stir by coaxing a workstation build using Optane PMem DIMMs as RAM to run a 1-t…

Links: tomshardware.com/…/enthusiast-runs-1-tril… tomshardware.com/tech-industry
r/singularity TIER_2 English(EN) · /u/Anen-o-me · 2026-05-24 04:24

768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second

Link: https://www.reddit.com/r/singularity/comments/1tm1u3l/768gb_of_cheap_intel_optane_dimm_memory_sticks/
