PulseAugur
768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second

Reddit user runs a 1-trillion-parameter LLM on 768GB of second-hand Optane memory

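A Reddit user ran a 1-trillion-parameter LLM, specifically Kimi K2.5, locally on a single-GPU workstation by using 768GB of second-hand Intel Optane persistent memory modules as RAM. The setup achieved roughly 4 tokens per second, which is considered impressive given the budget hardware. The use of discontinued Optane DIMMs points to a potential market gap for affordable, high-capacity memory for large language model inference, especially while DRAM prices fluctuate.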
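As a rough sanity check on the figures above, the sketch below estimates how many bits per parameter a 1-trillion-parameter model can use if all of its weights must fit in 768GB. It is a back-of-the-envelope calculation, not a description of the Redditor's setup: the summary does not say which quantization was used, "GB" is assumed to mean 10^9 bytes, and memory for the KV cache, activations, and the operating system is ignored. The function names are illustrative only.

```python
def max_bits_per_param(memory_bytes: float, n_params: float) -> float:
    """Upper bound on average bits per weight if every weight must fit in memory."""
    return memory_bytes * 8 / n_params


def weights_size_gb(n_params: float, bits_per_param: float) -> float:
    """Size of the weights alone, in decimal gigabytes (10^9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


# Figures taken from the summary; the decimal-GB interpretation is an assumption.
MEMORY_BYTES = 768e9
N_PARAMS = 1e12

budget = max_bits_per_param(MEMORY_BYTES, N_PARAMS)
print(f"Bit budget per parameter: {budget:.3f}")

# Compare a few common weight widths against the 768GB capacity.
for bits in (16, 8, 4):
    size = weights_size_gb(N_PARAMS, bits)
    verdict = "fits" if size * 1e9 <= MEMORY_BYTES else "does not fit"
    print(f"{bits:>2}-bit weights: {size:.0f} GB ({verdict})")
```

Under these assumptions the budget is about 6.1 bits per parameter, so 16-bit weights (about 2000 GB) and 8-bit weights (about 1000 GB) would not fit, while 4-bit weights (about 500 GB) would leave some headroom.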

Impact: Demonstrates a cost-effective way to run large LLMs locally, which may influence future hardware configurations for AI inference.

Ranking rationale: A user-driven application of existing hardware to a specific AI task.

Read at Tom's Hardware →

AI-generated summary · Google Gemini · from 3 sources.


Sources [3]

  1. Tom's Hardware TIER_1 English(EN) · Mark Tyson ·

    768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second

    A Redditor has caused a stir by coaxing a workstation build using Optane PMem DIMMs as RAM to run a 1-trillion parameter LLM.

  2. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second

    A Redditor has caused a stir by coaxing a workstation build using Optane PMem DIMMs as RAM to run a 1-t…

  3. r/singularity TIER_2 English(EN) · /u/Anen-o-me ·

    768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second

    https://www.reddit.com/r/singularity/comments/1tm1u3l/768gb_of_cheap_intel_optane_dimm_memory_sticks/