English(EN) Is it possible to run a giant model like GLM5.2 on this cluster (4x servers with 512GB RAM + dual AMD Epyc)? 16 channel memory should hit 409GB/s per node.

用户探讨在多节点CPU集群上运行大型GLM5.2模型

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-23 17:46

一位用户正在咨询关于在由四台Dell C6525服务器组成的集群上运行大型语言模型（特别是GLM5.2）的可行性。每台服务器配备双AMD EPYC 7702处理器、512GB内存和快速SSD存储，总计2TB内存，并在四个节点上提供显著的内存带宽。用户正在探索集群化这些系统的选项，以提高token速度或加载更大模型（如GLM5.2的Unsloth 4位或8位版本），用于代理编码任务。 AI

排序理由用户关于在自定义硬件上运行特定模型的提问，并非正式发布或行业活动。

在 r/LocalLLaMA 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/LocalLLaMA TIER_1 English(EN) · /u/StartupTim · 2026-06-23 17:46

像GLM5.2这样的巨型模型能否在此集群（4个服务器，512GB内存+双AMD Epyc）上运行？每个节点16通道内存应能达到409GB/s。

<div class="md"><p>Hey all,</p> <p>I have a piece of hardware laying around which is pretty fast from a traditional (non-GPU) server viewpoint. The hardware is the following:</p> <ul> <li>Dell C6525 Server with Quad Node (4x server blades) with the following:</li> …

报道来源 [1]

像GLM5.2这样的巨型模型能否在此集群（4个服务器，512GB内存+双AMD Epyc）上运行？每个节点16通道内存应能达到409GB/s。

相关实体

相关话题