The DS4 model is reportedly running on NVIDIA's DGX Spark hardware, utilizing GB10 and CUDA. Initial performance metrics indicate a speed of 12 tokens per second, with observed memory throughput limited to 270 GB/s. This setup is currently confined to a private branch, suggesting it is in an experimental or developmental phase. AI
影响 This indicates potential advancements in AI hardware utilization and performance benchmarks for large models.
排序理由 The cluster describes a model running on specific hardware, with performance metrics, which constitutes a research milestone or technical report.
在 Mastodon — mastodon.social 阅读 →
AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →