PulseAugur
EN
LIVE 23:07:11

DGX Spark GPU overheating solved by clock-locking with nvidia-smi

A developer has found a workaround for overheating issues with the DGX Spark GPU when running large language models like Ollama and Qwen2.5. The GPU, specifically the GB10, lacks user-accessible power and fan controls, leading to temperatures around 83°C under sustained load. By using the `nvidia-smi --lock-gpu-clocks` command, the developer created a daemon that dynamically adjusts the GPU clock speed to keep temperatures below 78°C, reducing them to a sustained 72°C. While this method slightly impacts inference speed, it ensures 24/7 uptime and thermal headroom. AI

IMPACT Provides a practical solution for managing LLM inference hardware temperatures, ensuring stability and uptime.

RANK_REASON Developer shares a technical solution for a hardware issue with specific software.

Read on dev.to — LLM tag →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

DGX Spark GPU overheating solved by clock-locking with nvidia-smi

COVERAGE [1]

  1. dev.to — LLM tag TIER_1 English(EN) · Deal Estate ·

    DGX Spark hitting 83 C under sustained Ollama load — solved by clock-locking via nvidia-smi -lgc

    <h1> DGX Spark hitting 83°C under sustained Ollama load — solved by clock-locking via nvidia-smi -lgc </h1> <p><strong>TL;DR:</strong> GB10 in the DGX Spark has no user-exposed power-limit or fan-curve control (<code>nvidia-smi</code> returns <code>[N/A]</code> for both — firmwar…