Hearth operator scales LLMs to zero on Kubernetes

By PulseAugur Editorial · [1 source] · 2026-06-04 14:30

Hearth, a new Kubernetes operator, aims to cut the cost of self-hosting open-source LLMs by scaling them down to zero when they are not in use. It supports hardware accelerators beyond NVIDIA GPUs, including Ascend NPUs, and deploys a model from a single manifest. Hearth is in alpha and not production-ready: its scale-to-zero behavior has been demonstrated on NVIDIA GPUs, while the Ascend backend is still being validated.

IMPACT Reduces operational costs for self-hosted LLMs by optimizing GPU utilization.

RANK_REASON This is a new software tool release for managing AI infrastructure, not a frontier model release or significant industry event.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

dev.to — LLM tag TIER_1 English(EN) · kube-gopher · 2026-06-04 14:30

Idle GPUs also burn money — a Kubernetes Operator that can scale large models down to zero

It's early, come build it with me. Hearth is moving fast and contributions are very welcome, especially validating the Ascend backend on real NPUs, plus the roadmap's P0/P1 items. There are good first issues waiting. ⭐ Star + follow alon…

