PulseAugur
EN
LIVE 18:45:44

Hearth operator scales LLMs to zero on Kubernetes

Hearth, a new Kubernetes operator, aims to reduce costs for self-hosting open-source LLMs by scaling them down to zero when not in use. It supports various hardware accelerators beyond NVIDIA, including Ascend chips, and allows model deployment through a single manifest. While currently in alpha and not production-ready, Hearth has successfully demonstrated its scale-to-zero functionality with NVIDIA GPUs and is working on Ascend backend validation. AI

IMPACT Reduces operational costs for self-hosted LLMs by optimizing GPU utilization.

RANK_REASON This is a new software tool release for managing AI infrastructure, not a frontier model release or significant industry event.

Read on dev.to — LLM tag →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Hearth operator scales LLMs to zero on Kubernetes

COVERAGE [1]

  1. dev.to — LLM tag TIER_1 English(EN) · kube-gopher ·

    Idle GPUs also burn money — a Kubernetes Operator that can scale large models down to zero

    <p><strong>It's early — come build it with me</strong></p> <p>Hearth is moving fast and contributions are very welcome — especially validating the Ascend backend on real NPUs, plus the roadmap's P0/P1 items. There are good first issues waiting.</p> <p>⭐ <strong>Star + follow alon…