PulseAugur
EN
LIVE 13:34:26

Self-hosting LLMs is not cheaper than cloud, Reddit user argues

A Reddit user argues that self-hosting large language models is not economically cheaper than cloud-based solutions. They calculated that their personal rig, costing around $2800 and consuming significant electricity, incurs a higher per-token cost than renting cloud GPUs like an H100. The user concludes that the primary motivations for self-hosting are privacy, control, and the desire to tinker, rather than cost savings. AI

IMPACT Self-hosting LLMs is often perceived as more cost-effective, but this analysis suggests privacy and control are the main drivers, not economics.

RANK_REASON User-generated opinion piece on a technical topic.

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/Napster3301 ·

    Stop pretending self-hosting is cheaper. It's not. We do it for different reasons and we should say so.

    <!-- SC_OFF --><div class="md"><p>Did the math on my own rig last week and I'm tired of seeing this sub repeat the &quot;local is cheaper&quot; line without numbers. Let me actaully break it down.</p> <p>My setup: 2x 3090 (used, $1400 total), Ryzen 7900X, 64GB DDR5, around $2800 …