English(EN) How're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]

寻求在经济实惠、可控的生产环境中部署 LLM 的策略

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-26 06:29

一位 r/MachineLearning 子版块的用户正在寻求关于在生产环境中部署开源大型语言模型 (LLM) 的最经济实惠且最高效方法的建议。该用户旨在完全控制其 AI 产品的堆栈，并针对特定用例微调模型，同时避免 CUDA 或 Transformers 等复杂技术挑战。他们正在寻找一种直接的私有部署途径，此前他们曾通过 OpenRouter 使用过 LLM API。 AI

影响运营商正在探索经济高效且可控的部署开源 LLM 的方法，以获得完整的堆栈所有权和微调能力。

排序理由用户查询寻求关于 LLM 部署策略的建议。

在 r/MachineLearning 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/MachineLearning TIER_1 English(EN) · /u/Necessary_Gazelle211 · 2026-06-26 06:29

How're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]

<div class="md">I've been developing an AI product using LLM APIs (from OpenRouter) but want to deploy an open-source LLM in my own Prod env. which I can control. Few reasons behind this are: - I wanna own the complete stack around my product.…

报道来源 [1]

How're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]

相关实体

相关话题