English(EN) Tokenminning (for people who write openai.chat.completions.create)

公司转向“tokenminning”以削减 AI 成本

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-19 17:49

公司正从优先考虑最大化 LLM 使用量的“tokenmaxxing”转向“tokenminning”，这是一种在保持输出质量的同时专注于最小化 token 支出的策略。这包括在代码中实施计量、提示词卫生、模型路由和硬预算上限。Meta、Uber、Walmart 和 Amazon 等早期采用者由于成本不断攀升，已放弃了无限使用 LLM 的做法，一些公司在几个月内就超出了年度 AI 预算。文章强调应先进行 token 计量，然后再优化提示词，并提出了一些实用步骤，例如记录 token 使用情况以及强制使用模式输出而非散文以提高成本效益。 AI

影响这种转向注重成本效益的 LLM 使用方式可能会影响 AI 驱动的应用程序的开发和部署方式，优先考虑效率和投资回报率。

排序理由文章讨论了 AI 成本管理中的战略转变，而非具体的产品发布或研究突破。

在 dev.to — LLM tag 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

dev.to — LLM tag TIER_1 English(EN) · Oskar · 2026-06-19 17:49

Tokenminning (for people who write openai.chat.completions.create)

Your team shipped the agent. The demo was magic. Then finance asked why one engineer's coding assistant burned $40k in a month while the feature backlog looked the same. Welcome to the hangover after tokenmaxxing — the habit of maximizi…

报道来源 [1]

Tokenminning (for people who write openai.chat.completions.create)

相关实体

相关话题