English(EN) I compiled LLM inference pricing across 7 providers — the caching numbers are surprising(spreadsheet included) [R]

LLM推理定价在7家供应商之间进行比较，突出缓存成本

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-24 11:28

一位用户编制了一个电子表格，比较了包括OpenAI、Anthropic、Cohere和Mistral AI在内的七家供应商的LLM推理定价。比较侧重于输入/输出令牌定价、上下文窗口和缓存输入成本，而不是性能基准。一个关键发现是缓存输入定价的显著差异，其成本可能比非缓存输入便宜数十倍，这使得它成为代理和RAG管道等应用程序的关键因素。 AI

影响强调了缓存成本对LLM推理的重要性，可能影响应用程序设计和供应商选择。

排序理由用户生成的LLM推理定价和功能比较。[lever_c_demoted from research: ic=1 ai=0.7]

在 r/MachineLearning 阅读 →

基础设施

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/MachineLearning TIER_1 English(EN) · /u/Technomadlyf · 2026-06-24 11:28

I compiled LLM inference pricing across 7 providers — the caching numbers are surprising(spreadsheet included) [R]

<table> <tr><td> <a href="https://www.reddit.com/r/MachineLearning/comments/1ueavxn/i_compiled_llm_inference_pricing_across_7/"> <img alt="I compiled LLM inference pricing across 7 providers — the caching numbers are surprising(spreadsheet included) [R]" src="https://preview.redd…

报道来源 [1]

I compiled LLM inference pricing across 7 providers — the caching numbers are surprising(spreadsheet included) [R]

相关实体

相关话题