English(EN) Your MCP servers are burning 50k+ tokens before you type a word

Model Context Protocol 因服务器工具定义而产生高额 token 成本

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-28 19:41

Model Context Protocol (MCP) 的设计可能导致显著的 token 成本和延迟，因为每个连接的服务器都会为每个请求将其完整的工具定义加载到上下文窗口中。这种开销，在有多个服务器和工具的情况下，每次请求可能高达 50,000 到 75,000 个 token，占用了宝贵的上下文空间。为缓解此问题，用户可以通过禁用未使用的服务器、移除冗余、修剪工具表面积以及按需加载小众服务器而不是一直保持连接来减少 token 使用量。 AI

影响优化 MCP 等协议中的 token 使用量可以降低运营成本并提高 AI 应用程序的效率。

排序理由该条目讨论了一个工具和一种优化现有协议的方法，而不是新版本发布或重大的行业事件。

在 dev.to — MCP tag 阅读 →

基础设施

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

Model Context Protocol 因服务器工具定义而产生高额 token 成本

报道来源 [1]

dev.to — MCP tag TIER_1 English(EN) · Ali Al-Jaafari · 2026-06-28 19:41

Your MCP servers are burning 50k+ tokens before you type a word

<p>Here is something I did not realize about the Model Context Protocol until my context window kept feeling full for no reason.</p> <p>Every MCP server you connect loads its full set of tool definitions into the context window on every single request. Those schemas are not free.…

报道来源 [1]

Your MCP servers are burning 50k+ tokens before you type a word

相关实体

相关话题