LLM tool response bloat dwarfs schema costs, analysis finds

By PulseAugur Editorial · [1 sources] · 2026-06-10 13:30

A new analysis suggests that the perceived high cost of using tool schemas with large language models like Claude is often overstated, with actual schema costs being significantly lower than community estimates. The primary driver of context window consumption is not the initial schema definitions, but rather the accumulating responses from tool calls throughout a session. These responses, such as page snapshots or data query results, can quickly consume a large portion of the context window, making them a more critical area for optimization than the initial schema loading. AI

IMPACT Highlights that optimizing tool response handling, rather than just schema loading, is key to managing LLM context window costs.

RANK_REASON The article provides an analysis and measurement of LLM context window costs, offering an opinion on where the real cost lies.

Read on dev.to — MCP tag →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

LLM tool response bloat dwarfs schema costs, analysis finds

COVERAGE [1]

dev.to — MCP tag TIER_1 English(EN) · PavelTkachenk0 · 2026-06-10 13:30

The real MCP context tax isn't the schemas — it's the responses

If you've spent any time around MCP lately, you've seen the headline: your MCP server is eating your context window. The number that gets quoted is scary — GitHub's MCP server costs you ~55,000 tokens before you type a single word, a quarter of Claude…

COVERAGE [1]

The real MCP context tax isn't the schemas — it's the responses

RELATED ENTITIES

RELATED TOPICS