Anthropic has introduced a new method for its Claude API that significantly reduces token usage and improves accuracy by loading tool schemas on demand. Previously, agents would load all available tool schemas at the start of a request, leading to high token costs and degraded performance as the number of tools increased. The new deferred loading feature allows agents to only load necessary schemas when a task requires them, drastically cutting down context window usage and enhancing the model's ability to select the correct tool. AI
IMPACT Reduces token costs and improves agent accuracy by optimizing tool schema loading, potentially accelerating agent adoption.
RANK_REASON This describes a new feature for an existing product, not a new model release or fundamental research.
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →