Microsoft FastContext: A Repo-Explorer Subagent Cuts Coding-Agent Tokens 60%: Explorer-Subagent Context Offloading
Microsoft has developed FastContext, a system that reduces the token usage of coding agents by offloading repository exploration to a dedicated subagent. This specialized 4B-30B model performs read-only searches and returns concise file-and-line citations to the main coding agent instead of entire file contents. The savings are significant because repository search previously accounted for more than half of a coding agent's tool-use turns and a substantial portion of its token budget.
AI IMPACT Reduces token costs and latency for coding agents by optimizing repository search, potentially accelerating development workflows.
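
The citation-passing idea described above can be illustrated with a minimal Python sketch. This is not FastContext's implementation: the real explorer is a language model, while this stand-in uses a plain regex scan. The names `Citation` and `explore`, and the `context` and `max_hits` parameters, are hypothetical and chosen only for illustration.

```python
from __future__ import annotations

import os
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Citation:
    """A pointer to a span of lines in a file (hypothetical type for this sketch)."""
    path: str
    start_line: int
    end_line: int

    def render(self) -> str:
        # Compact form handed back to the main agent, e.g. "app.py:2-5".
        return f"{self.path}:{self.start_line}-{self.end_line}"


def explore(repo_root: str, pattern: str, context: int = 2,
            max_hits: int = 20) -> list[Citation]:
    """Read-only search that returns citations, never file contents.

    Assumption: a regex scan stands in for the explorer subagent's search.
    """
    regex = re.compile(pattern)
    hits: list[Citation] = []
    for dirpath, dirnames, filenames in os.walk(repo_root):
        dirnames.sort()  # deterministic traversal order
        for name in sorted(filenames):
            full = os.path.join(dirpath, name)
            try:
                with open(full, encoding="utf-8") as fh:
                    lines = fh.read().splitlines()
            except (UnicodeDecodeError, OSError):
                continue  # skip binary or unreadable files
            for i, line in enumerate(lines, start=1):
                if regex.search(line):
                    hits.append(Citation(
                        path=os.path.relpath(full, repo_root),
                        start_line=max(1, i - context),
                        end_line=min(len(lines), i + context),
                    ))
                    if len(hits) >= max_hits:
                        return hits
    return hits
```

In this sketch the main agent would receive only short strings such as those produced by `Citation.render()` and could then read just the cited line range when it needs the code, rather than carrying whole files in its context.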