Developer ditches semantic embeddings for BM25 in AI agent tool selection

By PulseAugur Editorial · [1 source] · 2026-06-08 13:24

A developer building AI agents found that semantic embeddings, commonly used for tool selection, were unreliable in production. The embeddings struggled to distinguish tools with similar descriptions, which led to incorrect tool choices. After testing three retrieval strategies, the developer found BM25-based search most effective: indexing tool names, descriptions, and schema fields achieved 81% top-1 accuracy.
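To make the approach concrete, here is a minimal sketch of BM25 ranking over tool names, descriptions, and schema fields, written with only the Python standard library. It is not the developer's implementation: the tool catalog, the tokenizer, the `k1` and `b` defaults, and the helper names (`tool_document`, `BM25Index`, `select_tool`) are all assumptions made for illustration, and the sketch does not attempt to reproduce the reported 81% figure.

```python
import math
import re
from collections import Counter

# Hypothetical tool catalog: names, descriptions, and schema fields are invented.
TOOLS = [
    {"name": "create_invoice",
     "description": "Create a new invoice for a customer",
     "schema_fields": ["customer_id", "amount", "due_date"]},
    {"name": "refund_payment",
     "description": "Refund a completed payment",
     "schema_fields": ["payment_id", "reason"]},
    {"name": "search_tickets",
     "description": "Search support tickets by keyword",
     "schema_fields": ["query", "status"]},
]


def tokenize(text):
    # Lowercase and split on anything that is not a letter or digit,
    # so "customer_id" becomes ["customer", "id"].
    return re.findall(r"[a-z0-9]+", text.lower())


def tool_document(tool):
    # One searchable document per tool: name + description + schema field names.
    parts = [tool["name"], tool["description"], *tool["schema_fields"]]
    return tokenize(" ".join(parts))


class BM25Index:
    def __init__(self, docs, k1=1.5, b=0.75):
        self.k1, self.b = k1, b
        self.term_freqs = [Counter(doc) for doc in docs]
        self.doc_lens = [len(doc) for doc in docs]
        self.avg_len = sum(self.doc_lens) / len(docs)
        n = len(docs)
        # Document frequency: number of documents containing each term.
        doc_freq = Counter(term for tf in self.term_freqs for term in tf)
        # Non-negative IDF variant (the "+1 inside the log" form).
        self.idf = {t: math.log(1 + (n - f + 0.5) / (f + 0.5))
                    for t, f in doc_freq.items()}

    def score(self, query_tokens, i):
        tf = self.term_freqs[i]
        # Length normalization: longer-than-average documents are penalized.
        norm = 1 - self.b + self.b * self.doc_lens[i] / self.avg_len
        return sum(
            self.idf[t] * tf[t] * (self.k1 + 1) / (tf[t] + self.k1 * norm)
            for t in query_tokens if t in tf
        )

    def rank(self, query):
        # Return document indices sorted from best to worst match.
        tokens = tokenize(query)
        return sorted(range(len(self.term_freqs)),
                      key=lambda i: self.score(tokens, i), reverse=True)


INDEX = BM25Index([tool_document(tool) for tool in TOOLS])


def select_tool(query):
    # Top-1 selection: the name of the highest-scoring tool.
    return TOOLS[INDEX.rank(query)[0]]["name"]


print(select_tool("refund the payment for order 123"))
```

Because the schema field names are indexed alongside the description, a query term such as "payment" also matches `payment_id`, which is one plausible reason exact-term matching can separate tools whose prose descriptions read alike.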

IMPACT Highlights the limitations of standard semantic search for structured tool selection in AI agents, advocating for keyword-based methods like BM25.

RANK_REASON Developer shares findings from testing retrieval strategies for AI agent tool selection, comparing semantic embeddings and BM25.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

r/MachineLearning TIER_1 English(EN) · /u/AbjectBug5885 · 2026-06-08 13:24

Why I stopped using semantic embeddings for tool selection and switched back to BM25 [D]

I've been building agents for about a year and recently shipped one for a client running ~140 MCP-exposed tools at peak. Along the way I made the canonical mistake. I used cosine similarity over tool description embeddings to pick which tools the…
