ToolBench
PulseAugur coverage of ToolBench — every cluster mentioning ToolBench across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
NaviAgent improves LLM tool orchestration with bilevel planning
Researchers have developed NaviAgent, a novel system designed to improve how large language models orchestrate the use of external tools. NaviAgent employs a bilevel architecture that separates task planning from tool e…
-
Cocreli architecture enforces preconditions for reliable instruction following
Researchers have introduced Cocoreli, a novel architecture designed to enhance the reliability of autonomous agents executing human instructions. Cocoreli addresses the issue of agents proceeding with actions despite in…
-
AgentHER framework boosts LLM agent training with failed trajectory relabeling
Researchers have developed AgentHER, a new framework designed to improve the training of LLM agents by repurposing failed trajectories. The system adapts Hindsight Experience Replay to natural language, identifying alte…