Brief

last 24h

[3/3] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.IR (Information Retrieval) English(EN) · 2d · [2 sources]

How Many Tools Should an LLM Agent See? A Chance-Corrected Answer

Researchers have developed a chance-corrected metric called Bits-over-Random (BoR) to evaluate the optimal number of tools an LLM agent should consider for a given query. This metric helps determine if success at a certain tool shortlist depth is better than random selection. Applying this principle through reinforcement learning, an agent learned to adapt its tool shortlist size per query, significantly reducing the number of tools presented while maintaining or improving coverage and LLM selection accuracy. AI

IMPACT Optimizes LLM agent efficiency by reducing unnecessary tool considerations, potentially improving response times and accuracy.
RESEARCH · arXiv cs.CL English(EN) · 5d · [2 sources]

Diagnosis Is Not Prescription: Linguistic Co-Adaptation Explains Patching Hazards in LLM Pipelines

A new research paper introduces the "Linguistic Contract" hypothesis to explain why fixing the most problematic module in a multi-module LLM agent can paradoxically worsen performance. The study found that while causal analysis often points to the routing module as the bottleneck, injecting corrections there degrades results. Instead, patching an upstream query-rewriting module proved more effective, suggesting that downstream modules adapt to upstream error distributions, and direct correction breaks this implicit alignment. AI

IMPACT Explains why direct intervention in LLM agent bottlenecks can fail, suggesting a need for indirect patching strategies to maintain system alignment.
TOOL · arXiv cs.CV English(EN) · 6d

A novel YOLO26-MoE optimized by an LLM agent for insulator fault detection considering UAV images

Researchers have developed a new object detection model, YOLO26-MoE, to improve the automated inspection of electrical power line insulators using UAVs. This model integrates a Mixture-of-Experts (MoE) module to better refine features for detecting subtle and varied fault patterns. An LLM agent was utilized to coordinate the optimization and training process, resulting in state-of-the-art performance with a 0.9900 [email protected]. AI

IMPACT Introduces an LLM-optimized model for improved infrastructure inspection, potentially enhancing grid reliability.

Brief

How Many Tools Should an LLM Agent See? A Chance-Corrected Answer

Diagnosis Is Not Prescription: Linguistic Co-Adaptation Explains Patching Hazards in LLM Pipelines

A novel YOLO26-MoE optimized by an LLM agent for insulator fault detection considering UAV images