Brief

last 24h

[2/2] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.CL English(EN) · 1w

Internalizing Tool Knowledge in Small Language Models via QLoRA Fine-Tuning

Researchers have developed a method to internalize tool knowledge into small language models using QLoRA fine-tuning, reducing the need for explicit tool schemas in prompts. By training models like Gemma 4 E4B and Qwen3-4B on tool-use examples, they achieved better planning scores than a baseline that received full tool descriptions. This approach significantly cuts down on input length and inference overhead while maintaining or improving tool-planning quality, though it may impact general knowledge retention. AI

IMPACT Enables more efficient use of smaller models in agentic systems by reducing prompt token overhead.
RESEARCH · arXiv cs.AI English(EN) · 5d · [2 sources]

Evaluating Temporal Semantic Caching and Workflow Optimization in Agentic Plan-Execute Pipelines

A new research paper introduces optimizations for agentic plan-execute pipelines in industrial asset operations, addressing latency-sensitive workflows. The proposed temporal semantic cache and workflow optimizations, including disk-backed tool discovery and parallel execution, achieved significant speedups. The study highlights limitations of existing semantic caching methods for parameter-rich industrial queries, emphasizing the need for correctness in agent benchmarks. AI

IMPACT Introduces optimizations for agentic pipelines, potentially improving efficiency and reducing latency in industrial AI applications.
- LLM
- AssetOpsBench
- arXiv

Brief

Internalizing Tool Knowledge in Small Language Models via QLoRA Fine-Tuning

Evaluating Temporal Semantic Caching and Workflow Optimization in Agentic Plan-Execute Pipelines