Small LLMs internalize tool knowledge via QLoRA fine-tuning

作者 PulseAugur 编辑部 · [1 source] · 2026-05-18 02:48

Researchers have developed a method to internalize tool knowledge into small language models using QLoRA fine-tuning, reducing the need for explicit tool schemas in prompts. By training models like Gemma 4 E4B and Qwen3-4B on tool-use examples, they achieved better planning scores than a baseline that received full tool descriptions. This approach significantly cuts down on input length and inference overhead while maintaining or improving tool-planning quality, though it may impact general knowledge retention. AI

影响 Enables more efficient use of smaller models in agentic systems by reducing prompt token overhead.

排序理由 The cluster contains an academic paper detailing a new fine-tuning method for small language models. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 · Tanmay Agarwal · 2026-05-18 02:48

Internalizing Tool Knowledge in Small Language Models via QLoRA Fine-Tuning

Large language models are increasingly used as planning components in agentic systems, but current tool-use pipelines often require full tool schemas to be included in every prompt, creating substantial token overhead and limiting the practicality of smaller models. This paper in…

报道来源 [1]

Internalizing Tool Knowledge in Small Language Models via QLoRA Fine-Tuning

相关实体

相关话题