English(EN) Optimal Query Allocation in Extractive QA with LLMs: A Learning-to-Defer Framework with Theoretical Guarantees

新框架优化LLM在抽取式问答中的使用

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-21 04:00

研究人员开发了一个学习延迟（Learning-to-Defer）框架，以提高使用大型语言模型（LLM）进行抽取式问答（EQA）的效率。该方法智能地将查询分配给专用模型，确保高置信度的预测，同时最大限度地降低计算成本。该框架在SQuADv1和TriviaQA等数据集上进行了测试，证明了其提高了答案的可靠性并显著降低了计算开销，使其适用于可扩展的EQA部署。 AI

影响优化了LLM在问答中的资源分配，可能降低成本并提高专业应用中的性能。

排序理由该集群包含一篇学术论文，详细介绍了用于提高LLM在问答中效率的新框架。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv stat.ML 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv stat.ML TIER_1 English(EN) · Yannis Montreuil, Shu Heng Yeo, Axel Carlier, Lai Xing Ng, Wei Tsang Ooi · 2026-05-21 04:00

Optimal Query Allocation in Extractive QA with LLMs: A Learning-to-Defer Framework with Theoretical Guarantees

arXiv:2410.15761v4 Announce Type: replace-cross Abstract: Large Language Models excel in generative tasks but exhibit inefficiencies in structured text selection, particularly in extractive question answering. This challenge is magnified in resource-constrained environments, wher…

报道来源 [1]

Optimal Query Allocation in Extractive QA with LLMs: A Learning-to-Defer Framework with Theoretical Guarantees

相关实体

相关话题