新算法利用用户重试来优化 LLM 路由和调度

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-30 04:00

研究人员开发了一种名为 ACQB（anytime CQB）的新算法，以改进大型语言模型（LLM）的查询路由和调度。该算法利用用户重试行为的隐式反馈，而非显式评分，来学习用户偏好并优化 LLM 分配。ACQB 旨在保持队列稳定性并减少对话式 LLM 服务中的累积遗憾，在合成数据、离线数据集和真实用户日志的实验中显示出有希望的结果。 AI

影响这项研究通过优化查询处理和减少用户等待时间，有望实现更高效、更稳定的 LLM 服务。

排序理由该集群包含一篇详细介绍 LLM 路由和调度新算法的研究论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Seoungbin Bae, Junyoung Son, Dabeen Lee · 2026-06-30 04:00

Learning to Route and Schedule LLMs from User Retrials via Contextual Queueing Bandits

arXiv:2602.02061v2 Announce Type: replace Abstract: Explosive demands for LLMs often cause user queries to accumulate in server queues, requiring efficient routing (query-LLM matching) and scheduling (query prioritization) mechanisms. Several online algorithms are being deployed,…

报道来源 [1]

Learning to Route and Schedule LLMs from User Retrials via Contextual Queueing Bandits

相关实体

相关话题