English(EN) Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation

LLM通过位置感知草稿和不变重排序加速推荐推理

作者 PulseAugur 编辑部 · [6 个来源] · 2026-04-30 08:49

两篇新研究论文解决了使用大型语言模型（LLM）进行推荐系统方面的挑战。一篇题为PAD-Rec的论文介绍了一个位置感知草稿模块，通过考虑项目内的令牌位置和推测深度来加速LLM在生成式列表式推荐中的推理。另一篇题为InvariRank的论文提出了一个架构框架，使基于LLM的推荐重排序对候选项目的顺序不变，从而确保稳定可靠的排名。 AI

影响引入了提高基于LLM的推荐系统效率和可靠性的方法。

排序理由两篇在arXiv上发表的学术论文，提出了基于LLM的推荐系统的新方法。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 6 个来源。我们如何撰写摘要 →

报道来源 [6]

arXiv cs.AI TIER_1 English(EN) · Jiaju Chen, Chongming Gao, Chenxiao Fan, Haoyan Liu, Qingpeng Cai, Peng Jiang, Xiangnan He · 2026-05-01 04:00

LLM驱动的生成式列表式推荐中的位置感知草稿以加速推理

arXiv:2604.27747v1 Announce Type: cross Abstract: Large language model (LLM)-based generative list-wise recommendation has advanced rapidly, but decoding remains sequential and thus latency-prone. To accelerate inference without changing the target distribution, speculative decod…
arXiv cs.LG TIER_1 English(EN) · Ethan Bito, Yongli Ren, Estrid He · 2026-05-01 04:00

一次通过，任意顺序：基于LLM的推荐的位置不变列表重排

arXiv:2604.27599v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly used for recommendation reranking, but their listwise predictions can depend on the order in which candidates are presented. This creates a mismatch between the set-based nature of rec…
arXiv cs.AI TIER_1 English(EN) · Xiangnan He · 2026-04-30 11:37

LLM驱动的生成式列表式推荐中的位置感知草稿以加速推理

Large language model (LLM)-based generative list-wise recommendation has advanced rapidly, but decoding remains sequential and thus latency-prone. To accelerate inference without changing the target distribution, speculative decoding (SD) uses a small draft model to propose sever…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-30 11:37

LLM驱动的生成式列表式推荐中的位置感知草稿以加速推理

Large language model (LLM)-based generative list-wise recommendation has advanced rapidly, but decoding remains sequential and thus latency-prone. To accelerate inference without changing the target distribution, speculative decoding (SD) uses a small draft model to propose sever…
arXiv cs.LG TIER_1 English(EN) · Estrid He · 2026-04-30 08:49

一次通过，任意顺序：基于LLM的推荐的位置不变列表重排

Large language models (LLMs) are increasingly used for recommendation reranking, but their listwise predictions can depend on the order in which candidates are presented. This creates a mismatch between the set-based nature of recommendation and the sequence-based computation of …
Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-30 08:49

一次通过，任意顺序：基于LLM的推荐的位置不变列表重排

Large language models (LLMs) are increasingly used for recommendation reranking, but their listwise predictions can depend on the order in which candidates are presented. This creates a mismatch between the set-based nature of recommendation and the sequence-based computation of …

报道来源 [6]

LLM驱动的生成式列表式推荐中的位置感知草稿以加速推理

一次通过，任意顺序：基于LLM的推荐的位置不变列表重排

LLM驱动的生成式列表式推荐中的位置感知草稿以加速推理

LLM驱动的生成式列表式推荐中的位置感知草稿以加速推理

一次通过，任意顺序：基于LLM的推荐的位置不变列表重排

一次通过，任意顺序：基于LLM的推荐的位置不变列表重排

相关实体

相关话题