PulseAugur
实时 23:48:34
English(EN) Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation

LLM通过位置感知草稿和不变重排序加速推荐推理

两篇新研究论文解决了使用大型语言模型(LLM)进行推荐系统方面的挑战。一篇题为PAD-Rec的论文介绍了一个位置感知草稿模块,通过考虑项目内的令牌位置和推测深度来加速LLM在生成式列表式推荐中的推理。另一篇题为InvariRank的论文提出了一个架构框架,使基于LLM的推荐重排序对候选项目的顺序不变,从而确保稳定可靠的排名。 AI

影响 引入了提高基于LLM的推荐系统效率和可靠性的方法。

排序理由 两篇在arXiv上发表的学术论文,提出了基于LLM的推荐系统的新方法。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 6 个来源。 我们如何撰写摘要 →

LLM通过位置感知草稿和不变重排序加速推荐推理

报道来源 [6]

  1. arXiv cs.AI TIER_1 English(EN) · Jiaju Chen, Chongming Gao, Chenxiao Fan, Haoyan Liu, Qingpeng Cai, Peng Jiang, Xiangnan He ·

    Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation

    arXiv:2604.27747v1 Announce Type: cross Abstract: Large language model (LLM)-based generative list-wise recommendation has advanced rapidly, but decoding remains sequential and thus latency-prone. To accelerate inference without changing the target distribution, speculative decod…

  2. arXiv cs.LG TIER_1 English(EN) · Ethan Bito, Yongli Ren, Estrid He ·

    One Pass, Any Order: Position-Invariant Listwise Reranking for LLM-Based Recommendation

    arXiv:2604.27599v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly used for recommendation reranking, but their listwise predictions can depend on the order in which candidates are presented. This creates a mismatch between the set-based nature of rec…

  3. arXiv cs.AI TIER_1 English(EN) · Xiangnan He ·

    Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation

    Large language model (LLM)-based generative list-wise recommendation has advanced rapidly, but decoding remains sequential and thus latency-prone. To accelerate inference without changing the target distribution, speculative decoding (SD) uses a small draft model to propose sever…

  4. Hugging Face Daily Papers TIER_1 English(EN) ·

    Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation

    Large language model (LLM)-based generative list-wise recommendation has advanced rapidly, but decoding remains sequential and thus latency-prone. To accelerate inference without changing the target distribution, speculative decoding (SD) uses a small draft model to propose sever…

  5. arXiv cs.LG TIER_1 English(EN) · Estrid He ·

    One Pass, Any Order: Position-Invariant Listwise Reranking for LLM-Based Recommendation

    Large language models (LLMs) are increasingly used for recommendation reranking, but their listwise predictions can depend on the order in which candidates are presented. This creates a mismatch between the set-based nature of recommendation and the sequence-based computation of …

  6. Hugging Face Daily Papers TIER_1 English(EN) ·

    One Pass, Any Order: Position-Invariant Listwise Reranking for LLM-Based Recommendation

    Large language models (LLMs) are increasingly used for recommendation reranking, but their listwise predictions can depend on the order in which candidates are presented. This creates a mismatch between the set-based nature of recommendation and the sequence-based computation of …