PulseAugur
实时 23:09:28

New LenVM model offers token-level length control for LLMs

Researchers have developed a new framework called the Length Value Model (LenVM) that predicts the remaining generation length for tokens in large language models. This token-level approach models length as a value estimation problem, providing a dense, annotation-free supervision signal. Experiments show LenVM significantly improves exact length matching on the LIFEBench task and allows for controlled trade-offs between performance and efficiency, maintaining high accuracy on GSM8K even with strict token budgets. AI

影响 Enables more efficient and controlled text generation, potentially improving LLM performance on tasks requiring specific output lengths.

排序理由 Academic paper introducing a novel modeling technique for LLMs.

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

New LenVM model offers token-level length control for LLMs

报道来源 [1]

  1. arXiv cs.CL TIER_1 English(EN) · Zhen Zhang, Changyi Yang, Zijie Xia, Zhen Yang, Chengzhi Liu, Zhaotiao Weng, Yepeng Liu, Haobo Chen, Jin Pan, Chenyang Zhao, Yuheng Bu, Alkesh Patel, Zhe Gan, Xin Eric Wang ·

    Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling

    arXiv:2604.27039v1 Announce Type: new Abstract: Token serves as the fundamental unit of computation in modern autoregressive models, and generation length directly influences both inference cost and reasoning performance. Despite its importance, existing approaches lack fine-grai…