PulseAugur
实时 04:37:08

New APPS method improves LLM reasoning by guiding decoder to better solutions

Researchers have developed Auxiliary Particle Power Sampling (APPS), a novel blockwise particle algorithm designed to improve the efficiency of large language model inference. APPS aims to better locate correct multi-step solutions that base LLMs already assign probability mass to, but struggle to find. By redistributing compute across competing prefixes and using future-value-guided selection, APPS enhances the accuracy-runtime trade-off for training-free decoding on reasoning benchmarks. AI

影响 Improves the efficiency of LLM inference for complex reasoning tasks, potentially narrowing the gap with post-trained systems.

排序理由 This is a research paper detailing a new algorithm for LLM inference.

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

New APPS method improves LLM reasoning by guiding decoder to better solutions

报道来源 [2]

  1. arXiv cs.LG TIER_1 English(EN) · Tu Nguyen, Rasul Tutunov, Xiaotong Ji, Matthieu Zimmer ·

    The Model Knows, the Decoder Finds: Future Value Guided Particle Power Sampling

    arXiv:2605.02427v1 Announce Type: cross Abstract: A recurring pattern in "reasoning without training" is that base LLMs already assign non-trivial probability mass to correct multi-step solutions; the bottleneck is locating these modes efficiently at inference time. Power samplin…

  2. arXiv cs.AI TIER_1 English(EN) · Matthieu Zimmer ·

    The Model Knows, the Decoder Finds: Future Value Guided Particle Power Sampling

    A recurring pattern in "reasoning without training" is that base LLMs already assign non-trivial probability mass to correct multi-step solutions; the bottleneck is locating these modes efficiently at inference time. Power sampling provides a principled way to bias decoding towar…