新的SEVRA方法优化LLM推理，提高准确性和效率

作者 PulseAugur 编辑部 · [3 个来源] · 2026-06-18 00:00

研究人员开发了一种名为选择性推理分配验证（SEVRA）的新方法，以优化大型语言模型（LLM）的推理使用。SEVRA充当服务层控制器，决定是接受模型的初步答案还是进行额外的验证。在MATH500数据集上使用冻结的Qwen3-4B模型进行测试时，SEVRA在显著减少令牌使用量和有害答案翻转的同时，实现了比总是验证更高的准确性。然而，研究还发现，增加初始推理预算有时可以比选择性恢复产生更少的令牌，但结果相似或更好，这表明在采用选择性验证之前，调整初始预算是主要的优化步骤。 AI

影响这项研究通过优化LLM的推理过程，有望实现更高效的LLM部署，在保持或提高准确性的同时降低计算成本。

排序理由该集群包含一篇详细介绍LLM推理新方法的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。我们如何撰写摘要 →

报道来源 [3]

arXiv cs.AI TIER_1 English(EN) · Sajib Acharjee Dip, Dawei Zhou, Liqing Zhang · 2026-06-19 04:00

Think Again or Think Longer? Selective Verification for Budget-Aware Reasoning

arXiv:2606.19808v1 Announce Type: new Abstract: Test-time reasoning is increasingly used as a serving-time control knob, but extra reasoning is not uniformly valuable: it can repair failed attempts, waste compute on already-correct answers, or introduce harmful answer changes. We…
arXiv cs.CL TIER_1 English(EN) · Liqing Zhang · 2026-06-18 05:25

三思而后行，还是长思？面向预算感知的推理进行选择性验证

Test-time reasoning is increasingly used as a serving-time control knob, but extra reasoning is not uniformly valuable: it can repair failed attempts, waste compute on already-correct answers, or introduce harmful answer changes. We study this as a deployment allocation problem r…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-06-18 00:00

三思而后行，还是长思？预算感知推理的选择性验证

Selective verification approaches optimize test-time reasoning by dynamically deciding when to verify answers, achieving better accuracy and efficiency compared to always-verifying or self-consistency methods.

报道来源 [3]

Think Again or Think Longer? Selective Verification for Budget-Aware Reasoning

三思而后行，还是长思？面向预算感知的推理进行选择性验证

三思而后行，还是长思？预算感知推理的选择性验证

相关实体

相关话题