English(EN) Does the Same Token Mean the Same State? MoE Routing as Signal for Reasoning Control

新的 RAD 方法在不进行文本分析的情况下控制 MoE 语言模型的推理

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-22 03:17

研究人员开发了一种名为 RAD（路由一致性解码）的新方法，用于控制稀疏专家混合（MoE）语言模型的推理。该技术利用 MoE 模型的内部路由状态来指导模型的响应，而不是依赖输出文本。RAD 在各种数据集（包括数学和代码生成任务）上的表现与传统方法相当，并为无法进行精确字符串匹配的任务提供了一种替代方法。 AI

影响引入了一种控制 MoE 模型的新颖方法，该方法有望提高需要复杂推理或代码生成任务的性能。

排序理由介绍一种控制 MoE 语言模型新颖方法的论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Yugang Jiang · 2026-06-22 03:17

Does the Same Token Mean the Same State? MoE Routing as Signal for Reasoning Control

In sparse Mixture-of-Experts language models, does the same token id imply the same router state and the same experts producing it? Holding the emitted token id fixed at repeated anchors, we find it does not: the experts that produce it still separate task context, trajectory his…

报道来源 [1]

Does the Same Token Mean the Same State? MoE Routing as Signal for Reasoning Control

相关实体

相关话题