English(EN) From "Aha Moments" to Controllable Thinking: Toward Meta-Cognitive Reasoning in Large Reasoning Models via Decoupled Reasoning and Control

新的MERA框架提高了LLM的推理效率和准确性

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-24 04:00

研究人员开发了MERA，一个新颖的元认知推理框架，旨在提高大型推理模型（LRMs）的效率和准确性。MERA通过将推理过程与控制机制解耦来解决LRMs中的“过度思考”问题，使模型能够更好地决定何时停止生成文本。该框架利用接管式管道创建监督数据，并采用控制段策略优化（CSPO）进行训练，最终实现更具成本效益和更精确的推理。 AI

影响 MERA控制推理的方法可以降低推理成本和延迟，使LLM在实际应用中更加实用。

排序理由该集群包含一篇详细介绍大型推理模型新框架的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Rui Ha, Rui Pu, Chaozhuo Li, Li Sun, Sen Su · 2026-06-24 04:00

From "Aha Moments" to Controllable Thinking: Toward Meta-Cognitive Reasoning in Large Reasoning Models via Decoupled Reasoning and Control

arXiv:2508.04460v2 Announce Type: replace Abstract: Large Reasoning Models (LRMs) can exhibit step-by-step reasoning, reflection, and backtracking, but these behaviors are often unregulated, leading to overthinking. As a result, LRMs continue generating redundant reasoning even a…

报道来源 [1]

From "Aha Moments" to Controllable Thinking: Toward Meta-Cognitive Reasoning in Large Reasoning Models via Decoupled Reasoning and Control

相关实体

相关话题