PulseAugur
实时 01:17:40
English(EN) Qwen-AgentWorld: Language World Models for General Agents

新的 AI 智能体利用世界模型和自我修复来增强推理能力

研究人员推出了 Qwen-AgentWorld,这是一种新颖的语言世界模型,旨在模拟七个领域的智能体环境。该模型通过持续预训练、监督微调和强化学习的三阶段流程进行训练,并使用新的 AgentWorldBench 基准进行评估。此外,还开发了一个名为 Polaris 的小型语言模型框架,通过经验抽象和策略修复实现递归自我改进,在各种推理基准上显示出持续的提升。 AI

影响 世界建模和自我压缩智能体方面的这些进步可能带来更强大、更高效的 AI 系统来应对复杂任务。

排序理由 该集群包含两篇详细介绍新 AI 智能体框架和模型的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 7 个来源。 我们如何撰写摘要 →

新的 AI 智能体利用世界模型和自我修复来增强推理能力

报道来源 [7]

  1. Qwen tech blog TIER_1 English(EN) · QwenTeam ·

    Qwen-AgentWorld:通用智能体的语言世界模型

    Today we release Qwen-AgentWorld, a native language world model that simulates agent environments across seven domains: Native world modeling: environment modeling is the training objective from continual pre-training onward (CPT → SFT → RL), not a post hoc adaptation on top of a…

  2. arXiv cs.CL TIER_1 English(EN) · Yuxin Zuo, Zikai Xiao, Li Sheng, Fei Huang, Jianhong Tu, Yuxuan Liu, Tianyi Tang, Xiaomeng Hu, Yang Su, Qingfeng Lan, Yantao Liu, Qin Zhu, Yinger Zhang, Bowen Yu, Haiquan Zhao, Haiyang Xu, Jianxin Yang, Jiayang Cheng, Junyang Wang, Lianghao Deng, Mingfen… ·

    Qwen-AgentWorld: 通用智能体的语言世界模型

    arXiv:2606.24597v1 Announce Type: new Abstract: A world model predicts environment dynamics based on current observations and actions, serving as a core cognitive mechanism for reasoning and planning. In this work, we investigate how world modeling based on language models can fu…

  3. arXiv cs.LG TIER_1 English(EN) · Aditya Kakade, Vivek Srivastava, Shirish Karande ·

    Polaris:一种通过经验抽象策略修复的小型语言模型 Godel Agent 框架

    arXiv:2603.23129v3 Announce Type: replace Abstract: G\"odel agent realize recursive self-improvement: an agent inspects its own policy and traces and then modifies that policy in a tested loop. We introduce Polaris, a G\"odel agent for compact models that performs policy repair v…

  4. arXiv cs.CL TIER_1 English(EN) · Ning Ding ·

    Qwen-AgentWorld:通用智能体的语言世界模型

    A world model predicts environment dynamics based on current observations and actions, serving as a core cognitive mechanism for reasoning and planning. In this work, we investigate how world modeling based on language models can further push the boundaries of general agents. (i)…

  5. Hugging Face Daily Papers TIER_1 English(EN) ·

    Qwen-AgentWorld:通用智能体的语言世界模型

    Language-based world models enable agentic environment simulation across multiple domains and enhance general agent performance through scalable simulation and improved downstream task performance.

  6. arXiv cs.CL TIER_1 English(EN) · Daniel Khashabi ·

    自密实语言模型代理

    Long agent traces composed of chains of thought and tool calls accumulate stale content that anchor subsequent generations, and eventually outgrow the context window. Existing scaffolds mitigate it with fixed-interval compaction triggered at a token threshold. Such triggers pay n…

  7. Hugging Face Daily Papers TIER_1 English(EN) ·

    自密实语言模型代理

    SelfCompact is a scaffolding approach that enables models to autonomously determine optimal compaction timing and methods for managing long agent traces, achieving better performance with reduced token costs compared to fixed-interval methods.