English(EN) DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds

DeepReinforce 发布 Ornith-1.0 开源编码模型，可学习 RL 脚手架

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-25 17:11

DeepReinforce 推出了 Ornith-1.0，这是一个在 MIT 许可下提供的开源编码模型家族。这些模型基于 Gemma 4 和 Qwen 3.5 构建，专为代理编码任务设计，并在训练过程中独特地学习自身的强化学习脚手架。最大的模型 Ornith-1.0-397B 在 SWE-Bench Verified 基准测试中取得了 82.4% 的优异成绩。 AI

影响此次发布为训练编码代理提供了一种新颖的方法，有可能提高它们在没有固定约束的情况下学习和适应的能力。

排序理由具有新颖的自脚手架 RL 功能的新模型家族的开源发布。[lever_c_demoted from frontier_release: ic=2 ai=1.0]

在 MarkTechPost 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

DeepReinforce 发布 Ornith-1.0 开源编码模型，可学习 RL 脚手架

报道来源 [2]

MarkTechPost TIER_1 English(EN) · Asif Razzaq · 2026-06-25 17:11

DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds

<p>DeepReinforce released Ornith-1.0, an open-source coding model family built on Gemma 4 and Qwen 3.5. Instead of a fixed harness, the model learns its own scaffold during reinforcement learning. The 397B flagship reports 82.4 on SWE-Bench Verified, with all weights under the MI…
Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-06-25 17:51

DeepReinforce has released Ornith-1.0, an open-source coding model family built on Gemma 4 and Qwen 3.5 that learns its own RL scaffolds during training. The 39

DeepReinforce has released Ornith-1.0, an open-source coding model family built on Gemma 4 and Qwen 3.5 that learns its own RL scaffolds during training. The 397B model achieves 82.4% on SWE-Bench Verified and is available under the MIT license. https://www. marktechpost.com/2026…

报道来源 [2]

DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds

DeepReinforce has released Ornith-1.0, an open-source coding model family built on Gemma 4 and Qwen 3.5 that learns its own RL scaffolds during training. The 39

相关实体

相关话题