DeepReinforce has launched Ornith-1.0, a family of open-source coding models available under the MIT license. These models, built upon Gemma 4 and Qwen 3.5, are designed for agentic coding tasks and uniquely learn their own reinforcement learning scaffolds during training. The largest model, Ornith-1.0-397B, has demonstrated strong performance, achieving 82.4% on the SWE-Bench Verified benchmark. AI
IMPACT This release offers a novel approach to training coding agents, potentially improving their ability to learn and adapt without fixed harnesses.
RANK_REASON Open-source release of a new model family with novel self-scaffolding RL capabilities. [lever_c_demoted from frontier_release: ic=2 ai=1.0]
- Claude Opus 4.7
- Claude Opus 4.8
- DeepReinforce
- Gemma 4
- GLM-5.2-744B
- Ornith-1.0
- Qwen 3.5
- SWE-Bench Verified
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →