Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys, Up from Base GPT-5.4’s 33.5%
Microsoft Research has developed Webwright, an open-source framework that allows AI agents to interact with the web using a terminal-based approach. Unlike traditional agents that act one step at a time in a browser, Webwright agents write and execute Playwright code, bash commands, and inspect logs within a terminal environment. This method significantly improves performance, achieving 60.1% on the Odysseys benchmark, a substantial increase from the 33.5% scored by a base GPT-5.4 model using a conventional screenshot-based agent setting. AI
IMPACT Enables AI agents to perform complex web tasks more effectively by adopting a code-centric development approach, potentially improving automation and data extraction.