Microsoft Research has developed Webwright, an open-source framework that lets AI agents interact with the web from a terminal. Unlike traditional agents that act one step at a time in a browser, Webwright agents write and execute Playwright code, run bash commands, and inspect logs within a terminal environment. With this approach, Webwright scores 60.1% on the Odysseys benchmark, up from the 33.5% scored by a base GPT-5.4 model in a conventional screenshot-based agent setting.
Summary written by gemini-2.5-flash-lite from 2 sources.
IMPACT Enables AI agents to perform complex web tasks more effectively by adopting a code-centric approach, potentially improving web automation and data extraction.
RANK_REASON The cluster describes the release of a new open-source framework by Microsoft Research for AI agents, including benchmark results.
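
The terminal-based loop described in the summary (write a script, execute it, inspect the logs) can be illustrated with a minimal Python sketch. This is not Webwright's actual code or API: `run_script` and `EXAMPLE_SCRIPT` are hypothetical names introduced here, and the embedded script assumes Playwright for Python and a Chromium build are installed.

```python
import subprocess
import sys
import tempfile
from pathlib import Path

# Hypothetical script an agent might generate. It uses Playwright's
# synchronous Python API and is only stored here as text, not executed.
EXAMPLE_SCRIPT = '''
from playwright.sync_api import sync_playwright

with sync_playwright() as p:
    browser = p.chromium.launch()
    page = browser.new_page()
    page.goto("https://example.com")
    print(page.title())
    browser.close()
'''


def run_script(code: str, timeout: float = 60.0) -> subprocess.CompletedProcess:
    """Write code to a temporary file, run it in a child process,
    and return the result with stdout and stderr captured as text."""
    with tempfile.TemporaryDirectory() as workdir:
        script = Path(workdir) / "step.py"
        script.write_text(code, encoding="utf-8")
        return subprocess.run(
            [sys.executable, str(script)],
            capture_output=True,  # collect the logs for later inspection
            text=True,
            timeout=timeout,
            cwd=workdir,
        )
```

Calling `run_script(EXAMPLE_SCRIPT)` would return an object whose `returncode`, `stdout`, and `stderr` fields an agent could read before deciding what to write next, in contrast to choosing one browser action per screenshot.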