research · [2 sources] · 2026-05-24 08:56

Microsoft Research's Webwright boosts AI web agent performance

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 2 sources

Microsoft Research has released Webwright, an open-source framework that lets AI agents work on the web from a terminal. Instead of acting one step at a time in a browser, Webwright agents write and execute Playwright code, run bash commands, and inspect logs within a terminal environment. Powered by GPT-5.4, Webwright scores 60.1% on the Odysseys benchmark, a gain of 26.6 percentage points over the 33.5% scored by base GPT-5.4 in a conventional screenshot-based agent setting.
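To make the terminal-based idea concrete, here is a minimal sketch of such a loop in Python. It is not Webwright's code: the names `run_command`, `agent_loop`, and `scripted_policy` are hypothetical, and the policy is a fixed stand-in for the language model. It only illustrates the pattern described above, in which the agent issues shell commands, reads their output, and decides what to run next.

```python
import subprocess
from typing import Callable, List, Optional, Tuple

# One observation = (command, exit code, combined stdout/stderr).
Observation = Tuple[str, int, str]
# A policy maps (task, history so far) to the next command, or None to stop.
Policy = Callable[[str, List[Observation]], Optional[str]]


def run_command(command: str, timeout_s: float = 30.0) -> Tuple[int, str]:
    """Run one shell command and return (exit_code, combined output)."""
    result = subprocess.run(
        command, shell=True, capture_output=True, text=True, timeout=timeout_s
    )
    return result.returncode, result.stdout + result.stderr


def agent_loop(task: str, policy: Policy, max_steps: int = 10) -> List[Observation]:
    """Ask the policy for a command, run it, record the result, repeat."""
    history: List[Observation] = []
    for _ in range(max_steps):
        command = policy(task, history)
        if command is None:  # the policy signals that it is finished
            break
        exit_code, output = run_command(command)
        history.append((command, exit_code, output))
    return history


def scripted_policy(task: str, history: List[Observation]) -> Optional[str]:
    """Hypothetical stand-in for the model: replays a fixed two-step plan.

    A real agent might instead emit a command that writes or runs a
    Playwright script and then read the logs from the output.
    """
    plan = ["echo step-one", "echo step-two"]
    return plan[len(history)] if len(history) < len(plan) else None


if __name__ == "__main__":
    for command, code, output in agent_loop("demo task", scripted_policy):
        print(f"$ {command} -> exit {code}: {output.strip()}")
```

Because every command and its output are kept in `history`, a failed step can be inspected and retried as text, which is the main contrast with acting on one screenshot at a time.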


IMPACT Enables AI agents to perform complex web tasks more effectively by adopting a code-centric development approach, potentially improving automation and data extraction.

RANK_REASON The cluster describes the release of a new open-source framework by Microsoft Research for AI agents, including benchmark results.



COVERAGE [2]

MarkTechPost TIER_1 · Asif Razzaq · 2026-05-24 08:56

Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys, Up from Base GPT-5.4’s 33.5%

Microsoft Research introduces Webwright, a terminal-native browser agent framework that replaces click-trace web automation with reusable Playwright scripts. Using a single agent loop across three modules and roughly 1,000 lines of code, Webwright powered by GPT-5.4 reaches 60…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-24 09:52


Microsoft Research has unveiled Webwright, a terminal-native web agent framework that achieves 60.1% on the Odysseys benchmark, a significant leap from the base GPT-5.4 score of 33.5%. The framework enables autonomous AI agents to navigate the web independently. https://www. mark…

LINKS marktechpost.com/…/microsoft-research-rel…

