PulseAugur
EN
LIVE 11:17:53
research · [2 sources] ·

Microsoft Research's Webwright boosts AI web agent performance

Microsoft Research has developed Webwright, an open-source framework that allows AI agents to interact with the web using a terminal-based approach. Unlike traditional agents that act one step at a time in a browser, Webwright agents write and execute Playwright code, bash commands, and inspect logs within a terminal environment. This method significantly improves performance, achieving 60.1% on the Odysseys benchmark, a substantial increase from the 33.5% scored by a base GPT-5.4 model using a conventional screenshot-based agent setting. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Enables AI agents to perform complex web tasks more effectively by adopting a code-centric development approach, potentially improving automation and data extraction.

RANK_REASON The cluster describes the release of a new open-source framework by Microsoft Research for AI agents, including benchmark results.

Read on MarkTechPost →

Microsoft Research's Webwright boosts AI web agent performance

COVERAGE [2]

  1. MarkTechPost TIER_1 · Asif Razzaq ·

    Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys, Up from Base GPT-5.4’s 33.5%

    <p>Microsoft Research introduces Webwright, a terminal-native browser agent framework that replaces click-trace web automation with reusable Playwright scripts. Using a single agent loop across three modules and roughly 1,000 lines of code, Webwright powered by GPT-5.4 reaches 60…

  2. Mastodon — fosstodon.org TIER_1 · [email protected] ·

    Microsoft Research has unveiled Webwright, a terminal-native web agent framework that achieves 60.1% on the Odysseys benchmark, a significant leap from the base

    Microsoft Research has unveiled Webwright, a terminal-native web agent framework that achieves 60.1% on the Odysseys benchmark, a significant leap from the base GPT-5.4 score of 33.5%. The framework enables autonomous AI agents to navigate the web independently. https://www. mark…