PulseAugur / Brief
EN
LIVE 04:08:27

Brief

last 24h
[2/2] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. Microsoft Releases Fara1.5: A Family of Browser Computer-Use Agents (4B/9B/27B) That Outperform OpenAI Operator and Gemini 2.5 Computer Use on Online-Mind2Web

    Microsoft Research has introduced Fara1.5, a series of three browser computer-use agent models (4B, 9B, and 27B parameters) built upon Qwen3.5. These agents are designed to interact with real browsers by interpreting screenshots and executing mouse and keyboard actions to complete tasks. In evaluations on the Online-Mind2Web benchmark, the largest Fara1.5 model achieved a 72% task success rate, surpassing competitors like OpenAI's Operator and Google's Gemini 2.5 Computer Use. AI

    Microsoft Releases Fara1.5: A Family of Browser Computer-Use Agents (4B/9B/27B) That Outperform OpenAI Operator and Gemini 2.5 Computer Use on Online-Mind2Web

    IMPACT Sets a new benchmark for browser automation agents, potentially impacting how users interact with web services and how developers build agentic applications.

  2. Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys, Up from Base GPT-5.4’s 33.5%

    Microsoft Research has developed Webwright, an open-source framework that allows AI agents to interact with the web using a terminal-based approach. Unlike traditional agents that act one step at a time in a browser, Webwright agents write and execute Playwright code, bash commands, and inspect logs within a terminal environment. This method significantly improves performance, achieving 60.1% on the Odysseys benchmark, a substantial increase from the 33.5% scored by a base GPT-5.4 model using a conventional screenshot-based agent setting. AI

    Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys, Up from Base GPT-5.4’s 33.5%

    IMPACT Enables AI agents to perform complex web tasks more effectively by adopting a code-centric development approach, potentially improving automation and data extraction.