Web agents

PulseAugur coverage of Web agents: every cluster mentioning Web agents across labs, papers, and developer communities, ranked by signal.

Total · 30d: 6 (6 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 5 (5 over 90d)
Sentiment · 30d: 4 days with sentiment data

RECENT · PAGE 1/1 · 6 TOTAL
  1. TOOL · CL_117697

    New benchmark reveals hidden failure modes in web agents

    A new arXiv paper introduces Parallel WebBench, a benchmark designed to evaluate web agents more rigorously by identifying failures beyond just final answer correctness. The study reveals persistent issues such as searc…

  2. RESEARCH · CL_115252

    New Ko-WideSearch benchmark reveals web agents struggle with breadth-search tasks

    A new benchmark called Ko-WideSearch has been developed to evaluate the breadth-search capabilities of web agents, focusing on exhaustive set enumeration rather than depth-based question answering. This Korean-language …

  3. TOOL · CL_93479

    New framework MUZZLE finds 44 novel attacks on web agents

    Researchers have developed MUZZLE, an automated framework designed to test the security of web agents against indirect prompt injection attacks. This system adaptively identifies vulnerable injection points and crafts c…

  4. TOOL · CL_79926

    Web agents should adopt typed actions over click-based browsing

    A new position paper proposes a shift from low-level, click-based interactions to typed actions for web agents. This approach, termed 'web verbs,' would expose web operations as typed functions with structured inputs an…

  5. TOOL · CL_39498

    TinyFish Vault secures web agent logins without password exposure

    TinyFish Vault is a new credential management system that lets web agents access accounts securely by handling authentication without exposing passwords directly. This enables automated agents to perf…

  6. RESEARCH · CL_32655

    New WARD defense system protects web agents from prompt injection attacks

    Researchers have developed WARD, a novel defense system designed to protect web agents from prompt injection attacks. This system addresses limitations of existing guard models, such as poor generalization and high fals…