ENTITY Web agents

Web agents

PulseAugur coverage of Web agents — every cluster mentioning Web agents across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

6 over 90d

Releases · 30d

0 over 90d

Papers · 30d

5 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

4 day(s) with sentiment data

RECENT · PAGE 1/1 · 6 TOTAL

TOOL · CL_117697 · Jun 30 · 04:00

New benchmark reveals hidden failure modes in web agents

A new arXiv paper introduces Parallel WebBench, a benchmark designed to evaluate web agents more rigorously by identifying failures beyond just final answer correctness. The study reveals persistent issues such as searc…
RESEARCH · CL_115252 · Jun 25 · 00:00

New Ko-WideSearch benchmark reveals web agents struggle with breadth-search tasks

A new benchmark called Ko-WideSearch has been developed to evaluate the breadth-search capabilities of web agents, focusing on exhaustive set enumeration rather than depth-based question answering. This Korean-language …
TOOL · CL_93479 · Jun 16 · 04:00

New framework MUZZLE finds 44 novel attacks on web agents

Researchers have developed MUZZLE, an automated framework designed to test the security of web agents against indirect prompt injection attacks. This system adaptively identifies vulnerable injection points and crafts c…
TOOL · CL_79926 · Jun 9 · 04:00

Web agents should adopt typed actions over click-based browsing

A new position paper proposes a shift from low-level, click-based interactions to typed actions for web agents. This approach, termed 'web verbs,' would expose web operations as typed functions with structured inputs an…
TOOL · CL_39498 · May 19 · 18:50

TinyFish Vault secures web agent logins without password exposure

TinyFish Vault is a new credential management system designed to allow web agents to access accounts securely. It separates the authentication process from direct password exposure. This enables automated agents to perf…
RESEARCH · CL_32655 · May 14 · 16:26

New WARD defense system protects web agents from prompt injection attacks

Researchers have developed WARD, a novel defense system designed to protect web agents from prompt injection attacks. This system addresses limitations of existing guard models, such as poor generalization and high fals…

New benchmark reveals hidden failure modes in web agents

New Ko-WideSearch benchmark reveals web agents struggle with breadth-search tasks

New framework MUZZLE finds 44 novel attacks on web agents

Web agents should adopt typed actions over click-based browsing

TinyFish Vault secures web agent logins without password exposure

New WARD defense system protects web agents from prompt injection attacks