AgentDojo

PulseAugur coverage of AgentDojo: every cluster mentioning AgentDojo across labs, papers, and developer communities, ranked by signal.

Total · 30d: 7 (7 over 90d)

Releases · 30d: 0 (0 over 90d)

Papers · 30d: 6 (6 over 90d)

RELATIONSHIPS

instance of InjecAgent 70%

SENTIMENT · 30D

2 days with sentiment data

RECENT · PAGE 1/1 · 7 TOTAL

TOOL · CL_116442 · Jun 29 · 17:13

Prompt optimization may weaken LLM adversarial robustness, new benchmark suggests

A new benchmark has been developed to investigate whether prompt optimization techniques for Large Language Models (LLMs) weaken their robustness against adversarial attacks, specifically prompt injection. Initial findi…
TOOL · CL_70446 · Jun 4 · 04:00

LLM attack benchmarks cover less than 25% of threat landscape

Researchers have developed a new framework to audit the coverage of benchmarks designed to test Large Language Model (LLM) attacks. This framework, based on a taxonomy of over 500 inference-time attacks, reveals that cu…
TOOL · CL_53868 · May 27 · 04:00

New Protocol Enables LLMs to Safely Control Small Devices

Researchers have introduced the Device Context Protocol (DCP), a new architecture designed to enable large language models (LLMs) to safely control constrained devices. DCP is significantly more lightweight than existin…
TOOL · CL_50472 · May 26 · 02:04

Arc Gate offers solution to OpenAI's 'unfixable' prompt injection vulnerability

OpenAI has stated that prompt injection in browser agents is an unfixable structural vulnerability at the model level. However, a new architectural solution called Arc Gate has demonstrated significant success in mitiga…
TOOL · CL_32688 · May 14 · 17:30

LLM attack benchmarks show significant gaps in security coverage

Researchers have developed a new framework to audit the coverage of LLM attack benchmarks, revealing significant gaps in current evaluations. Their analysis of six public benchmarks showed they collectively cover less t…
RESEARCH · CL_16489 · May 4 · 03:35

New attack exploits LLM agent relays, bypassing alignment defenses

Researchers have identified a new vulnerability in LLM agent architectures that use Bring-Your-Own-Key (BYOK) systems. These architectures route LLM traffic through third-party relays, creating an integrity gap where a …
RESEARCH · CL_99526 · Apr 15 · 22:38

New research explores LLM agent evaluation and improvement techniques

Researchers are exploring new methods for evaluating and improving Large Language Model (LLM) agents. One paper introduces semantic early-stopping for iterative LLM loops, aiming to reduce token usage by halting when me…