LLM agents

PulseAugur coverage of LLM agents: every cluster that mentions them across labs, papers, and developer communities, ranked by signal.

Total · 30d

10 over 90d

Releases · 30d

0 over 90d

Papers · 30d

9 over 90d

SENTIMENT · 30D

3 days with sentiment data

RECENT · PAGE 1/1 · 10 TOTAL

RESEARCH · CL_28076 · May 12 · 08:02

LLM agent prompt optimization breaks prefix cache, increasing costs

A technical article explores how optimizing prompts for LLM agents can inadvertently break the prefix cache, leading to higher costs than expected. The author explains that while fewer tokens in a prompt might seem chea…

TOOL · CL_28316 · May 11 · 16:14

New LITMUS benchmark tests LLM agent safety in real OS environments

Researchers have introduced LITMUS, a new benchmark designed to evaluate the behavioral safety of LLM agents operating within real OS environments. This benchmark addresses limitations in existing safety evaluations by …

TOOL · CL_27489 · May 11 · 09:46

LLM agents show promise in multimodal clinical prediction

Researchers have benchmarked Large Language Model (LLM) agents for multimodal clinical prediction tasks, synthesizing data from electronic health records, medical images, and clinical notes. Their study found that singl…

TOOL · CL_27527 · May 11 · 06:36

LLM agents exploit e-commerce markets in new simulation

Researchers have developed TruthMarketTwin, a novel simulation framework designed to study the behavior of large language model (LLM) agents in e-commerce settings. This framework models bilateral trade with asymmetric …

TOOL · CL_27572 · May 11 · 01:49

Nautilus Compass detects LLM agent persona drift without model access

Researchers have developed Nautilus Compass, a novel system designed to detect persona drift in large language model (LLM) agents operating in production environments. This black-box method functions solely at the promp…

TOOL · CL_22542 · May 8 · 04:00

Researchers reveal LoopTrap to exploit LLM agent termination vulnerabilities

Researchers have identified a new vulnerability in LLM agents called Termination Poisoning, where malicious prompts can trick agents into believing tasks are incomplete, leading to infinite loops. They developed ten att…

TOOL · CL_26964 · May 5 · 14:30

ScrapMem framework enables efficient on-device LLM agent memory

Researchers have developed ScrapMem, a novel framework designed to enable long-term personalized memory for LLM agents on resource-constrained edge devices. The system utilizes an "Optical Forgetting" mechanism to progr…

RESEARCH · CL_16489 · May 4 · 03:35

New attack exploits LLM agent relays, bypassing alignment defenses

Researchers have identified a new vulnerability in LLM agent architectures that use Bring-Your-Own-Key (BYOK) systems. These architectures route LLM traffic through third-party relays, creating an integrity gap where a …

RESEARCH · CL_11730 · May 1 · 04:00

LLMs compute Nash equilibrium but suppress it via final-layer overrides

Researchers have investigated why large language models (LLMs) deviate from Nash equilibrium play in strategic interactions. By examining open-source models like Llama-3 and Qwen2.5, they found that while opponent histo…

RESEARCH · CL_02979 · Apr 23 · 06:00

New benchmark reveals enterprise LLM agents leak sensitive data

A new benchmark called CI-Work has been developed to assess the contextual integrity of enterprise LLM agents, focusing on their ability to handle sensitive information. Evaluations of current leading models show signif…
