PulseAugur
LIVE 00:53:53
ENTITY LLM agents

LLM agents

PulseAugur coverage of LLM agents — every cluster mentioning LLM agents across labs, papers, and developer communities, ranked by signal.

Total · 30d
10
10 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
9
9 over 90d
TIER MIX · 90D
SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/1 · 10 TOTAL
  1. RESEARCH · CL_28076 ·

    LLM agent prompt optimization breaks prefix cache, increasing costs

    A technical article explores how optimizing prompts for LLM agents can inadvertently break the prefix cache, leading to higher costs than expected. The author explains that while fewer tokens in a prompt might seem chea…

  2. TOOL · CL_28316 ·

    New LITMUS benchmark tests LLM agent safety in real OS environments

    Researchers have introduced LITMUS, a new benchmark designed to evaluate the behavioral safety of LLM agents operating within real OS environments. This benchmark addresses limitations in existing safety evaluations by …

  3. TOOL · CL_27489 ·

    LLM agents show promise in multimodal clinical prediction

    Researchers have benchmarked Large Language Model (LLM) agents for multimodal clinical prediction tasks, synthesizing data from electronic health records, medical images, and clinical notes. Their study found that singl…

  4. TOOL · CL_27527 ·

    LLM agents exploit e-commerce markets in new simulation

    Researchers have developed TruthMarketTwin, a novel simulation framework designed to study the behavior of large language model (LLM) agents in e-commerce settings. This framework models bilateral trade with asymmetric …

  5. TOOL · CL_27572 ·

    Nautilus Compass detects LLM agent persona drift without model access

    Researchers have developed Nautilus Compass, a novel system designed to detect persona drift in large language model (LLM) agents operating in production environments. This black-box method functions solely at the promp…

  6. TOOL · CL_22542 ·

    Researchers reveal LoopTrap to exploit LLM agent termination vulnerabilities

    Researchers have identified a new vulnerability in LLM agents called Termination Poisoning, where malicious prompts can trick agents into believing tasks are incomplete, leading to infinite loops. They developed ten att…

  7. TOOL · CL_26964 ·

    ScrapMem framework enables efficient on-device LLM agent memory

    Researchers have developed ScrapMem, a novel framework designed to enable long-term personalized memory for LLM agents on resource-constrained edge devices. The system utilizes an "Optical Forgetting" mechanism to progr…

  8. RESEARCH · CL_16489 ·

    New attack exploits LLM agent relays, bypassing alignment defenses

    Researchers have identified a new vulnerability in LLM agent architectures that use Bring-Your-Own-Key (BYOK) systems. These architectures route LLM traffic through third-party relays, creating an integrity gap where a …

  9. RESEARCH · CL_11730 ·

    LLMs compute Nash equilibrium but suppress it via final-layer overrides

    Researchers have investigated why large language models (LLMs) deviate from Nash equilibrium play in strategic interactions. By examining open-source models like Llama-3 and Qwen2.5, they found that while opponent histo…

  10. RESEARCH · CL_02979 ·

    New benchmark reveals enterprise LLM agents leak sensitive data

    A new benchmark called CI-Work has been developed to assess the contextual integrity of enterprise LLM agents, focusing on their ability to handle sensitive information. Evaluations of current leading models show signif…