PulseAugur
EN
LIVE 10:59:09

EvoDefense uses LLMs to co-evolve defenses against black-box attacks

Researchers have developed EvoDefense, a novel approach to protect large language models (LLMs) from attacks in black-box scenarios. This system uses a guard LLM and an experience memory to continuously refine defense strategies through an iterative attack-defense evolution loop. EvoDefense demonstrates strong generalization capabilities, effectively defending against unseen attacks and various LLM architectures without requiring retraining. AI

IMPACT Enhances LLM security by providing a dynamic defense mechanism against evolving adversarial attacks.

RANK_REASON The cluster contains a research paper detailing a new method for LLM security.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

  1. arXiv cs.CL TIER_1 Nederlands(NL) · Yu Li, Yuenan Hou, Yingmei Wei, Yanming Guo, Chaochao Lu ·

    EvoDefense: Co-Evolving Black-Box Defense with Large Language Models

    arXiv:2605.31140v1 Announce Type: cross Abstract: Large Language Models (LLMs) remain highly vulnerable to diverse attacks, particularly in black-box settings where the internals of target models are inaccessible. Existing black-box defenses typically rely on pre-defined filterin…

  2. arXiv cs.CL TIER_1 Nederlands(NL) · Chaochao Lu ·

    EvoDefense: Co-Evolving Black-Box Defense with Large Language Models

    Large Language Models (LLMs) remain highly vulnerable to diverse attacks, particularly in black-box settings where the internals of target models are inaccessible. Existing black-box defenses typically rely on pre-defined filtering heuristics, which often fail to generalize to un…