New AE-CoT framework enhances LLM jailbreaks using evolutionary reasoning

By PulseAugur Editorial · [1 sources] · 2026-05-26 04:00

Researchers have developed an adaptive evolutionary framework called AE-CoT to jailbreak large reasoning models (LRMs). This method rewrites harmful goals into mild prompts and decomposes them into reasoning fragments to create jailbreak candidates. The framework then uses evolutionary search with crossover and mutation strategies to expand candidate diversity, and an independent scoring model evaluates harmfulness to enhance destructive generations. Experiments show AE-CoT outperforms existing jailbreak methods across multiple models and datasets. AI

IMPACT This research highlights new vulnerabilities in LLMs, potentially impacting their safe deployment and prompting further research into robust defense mechanisms.

RANK_REASON The cluster contains an academic paper detailing a new method for jailbreaking LLMs. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New AE-CoT framework enhances LLM jailbreaks using evolutionary reasoning

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Jianan Li, Simeng Qin, Xiaojun Jia, Lionel Z. Wang, Tianhang Zheng, Xiaoshuang Jia, Yang Liu, Xiaochun Cao · 2026-05-26 04:00

Reasoning as an Attack Surface: Adaptive Evolutionary CoT Jailbreaks for LLMs

arXiv:2605.24497v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) have demonstrated remarkable capabilities in reasoning and generation tasks and are increasingly deployed in real-world applications. However, their explicit chain-of-thought (CoT) mechanism introduces …

COVERAGE [1]

Reasoning as an Attack Surface: Adaptive Evolutionary CoT Jailbreaks for LLMs

RELATED ENTITIES

RELATED TOPICS