Researchers unveil PermaFrost-Attack for latent LLM poisoning during pretraining

作者 PulseAugur 编辑部 · [2 个来源] · 2026-04-23 23:32

Researchers have introduced PermaFrost-Attack, a novel method for embedding hidden vulnerabilities, termed 'logic landmines,' into large language models during their pretraining phase. This attack, known as Stealth Pretraining Seeding (SPS), involves distributing small, seemingly innocuous poisoned data across the web, which can then be absorbed into future training datasets like Common Crawl. These dormant landmines remain undetected by standard evaluations but can be activated by specific triggers to bypass safety mechanisms and induce unsafe behavior. AI

影响 Introduces a new class of latent vulnerabilities in LLMs, potentially impacting future model safety and trustworthiness.

排序理由 Academic paper detailing a novel attack vector on LLM pretraining.

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Harsh Kumar, Rahul Maity, Tanmay Joshi, Aman Chadha, Vinija Jain, Suranjana Trivedy, Amitava Das · 2026-04-27 04:00

PermaFrost-Attack: Stealth Pretraining Seeding(SPS) for planting Logic Landmines During LLM Training

arXiv:2604.22117v1 Announce Type: cross Abstract: Aligned large language models(LLMs) remain vulnerable to adversarial manipulation, and their dependence on web-scale pretraining creates a subtle but serious attack surface. We study Stealth Pretraining Seeding (SPS), a new attack…
arXiv cs.CL TIER_1 English(EN) · Amitava Das · 2026-04-23 23:32

PermaFrost-Attack: Stealth Pretraining Seeding(SPS) for planting Logic Landmines During LLM Training

Aligned large language models(LLMs) remain vulnerable to adversarial manipulation, and their dependence on web-scale pretraining creates a subtle but serious attack surface. We study Stealth Pretraining Seeding (SPS), a new attack family in which adversaries distribute small amou…

报道来源 [2]

PermaFrost-Attack: Stealth Pretraining Seeding(SPS) for planting Logic Landmines During LLM Training

PermaFrost-Attack: Stealth Pretraining Seeding(SPS) for planting Logic Landmines During LLM Training

相关实体

相关话题