ENTITY
LlamaGuard
LlamaGuard
PulseAugur coverage of LlamaGuard — every cluster mentioning LlamaGuard across labs, papers, and developer communities, ranked by signal.
Total · 30d
4
4 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
-
AI safety models vulnerable to fine-tuning and embedding bypass attacks
Two new research papers explore vulnerabilities in AI safety mechanisms. The first paper, "When Safety Geometry Collapses," demonstrates how fine-tuning even benign guard models can inadvertently destroy their safety al…
-
New proxy tool blocks prompt injection attacks on AI models
A new tool called Arc Gate has been developed to act as a proxy, sitting in front of any OpenAI-compatible endpoint. This proxy is designed to effectively block prompt injection attacks before they can reach the underly…