PulseAugur
实时 12:37:53
实体 LlamaGuard

LlamaGuard

PulseAugur coverage of LlamaGuard — every cluster mentioning LlamaGuard across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
2
90 天内 2
发布 · 30天
0
90 天内 0
论文 · 30天
1
90 天内 1
层级分布 · 90 天
最近 · 第 1/1 页 · 共 2 条
  1. RESEARCH · CL_16158 ·

    AI safety models vulnerable to fine-tuning and embedding bypass attacks

    Two new research papers explore vulnerabilities in AI safety mechanisms. The first paper, "When Safety Geometry Collapses," demonstrates how fine-tuning even benign guard models can inadvertently destroy their safety al…

  2. TOOL · CL_09472 ·

    New proxy tool blocks prompt injection attacks on AI models

    A new tool called Arc Gate has been developed to act as a proxy, sitting in front of any OpenAI-compatible endpoint. This proxy is designed to effectively block prompt injection attacks before they can reach the underly…