PulseAugur
EN
LIVE 11:31:01
ENTITY AdvBench

AdvBench

PulseAugur coverage of AdvBench — every cluster mentioning AdvBench across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
5
5 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
5
5 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/1 · 5 TOTAL
  1. TOOL · CL_74402 ·

    Researchers automate security rule generation from attack simulations

    Researchers have developed a method to automatically generate security detection rules from attack simulations. This system deterministically maps findings from Breach-and-Attack-Simulation (BAS) tools to starter Sigma …

  2. RESEARCH · CL_70412 ·

    Hybrid defense framework boosts LLM accuracy and robustness

    Researchers have developed a novel hybrid defense framework to combat both hallucinations and adversarial manipulation in large language models. This approach integrates entropy-based methods for reducing hallucinations…

  3. RESEARCH · CL_62284 ·

    EvoDefense uses LLMs to co-evolve defenses against black-box attacks

    Researchers have developed EvoDefense, a novel approach to protect large language models (LLMs) from attacks in black-box scenarios. This system uses a guard LLM and an experience memory to continuously refine defense s…

  4. TOOL · CL_15984 ·

    New Logit-Gap Steering method efficiently measures AI alignment robustness

    Researchers have developed a new metric called the refusal-affirmation logit gap to quantify the safety margin of aligned language models. This metric, which measures the difference between refusal and affirmation token…

  5. RESEARCH · CL_11458 ·

    New diagnostic tool probes LLM circuits for safety and behavior insights

    A new research paper introduces "Perturbation Probing," a diagnostic method for understanding the internal workings of large language models. This technique uses two forward passes per prompt to identify and analyze "be…