PulseAugur

textual refusal directions

PulseAugur coverage of textual refusal directions: every cluster mentioning the topic across labs, papers, and developer communities, ranked by signal.

Total · 30d: 1 (1 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 1 (1 over 90d)
Sentiment · 30d: 1 day with sentiment data

RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_119362

    New MARS method enhances multimodal LLM safety using textual refusal directions

    Researchers have developed Modality-Agnostic Refusal Steering (MARS), a method for improving the safety of multimodal large language models (MLLMs). MARS leverages textual refusal directions, which are typically used…