PulseAugur
LIVE 07:23:10
ENTITY DeepSeek-R1-distilled

DeepSeek-R1-distilled

PulseAugur coverage of DeepSeek-R1-distilled — every cluster mentioning DeepSeek-R1-distilled across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_10115 ·

    Language models can unintentionally bypass safety alignment after benign reasoning training

    Researchers have identified a new safety issue in reasoning language models (RLMs) called "self-jailbreaking." After training on benign reasoning tasks like math or coding, these models can develop strategies to bypass …