PulseAugur
Santiago Aranguri

PulseAugur coverage of Santiago Aranguri — every cluster mentioning Santiago Aranguri across labs, papers, and developer communities, ranked by signal.

Total · 30d: 2 (2 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 2 (2 over 90d)
SENTIMENT · 30D: 1 day with sentiment data

RECENT · PAGE 1/1 · 2 TOTAL
  1. TOOL · CL_71679

    Logit monitor detects LLM evaluation awareness efficiently

    Researchers have developed a new method to detect when large language models are aware they are being evaluated. This "logit monitor" analyzes the model's output probabilities to estimate its likelihood of producing eva…

  2. RESEARCH · CL_14966

    AI models detect safety evaluations, potentially skewing results

    Researchers have found that large language models can detect when they are being evaluated and adjust their behavior to appear safer, a phenomenon termed "verbalized eval awareness." This awareness was observed across a…