AI evaluation

PulseAugur coverage of AI evaluation — every cluster mentioning AI evaluation across labs, papers, and developer communities, ranked by signal.

Total · 30d: 1 (1 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 1 (1 over 90d)
Sentiment · 30d: 1 day with sentiment data

RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_58564

    New library and framework enhance AI evaluation with prediction-powered inference

    Researchers have introduced GLIDE, an open-source Python library designed to standardize and improve the evaluation of AI systems, particularly agentic ones. GLIDE unifies various prediction-powered inference (PPI) meth…
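
For readers unfamiliar with prediction-powered inference, the following is a minimal sketch of the standard PPI point estimate for a mean: average the model's predictions on a large unlabeled set, then correct that average with the mean prediction error measured on a small labeled set. This is not GLIDE's API, which the summary above does not describe; the function name `ppi_mean` and its parameters are hypothetical and chosen only for illustration.

```python
from statistics import fmean


def ppi_mean(labeled_truth, labeled_preds, unlabeled_preds):
    """Prediction-powered point estimate of a population mean.

    labeled_truth   -- gold labels (e.g. human judgments) on the small labeled set
    labeled_preds   -- model predictions (e.g. automated judge scores) on that same set
    unlabeled_preds -- model predictions on the large unlabeled set
    All names are illustrative assumptions, not part of any library.
    """
    if len(labeled_truth) != len(labeled_preds):
        raise ValueError("labeled_truth and labeled_preds must have equal length")
    if not labeled_truth or not unlabeled_preds:
        raise ValueError("both the labeled and unlabeled sets must be non-empty")

    # Rectifier: average amount by which the predictions miss the gold labels.
    rectifier = fmean([y - p for y, p in zip(labeled_truth, labeled_preds)])

    # Prediction average on unlabeled data, shifted by the measured bias.
    return fmean(unlabeled_preds) + rectifier


if __name__ == "__main__":
    estimate = ppi_mean(
        labeled_truth=[1, 0, 1, 1],
        labeled_preds=[0.5, 0.5, 0.5, 0.5],
        unlabeled_preds=[0.5, 0.5],
    )
    print(estimate)
```

The sketch covers only the point estimate. A full PPI procedure also reports a confidence interval that combines the variance of the predictions with the variance of the rectifier, which is omitted here.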