ENTITY ShieldGemma

PulseAugur coverage of ShieldGemma — every cluster mentioning ShieldGemma across labs, papers, and developer communities, ranked by signal.

Total: 6 over 90d

Releases: 0 over 90d

Papers: 4 over 90d

Sentiment · 30d: 3 days with sentiment data

RECENT · PAGE 1/1 · 6 TOTAL

RESEARCH · CL_109527 · Jun 24 · 13:00

Encoder classifiers offer cost-effective LLM safety evaluation, study finds

A new research paper explores the effectiveness of encoder classifiers, specifically from the ModernBERT family, as a cost-efficient alternative to LLM-based judges for evaluating the safety of large language model outp…
RESEARCH · CL_84362 · Jun 10 · 11:24

New system detects distributional shift in AI safety classifiers

Researchers have developed a new online system designed to monitor distributional shift in deployed AI safety classifiers. This system uses sequential statistics to detect when a classifier's performance degrades due to…
TOOL · CL_79753 · Jun 9 · 04:00

AI safety judges trained with curriculum for improved rubric consistency

Researchers have developed a new training strategy for AI safety judges, aiming to improve their consistency and reliability. The strategy involves using dynamic rubrics generated from prompt-response-label triples to e…
TOOL · CL_38995 · May 19 · 12:01

GLiNER Guard unifies LLM safety and PII detection in single pass

A new system called GLiNER Guard (GLiGuard) has been developed to streamline safety moderation and PII detection for large language models. This unified encoder collapses multiple classifiers and NER models into a singl…
TOOL · CL_30372 · May 13 · 20:41

Fastino Labs open-sources GLiGuard safety model

Fastino Labs has released GLiGuard, an open-source safety moderation model designed to be significantly faster and more efficient than existing solutions. Unlike traditional decoder-only models that generate responses t…
RESEARCH · CL_01364 · Jul 31 · 00:00

Google releases Gemma 2 2B, ShieldGemma, and Gemma Scope

Google has announced updates to its Gemma family of models, including the release of Gemma 2 2B. This new iteration is designed for efficiency and accessibility, aiming to empower developers with powerful yet lightweigh…
