ENTITY HateXplain

HateXplain

PulseAugur coverage of HateXplain — every cluster mentioning HateXplain across labs, papers, and developer communities, ranked by signal.

Total · 30d

2

2 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

2

2 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 2 TOTAL

TOOL · CL_117726 · Jun 30 · 04:00

New methods probe generative models for bias and improve performance

Researchers have developed new methods, Attribution Graphs (AGs) and Causal Probing, to analyze the internal workings of generative models. These techniques aim to identify and correct issues like spurious correlations,…
TOOL · CL_117577 · Jun 30 · 04:00

Hate speech annotation pipeline flaw silences minority values

A new research paper highlights a critical flaw in how hate speech datasets are annotated, specifically concerning the boundary between offensive and hateful content. The study reveals that annotator disagreement is not…