ENTITY
HateXplain
HateXplain
PulseAugur coverage of HateXplain — every cluster mentioning HateXplain across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
New methods probe generative models for bias and improve performance
Researchers have developed new methods, Attribution Graphs (AGs) and Causal Probing, to analyze the internal workings of generative models. These techniques aim to identify and correct issues like spurious correlations,…
-
Hate speech annotation pipeline flaw silences minority values
A new research paper highlights a critical flaw in how hate speech datasets are annotated, specifically concerning the boundary between offensive and hateful content. The study reveals that annotator disagreement is not…