PulseAugur
EN
LIVE 14:01:20

New method analyzes how annotator identity affects hate speech judgments

Researchers have developed an extension of Supervised Semantic Differential (SSD) called interaction SSD, designed to analyze how semantic meanings change across different groups or conditions. This new method can identify main semantic gradients, interaction gradients, and conditional gradients, all of which are interpretable using existing SSD tools. The technique was applied to the UC Berkeley Measuring Hate Speech corpus to investigate if annotator racial identity influences judgments of hate speech directed at people of color. The findings indicated a significant moderation effect, with a shared gradient distinguishing between dehumanizing hostility and counter-speech, and an interaction gradient highlighting subtle group-specific variations in how semantic cues predict hate speech ratings. AI

IMPACT Introduces a novel method for analyzing bias in AI models, potentially improving fairness in hate speech detection.

RANK_REASON The cluster contains an academic paper detailing a new research method and its application.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

New method analyzes how annotator identity affects hate speech judgments

COVERAGE [2]

  1. arXiv cs.CL TIER_1 English(EN) · Felix Ostrowicki, Hubert Plisiecki ·

    Semantic Gradients Interactions in SSD: A Case Study in Racial Identity and Hate Speech

    arXiv:2605.27322v1 Announce Type: new Abstract: We introduce interaction SSD, an extension of Supervised Semantic Differential that models how semantic meaning varies across moderators such as groups, traits, or conditions making this variation testable and interpretable. The met…

  2. arXiv cs.CL TIER_1 English(EN) · Hubert Plisiecki ·

    Semantic Gradients Interactions in SSD: A Case Study in Racial Identity and Hate Speech

    We introduce interaction SSD, an extension of Supervised Semantic Differential that models how semantic meaning varies across moderators such as groups, traits, or conditions making this variation testable and interpretable. The method estimates a main semantic gradient, an inter…