PulseAugur / Brief
EN
LIVE 12:25:14

Brief

last 24h
[1/1] 223 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. Epistemic Injustice in Language Models: An Audit of Pretraining Filters and Guardrails

    A new research paper published on arXiv details how pretraining filters and guardrails in language models can lead to epistemic injustice. The audit found that these systems disproportionately flag content related to marginalized groups, such as transgender people, women, and Central Americans, while often failing to detect explicit hate speech or private information. Human annotators would have retained a significant majority of the content flagged by these automated systems, highlighting a gap in their ability to capture nuanced representational harms. AI

    IMPACT Reveals how current content moderation systems in LLMs can inadvertently silence marginalized voices, necessitating more nuanced approaches to AI safety.