A new study analyzing the LAION-5B image dataset has uncovered significant demographic and stereotypical biases. Researchers found that the dataset overrepresents young adults, White individuals, and males, while underrepresenting minority racial groups and older women. Furthermore, the study identified stereotypical associations between emotions and demographics, such as anger being linked to males and happiness to females. These deeply embedded imbalances in LAION-5B, a widely used training dataset, could influence the behavior and outputs of numerous downstream AI systems. AI
IMPACT Reveals systemic biases in a foundational AI training dataset that could propagate harmful stereotypes in downstream AI systems.
RANK_REASON Research paper analyzing biases in a large-scale image dataset. [lever_c_demoted from research: ic=1 ai=1.0]
- DeepFace
- Emo-AffectNet
- FairFace
- Hugging Face
- Iris Dominguez Catena
- LAION-2B-en
- LAION-2B-multi
- LAION-5B
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →