PulseAugur
实时 20:02:25
English(EN) Vision-Language Models Suppress Female Representations Under Ambiguous Input

视觉-语言模型显示出隐藏的性别偏见,压制女性表征

一项新的研究论文揭示,视觉-语言模型(VLMs)即使在被调整以避免人口统计学刻板印象的情况下,也表现出对女性表征的隐藏偏见。当呈现模糊的视觉输入时,这些模型倾向于默认将职业与男性关联,而不管其内部编码如何。该研究引入了一个名为 LALS 的指标来衡量这些内部概念关联,发现女性信号在输出生成之前就被压制了,而男性信号则在整个过程中被放大。 AI

影响 揭示了视觉-语言模型中一种微妙的内部偏见,这可能会影响下游应用,并强调了对更细致的对齐技术的需求。

排序理由 该集群包含一篇详细介绍 AI 模型偏见研究结果的研究论文。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Arnau Marin-Llobet, Simon Henniger, Mahzarin R. Banaji ·

    Vision-Language Models Suppress Female Representations Under Ambiguous Input

    arXiv:2605.31556v1 Announce Type: cross Abstract: Alignment teaches vision-language models (VLMs) to avoid expressing demographic biases, and when gender is clearly visible they largely succeed. Far less is known about ambiguous inputs (a worker in full gear, a figure seen from b…

  2. arXiv cs.AI TIER_1 English(EN) · Mahzarin R. Banaji ·

    视觉-语言模型在模糊输入下压制女性表征

    Alignment teaches vision-language models (VLMs) to avoid expressing demographic biases, and when gender is clearly visible they largely succeed. Far less is known about ambiguous inputs (a worker in full gear, a figure seen from behind) cases common in practice yet rarely studied…