English(EN) Bias Leaves a Gradient Trail: Label-Free Bias Identification via Gradient Probes on Concept Decompositions

新方法在不重新训练的情况下识别和缓解视觉模型中的偏见

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-27 17:39

研究人员开发了一种新颖的后验方法，可以在不要求额外标签或重新训练的情况下，识别和缓解已冻结的视觉模型中的偏见。该技术利用概念分解上的梯度探针，根据虚假概念与错误分类示例的交互来对其进行排名。这种方法成功地识别了 Colored MNIST 和 Waterbirds 等数据集中的已知虚假线索，并在 CelebA 中发现了与决策相关的方向，从而显著提高了最差群体准确性。 AI

影响提供了一种新的、无标签的方法来审计和消除已部署的视觉模型中的偏见，在不进行昂贵重新训练的情况下提高公平性。

排序理由该集群包含一篇学术论文，详细介绍了用于人工智能安全的新研究方法。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.LG TIER_1 English(EN) · Thomas Vitry, Kieran Edgeworth, Stefan Wermter, Jae Hee Lee · 2026-05-28 04:00

偏差留下梯度痕迹：通过概念分解上的梯度探测进行无标签偏差识别

arXiv:2605.28780v1 Announce Type: cross Abstract: Vision classifiers can exploit spurious correlations, achieving high in-distribution accuracy yet failing under distribution shift. Existing approaches to bias mitigation and analysis often depend on curated datasets, spurious-att…
arXiv cs.CV TIER_1 English(EN) · Jae Hee Lee · 2026-05-27 17:39

偏差留下了梯度痕迹：通过概念分解上的梯度探测进行无标签偏差识别

Vision classifiers can exploit spurious correlations, achieving high in-distribution accuracy yet failing under distribution shift. Existing approaches to bias mitigation and analysis often depend on curated datasets, spurious-attribute or group labels, or retraining, which may b…

报道来源 [2]

偏差留下梯度痕迹：通过概念分解上的梯度探测进行无标签偏差识别

偏差留下了梯度痕迹：通过概念分解上的梯度探测进行无标签偏差识别

相关实体

相关话题