The Attribution Impossibility: No Feature Ranking Is Faithful, Stable, and Complete Under Collinearity
A new research paper published on arXiv demonstrates that no feature ranking method can be simultaneously faithful, stable, and complete when features are collinear. The study proves this impossibility and quantifies it across various model classes, suggesting that ensemble averaging methods like DASH can resolve this issue. The findings have direct implications for fairness auditing, indicating that SHAP-based proxy discrimination audits are unreliable under collinearity. AI
IMPACT Highlights fundamental limitations in current explainable AI methods, impacting fairness audits and model interpretability.