Researchers have developed H-Sets, a new framework designed to uncover and attribute higher-order feature interactions within image classifiers. This method moves beyond analyzing individual features to understand how groups of features collectively influence a model's output. H-Sets utilizes input Hessians to detect interacting feature pairs and then merges them into coherent sets, employing a set-level extension of Integrated Directional Gradients for attribution. Evaluations on various models and datasets indicate that H-Sets produce more interpretable and faithful saliency maps compared to existing techniques. AI
影响 Enhances interpretability of image classifiers by revealing complex feature interactions, potentially improving model debugging and trust.
排序理由 Academic paper detailing a new method for feature attribution in image classifiers.
AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →