This paper introduces Interpretability-Guided Bi-objective Optimization (IGBO), a new framework designed to train models that are both accurate and interpretable. IGBO integrates structured domain knowledge by using a bi-objective formulation and encodes feature importance hierarchies into a Directed Acyclic Graph (DAG). The framework utilizes Temporal Integrated Gradients (TIG) to measure feature importance and proposes a novel Relative Importance Score for quantifying feature attribution over time.
Impact: Introduces a novel framework for enhancing model interpretability, potentially aiding in the development of more trustworthy AI systems.
Ranking rationale: This is a research paper detailing a new framework for model interpretability.
- Central Limit Theorem
- Directed Acyclic Graph
- Interpretability-Guided Bi-objective Optimization
- Temporal Integrated Gradients
AI-generated summary · Google Gemini · from 1 source.