"explanation faithfulness" Papers
6 papers found
FACE: Faithful Automatic Concept Extraction
Dipkamal Bhusal, Michael Clifford, Sara Rampazzi et al.
NeurIPS 2025, arXiv:2510.11675
3 citations
Intrinsic User-Centric Interpretability through Global Mixture of Experts
Vinitra Swamy, Syrielle Montariol, Julian Blackwell et al.
ICLR 2025, arXiv:2402.02933
10 citations
MIX: A Multi-view Time-Frequency Interactive Explanation Framework for Time Series Classification
Viet-Hung Tran, Ngoc Phu Doan, Zichi Zhang et al.
NeurIPS 2025
Towards Human-Understandable Multi-Dimensional Concept Discovery
Arne Grobrügge, Niklas Kühl, Gerhard Satzger et al.
CVPR 2025, arXiv:2503.18629
2 citations
Improving Interpretation Faithfulness for Vision Transformers
Lijie Hu, Yixin Liu, Ninghao Liu et al.
ICML 2024 (spotlight), arXiv:2311.17983
12 citations
Provably Better Explanations with Optimized Aggregation of Feature Attributions
Thomas Decker, Ananta Bhattarai, Jindong Gu et al.
ICML 2024, arXiv:2406.05090
6 citations