"model interpretability" Papers
54 papers found • Page 2 of 2
Position: Stop Making Unscientific AGI Performance Claims
Patrick Altmeyer, Andrew Demetriou, Antony Bartlett et al.
ICML 2024 • arXiv:2402.03962 • 9 citations
Provably Better Explanations with Optimized Aggregation of Feature Attributions
Thomas Decker, Ananta Bhattarai, Jindong Gu et al.
ICML 2024 • arXiv:2406.05090 • 6 citations
Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer
Junyi Wu, Bin Duan, Weitai Kang et al.
CVPR 2024 • arXiv:2403.14552 • 16 citations
Towards Modeling Uncertainties of Self-explaining Neural Networks via Conformal Prediction
Wei Qian, Chenxu Zhao, Yangyi Li et al.
AAAI 2024 • arXiv:2401.01549 • 10 citations