Spotlight "model interpretability" Papers
3 papers found
Conference
Self-Assembling Graph Perceptrons
Jialong Chen, Tong Wang, Bowen Deng et al.
NEURIPS 2025spotlight
The Fragile Truth of Saliency: Improving LLM Input Attribution via Attention Bias Optimization
Yihua Zhang, Changsheng Wang, Yiwei Chen et al.
NEURIPS 2025spotlight
Explaining Probabilistic Models with Distributional Values
Luca Franceschi, Michele Donini, Cedric Archambeau et al.
ICML 2024spotlightarXiv:2402.09947
3
citations