"interpretable models" Papers
9 papers found
Conference
Causal Concept Graph Models: Beyond Causal Opacity in Deep Learning
Gabriele Dominici, Pietro Barbiero, Mateo Espinosa Zarlenga et al.
ICLR 2025arXiv:2405.16507
21
citations
DeepHalo: A Neural Choice Model with Controllable Context Effects
Shuhan Zhang, Zhi Wang, Rui Gao et al.
NEURIPS 2025oralarXiv:2601.04616
Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference
Jorge García-Carrasco, Alejandro Maté, Juan Trujillo
AAAI 2025paperarXiv:2412.15750
3
citations
Hybrid Latent Reasoning via Reinforcement Learning
Zhenrui Yue, Bowen Jin, Huimin Zeng et al.
NEURIPS 2025arXiv:2505.18454
8
citations
Object-Centric Concept-Bottlenecks
David Steinmann, Wolfgang Stammer, Antonia Wüst et al.
NEURIPS 2025
The Rashomon Set Has It All: Analyzing Trustworthiness of Trees under Multiplicity
Ethan Hsu, Tony Cao, Lesia Semenova et al.
NEURIPS 2025
Language Model Guided Interpretable Video Action Reasoning
Ning Wang, Guangming Zhu, Hongsheng Li et al.
CVPR 2024arXiv:2404.01591
7
citations
PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers
Ananthu Aniraj, Cassio F. Dantas, Dino Ienco et al.
ECCV 2024arXiv:2407.04538
6
citations
Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction
Zhixuan Chu, Mengxuan Hu, Qing Cui et al.
AAAI 2024paperarXiv:2312.16113
13
citations