"interpretable concepts" Papers
4 papers found
An Analysis of Concept Bottleneck Models: Measuring, Understanding, and Mitigating the Impact of Noisy Annotations
Seonghwan Park, Jueun Mun, Donghyun Oh et al.
NeurIPS 2025, arXiv:2505.16705
2 citations
Shortcuts and Identifiability in Concept-based Models from a Neuro-Symbolic Lens
Samuele Bortolotti, Emanuele Marconato, Paolo Morettin et al.
NeurIPS 2025, arXiv:2502.11245
9 citations
Sparse autoencoders reveal selective remapping of visual concepts during adaptation
Hyesu Lim, Jinho Choi, Jaegul Choo et al.
ICLR 2025, arXiv:2412.05276
31 citations
PRIME: Prioritizing Interpretability in Failure Mode Extraction
Keivan Rezaei, Mehrdad Saberi, Mazda Moayeri et al.
ICLR 2024, arXiv:2310.00164
9 citations