"decision boundaries" Papers
3 papers found
Conference
Beyond Interpretability: The Gains of Feature Monosemanticity on Model Robustness
Qi Zhang, Yifei Wang, Jingyi Cui et al.
ICLR 2025arXiv:2410.21331
4
citations
OOD Detection with Relative Angles
Berker Demirel, Marco Fumero, Francesco Locatello
NEURIPS 2025
Unlearning-based Neural Interpretations
Ching Lam Choi, Alexandre Duplessis, Serge Belongie
ICLR 2025arXiv:2410.08069
1
citations