"transformer architecture analysis" Papers
3 papers found
Conference
Correlation Dimension of Autoregressive Large Language Models
Xin Du, Kumiko Tanaka-Ishii
NEURIPS 2025
Transformer Layers as Painters
Qi Sun, Marc Pickett, Aakash Kumar Nain et al.
AAAI 2025paperarXiv:2407.09298
42
citations
Gradient-based Visual Explanation for Transformer-based CLIP
Chenyang ZHAO, Kun Wang, Xingyu Zeng et al.
ICML 2024