Poster "gradient descent training" Papers
3 papers found
Conference
Feature Averaging: An Implicit Bias of Gradient Descent Leading to Non-Robustness in Neural Networks
Binghui Li, Zhixuan Pan, Kaifeng Lyu et al.
ICLR 2025arXiv:2410.10322
Wide Neural Networks Trained with Weight Decay Provably Exhibit Neural Collapse
Arthur Jacot, Peter Súkeník, Zihan Wang et al.
ICLR 2025arXiv:2410.04887
10
citations
Asymptotics of Learning with Deep Structured (Random) Features
Dominik Schröder, Daniil Dmitriev, Hugo Cui et al.
ICML 2024arXiv:2402.13999
11
citations