"learning rate scaling" Papers
2 papers found
The Optimization Landscape of SGD Across the Feature Learning Strength
Alexander Atanasov, Alexandru Meterez, James Simon et al.
ICLR 2025, arXiv:2410.04642
12 citations
Scaling Exponents Across Parameterizations and Optimizers
Katie Everett, Lechao Xiao, Mitchell Wortsman et al.
ICML 2024, arXiv:2407.05872
51 citations