"large learning rates" Papers
2 papers found
Conference
A Minimalist Example of Edge-of-Stability and Progressive Sharpening
Liming Liu, Zixuan Zhang, Simon Du et al.
NEURIPS 2025arXiv:2503.02809
1
citations
Quadratic models for understanding catapult dynamics of neural networks
Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan et al.
ICLR 2024arXiv:2205.11787
16
citations