Poster "adam optimizer" Papers
3 papers found
Conference
Deconstructing What Makes a Good Optimizer for Autoregressive Language Models
Rosie Zhao, Depen Morwani, David Brandfonbrener et al.
ICLR 2025
37
citations
Studying K-FAC Heuristics by Viewing Adam through a Second-Order Lens
Ross Clarke, Jose Miguel Hernandez-Lobato
ICML 2024arXiv:2310.14963
2
citations
Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise
Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook et al.
ICML 2024arXiv:2402.01567
22
citations