Poster "loss landscape" Papers
3 papers found
Conference
Neural Thermodynamics: Entropic Forces in Deep and Universal Representation Learning
Liu Ziyin, Yizhou Xu, Isaac Chuang
NEURIPS 2025arXiv:2505.12387
5
citations
The AdEMAMix Optimizer: Better, Faster, Older
Matteo Pagliardini, Pierre Ablin, David Grangier
ICLR 2025arXiv:2409.03137
27
citations
On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Yihao Zhang, Hangzhou He, Jingyu Zhu et al.
ICML 2024arXiv:2402.15152
25
citations