Poster "gradient descent optimization" Papers
8 papers found
Conference
Gradient descent with generalized Newton’s method
Zhiqi Bu, Shiyun Xu
ICLR 2025arXiv:2407.02772
8
citations
Scaling Laws for Gradient Descent and Sign Descent for Linear Bigram Models under Zipf’s Law
Frederik Kunstner, Francis Bach
NEURIPS 2025arXiv:2505.19227
12
citations
Variational Inference with Mixtures of Isotropic Gaussians
Marguerite Petit-Talamon, Marc Lambert, Anna Korba
NEURIPS 2025arXiv:2506.13613
Benign Overfitting in Two-Layer ReLU Convolutional Neural Networks for XOR Data
Xuran Meng, Difan Zou, Yuan Cao
ICML 2024arXiv:2310.01975
10
citations
How Graph Neural Networks Learn: Lessons from Training Dynamics
Chenxiao Yang, Qitian Wu, David Wipf et al.
ICML 2024arXiv:2310.05105
2
citations
Improving Sharpness-Aware Minimization by Lookahead
Runsheng Yu, Youzhi Zhang, James Kwok
ICML 2024
LoRA Training in the NTK Regime has No Spurious Local Minima
Uijeong Jang, Jason Lee, Ernest Ryu
ICML 2024arXiv:2402.11867
35
citations
Theoretical Guarantees for Variational Inference with Fixed-Variance Mixture of Gaussians
Tom Huix, Anna Korba, Alain Oliviero Durmus et al.
ICML 2024arXiv:2406.04012
10
citations