"weight decay regularization" Papers
2 papers found
Conference
AlphaDecay: Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs
Di He, Songjun Tu, Ajay Jaiswal et al.
NeurIPS 2025, arXiv:2506.14562
1 citation
Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial Time
Sungyoon Kim, Mert Pilanci
ICML 2024 (spotlight), arXiv:2402.03625
7 citations