Spotlight "distributed optimization" Papers
3 papers found
Conference
Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo
Zachary Charles, Gabriel Teston, Lucio Dery et al.
NeurIPS 2025, spotlight, arXiv:2503.09799
14 citations
Faster Adaptive Decentralized Learning Algorithms
Feihu Huang, Jianyu Zhao
ICML 2024, spotlight, arXiv:2408.09775
3 citations
On the Complexity of Finite-Sum Smooth Optimization under the Polyak–Łojasiewicz Condition
Yunyan Bai, Yuxing Liu, Luo Luo
ICML 2024, spotlight, arXiv:2402.02569
2 citations