Poster "learning dynamics" Papers
7 papers found
A Solvable Attention for Neural Scaling Laws
Bochen Lyu, Di Wang, Zhanxing Zhu
ICLR 2025
5 citations
Differentiation and Specialization of Attention Heads via the Refined Local Learning Coefficient
George Wang, Jesse Hoogland, Stan van Wingerden et al.
ICLR 2025, arXiv:2410.02984
24 citations
Learning Dynamics of LLM Finetuning
Yi Ren, Danica Sutherland
ICLR 2025, arXiv:2407.10490
67 citations
Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes
Nabeel Seedat, Nicolas Huynh, Boris van Breugel et al.
ICML 2024, arXiv:2312.12112
51 citations
Impact of Decentralized Learning on Player Utilities in Stackelberg Games
Kate Donahue, Nicole Immorlica, Meena Jagadeesan et al.
ICML 2024, arXiv:2403.00188
6 citations
Prediction Accuracy of Learning in Games: Follow-the-Regularized-Leader meets Heisenberg
Yi Feng, Georgios Piliouras, Xiao Wang
ICML 2024, arXiv:2406.10603
2 citations
Self-attention Networks Localize When QK-eigenspectrum Concentrates
Han Bao, Ryuichiro Hataya, Ryo Karakida
ICML 2024, arXiv:2402.02098
11 citations