Poster "generalization analysis" Papers
7 papers found
Impact of Layer Norm on Memorization and Generalization in Transformers
Rishi Singhal, Jung-Eun Kim
NEURIPS 2025 | arXiv:2511.10566 | 1 citation
Rethinking Evaluation of Infrared Small Target Detection
Youwei Pang, Xiaoqi Zhao, Lihe Zhang et al.
NEURIPS 2025 | arXiv:2509.16888
Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling
Xiao Li, Zekai Zhang, Xiang Li et al.
NEURIPS 2025 | arXiv:2502.05743 | 6 citations
Generalization Analysis of Stochastic Weight Averaging with General Sampling
Wang Peng, Li Shen, Zerui Tao et al.
ICML 2024
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
Hongkang Li, Meng Wang, Songtao Lu et al.
ICML 2024 | arXiv:2402.15607 | 34 citations
Stability and Generalization for Stochastic Recursive Momentum-based Algorithms for (Strongly-)Convex One to $K$-Level Stochastic Optimizations
Xiaokang Pan, Xingyu Li, Jin Liu et al.
ICML 2024 | arXiv:2407.05286
Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms
Ming Yang, Xiyuan Wei, Tianbao Yang et al.
ICML 2024 | arXiv:2307.03357 | 3 citations