"generalization analysis" Papers

11 papers found

Impact of Layer Norm on Memorization and Generalization in Transformers

Rishi Singhal, Jung-Eun Kim

NeurIPS 2025, arXiv:2511.10566, 1 citation

Rethinking Evaluation of Infrared Small Target Detection

Youwei Pang, Xiaoqi Zhao, Lihe Zhang et al.

NeurIPS 2025, arXiv:2509.16888

Stability-based Generalization Analysis of Randomized Coordinate Descent for Pairwise Learning

Liang Wu, Ruixi Hu, Yunwen Lei

AAAI 2025, arXiv:2503.01530

Towards Macro-AUC Oriented Imbalanced Multi-Label Continual Learning

Yan Zhang, Guoqiang Wu, Bingzheng Wang et al.

AAAI 2025, arXiv:2412.18231, 1 citation

Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling

Xiao Li, Zekai Zhang, Xiang Li et al.

NeurIPS 2025, arXiv:2502.05743, 6 citations

From Generalization Analysis to Optimization Designs for State Space Models

Fusheng Liu, Qianxiao Li

ICML 2024 (oral), arXiv:2405.02670, 11 citations

Generalization Analysis of Stochastic Weight Averaging with General Sampling

Wang Peng, Li Shen, Zerui Tao et al.

ICML 2024

How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?

Hongkang Li, Meng Wang, Songtao Lu et al.

ICML 2024, arXiv:2402.15607, 34 citations

Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection

Feiran Li, Qianqian Xu, Shilong Bao et al.

ICML 2024 (spotlight), arXiv:2405.09782, 14 citations

Stability and Generalization for Stochastic Recursive Momentum-based Algorithms for (Strongly-)Convex One to $K$-Level Stochastic Optimizations

Xiaokang Pan, Xingyu Li, Jin Liu et al.

ICML 2024, arXiv:2407.05286

Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms

Ming Yang, Xiyuan Wei, Tianbao Yang et al.

ICML 2024, arXiv:2307.03357, 3 citations