"generalization improvement" Papers

12 papers found

Neural Collapse Inspired Knowledge Distillation

Shuoxi Zhang, Zijian Song, Kun He

AAAI 2025 | paper | arXiv:2412.11788
1 citation

Not All Data are Good Labels: On the Self-supervised Labeling for Time Series Forecasting

Yuxuan Yang, Dalin Zhang, Yuxuan Liang et al.

NEURIPS 2025 | spotlight | arXiv:2502.14704
1 citation

PoseSyn: Synthesizing Diverse 3D Pose Data from In-the-Wild 2D Data

CHANGHEE YANG, Hyeonseop Song, Seokhun Choi et al.

ICCV 2025 | arXiv:2503.13025
1 citation

Reparameterized LLM Training via Orthogonal Equivalence Transformation

Zeju Qiu, Simon Buchholz, Tim Xiao et al.

NEURIPS 2025 | arXiv:2506.08001
3 citations

Sharpness-Aware Minimization: General Analysis and Improved Rates

Dimitris Oikonomou, Nicolas Loizou

ICLR 2025 | arXiv:2503.02225
8 citations

Toward Real-world Text Image Forgery Localization: Structured and Interpretable Data Synthesis

Zeqin Yu, Haotao Xie, Jian Zhang et al.

NEURIPS 2025 | oral | arXiv:2511.12658

Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning

Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan et al.

ICML 2024 | arXiv:2306.04815
25 citations

Catch-Up Mix: Catch-Up Class for Struggling Filters in CNN

Minsoo Kang, Minkoo Kang, Suhyun Kim

AAAI 2024 | paper | arXiv:2401.13193
7 citations

DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?

Victor Quetu, Enzo Tartaglione

AAAI 2024 | paper | arXiv:2303.01213
6 citations

EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification

Suorong Yang, Furao Shen, Jian Zhao

ECCV 2024 | arXiv:2409.06290
14 citations

Improving Sharpness-Aware Minimization by Lookahead

Runsheng Yu, Youzhi Zhang, James Kwok

ICML 2024

Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup

Damien Teney, Jindong Wang, Ehsan Abbasnejad

ICML 2024 | arXiv:2305.16817
9 citations