"activation function analysis" Papers
3 papers found
Emergence and scaling laws in SGD learning of shallow neural networks
Yunwei Ren, Eshaan Nichani, Denny Wu et al.
NeurIPS 2025, arXiv:2504.19983
17 citations
SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures
Julian Kranz, Davide Gallon, Steffen Dereich et al.
NeurIPS 2025, arXiv:2505.09572
4 citations
How Spurious Features are Memorized: Precise Analysis for Random and NTK Features
Simone Bombari, Marco Mondelli
ICML 2024, arXiv:2305.12100
9 citations