"weight normalization" Papers
3 papers found
Conference
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models
Yonggan Fu, Xin Dong, Shizhe Diao et al.
NEURIPS 2025arXiv:2511.18890
2
citations
Scaling Off-Policy Reinforcement Learning with Batch and Weight Normalization
Daniel Palenicek, Florian Vogt, Joe Watson et al.
NEURIPS 2025arXiv:2502.07523
9
citations
PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning
Jaejun Lee, Minsung Hwang, Joyce Whang
ICML 2024arXiv:2405.06418
2
citations