Poster "initialization schemes" Papers
2 papers found
Fast Training of Sinusoidal Neural Fields via Scaling Initialization
Taesun Yeom, Sangyoon Lee, Jaeho Lee
ICLR 2025, arXiv:2410.04779
9 citations
Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models
Akhil Kedia, Mohd Abbas Zaidi, Sushil Khyalia et al.
ICML 2024, arXiv:2403.09635
11 citations