"sparse neural networks" Papers
8 papers found
Conference
Brain network science modelling of sparse neural networks enables Transformers and LLMs to perform as fully connected
Yingtao Zhang, Diego Cerretti, Jialin Zhao et al.
NEURIPS 2025arXiv:2501.19107
3
citations
Global Minimizers of $\ell^p$-Regularized Objectives Yield the Sparsest ReLU Neural Networks
Julia Nakhleh, Robert Nowak
NEURIPS 2025arXiv:2505.21791
More Experts Than Galaxies: Conditionally-Overlapping Experts with Biologically-Inspired Fixed Routing
Sagi Shaier, Francisco Pereira, Katharina Kann et al.
ICLR 2025arXiv:2410.08003
Nonparametric Quantile Regression with ReLU-Activated Recurrent Neural Networks
Hang Yu, Lyumin Wu, Wenxin Zhou et al.
NEURIPS 2025
Sign-In to the Lottery: Reparameterizing Sparse Training
Advait Gadhikar, Tom Jacobs, chao zhou et al.
NEURIPS 2025arXiv:2504.12801
1
citations
Toward Adaptive Large Language Models Structured Pruning via Hybrid-grained Weight Importance Assessment
Jun Liu, Zhenglun Kong, Pu Zhao et al.
AAAI 2025paperarXiv:2403.10799
14
citations
No Free Prune: Information-Theoretic Barriers to Pruning at Initialization
Tanishq Kumar, Kevin Luo, Mark Sellke
ICML 2024arXiv:2402.01089
9
citations
Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once
Zhangheng Li, Shiwei Liu, Tianlong Chen et al.
ICML 2024