"layer pruning" Papers
5 papers found
ALTER: All-in-One Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation
Xiaomeng Yang, Lei Lu, Qihui Fan et al.
NeurIPS 2025 (oral), arXiv:2505.21817, 1 citation
EfficientVLA: Training-Free Acceleration and Compression for Vision-Language-Action Models
Yantai Yang, Yuhao Wang, Zichen Wen et al.
NeurIPS 2025 (oral), arXiv:2506.10100, 34 citations
Streamlining Redundant Layers to Compress Large Language Models
Xiaodong Chen, Yuxuan Hu, Jing Zhang et al.
ICLR 2025, arXiv:2403.19135, 19 citations
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov, Kushal Tirumala, Hassan Shapourian et al.
ICLR 2025, arXiv:2403.17887, 172 citations
LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging
Jinuk Kim, Marwa El Halabi, Mingi Ji et al.
ICML 2024, arXiv:2406.12837, 5 citations