Poster "layer pruning" Papers
3 papers found
Conference
Streamlining Redundant Layers to Compress Large Language Models
Xiaodong Chen, Yuxuan Hu, Jing Zhang et al.
ICLR 2025, arXiv:2403.19135
19 citations
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov, Kushal Tirumala, Hassan Shapourian et al.
ICLR 2025, arXiv:2403.17887
172 citations
LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging
Jinuk Kim, Marwa El Halabi, Mingi Ji et al.
ICML 2024, arXiv:2406.12837
5 citations