"network compression" Papers
5 papers found
Conference
How Low Can You Go? Searching for the Intrinsic Dimensionality of Complex Networks using Metric Node Embeddings
Nikolaos Nakis, Niels Raunkjær Holm, Andreas Lyhne Fiehn et al.
ICLR 2025arXiv:2503.01723
2
citations
On-Device Diffusion Transformer Policy for Efficient Robot Manipulation
Yiming Wu, Huan Wang, Zhenghao Chen et al.
ICCV 2025arXiv:2508.00697
2
citations
Till the Layers Collapse: Compressing a Deep Neural Network Through the Lenses of Batch Normalization Layers.
Zhu Liao, Nour Hezbri, Victor Quétu et al.
AAAI 2025paperarXiv:2412.15077
Outlier-aware Slicing for Post-Training Quantization in Vision Transformer
Yuexiao Ma, Huixia Li, Xiawu Zheng et al.
ICML 2024
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity
Lu Yin, You Wu, Zhenyu Zhang et al.
ICML 2024arXiv:2310.05175
152
citations