"resource-constrained deployment" Papers
8 papers found
Conference
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models
Yuxuan Cai, Jiangning Zhang, Haoyang He et al.
ICCV 2025, arXiv:2410.16236
27 citations
QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language Models
Yutong Wang, Haiyu Wang, Sai Qian Zhang
NeurIPS 2025 (spotlight), arXiv:2510.16292
1 citation
SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity
Ke Ma, Jiaqi Tang, Bin Guo et al.
CVPR 2025 (highlight), arXiv:2503.20354
4 citations
Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models
Yoojin Jung, Byung Cheol Song
CVPR 2025, arXiv:2504.04747
1 citation
Building Variable-Sized Models via Learngene Pool
Boyu Shi, Shiyu Xia, Xu Yang et al.
AAAI 2024, arXiv:2312.05743
5 citations
Efficient Stitchable Task Adaptation
Haoyu He, Zizheng Pan, Jing Liu et al.
CVPR 2024, arXiv:2311.17352
7 citations
Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment
Alireza Ganjdanesh, Shangqian Gao, Heng Huang
CVPR 2024, arXiv:2403.19490
17 citations
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization
Jialong Guo, Xinghao Chen, Yehui Tang et al.
ICML 2024, arXiv:2405.11582
34 citations