Poster "mixed-precision quantization" Papers
4 papers found
Conference
Mixture Compressor for Mixture-of-Experts LLMs Gains More
Wei Huang, Yue Liao, Jianhui Liu et al.
ICLR 2025arXiv:2410.06270
24
citations
OuroMamba: A Data-Free Quantization Framework for Vision Mamba
Akshat Ramachandran, Mingyu Lee, Huan Xu et al.
ICCV 2025arXiv:2503.10959
4
citations
Progressive Mixed-Precision Decoding for Efficient LLM Inference
Hao (Mark) Chen, Fuwen Tan, Alexandros Kouris et al.
ICLR 2025arXiv:2410.13461
8
citations
AMPA: Adaptive Mixed Precision Allocation for Low-Bit Integer Training
Li Ding, Wen Fei, Yuyang Huang et al.
ICML 2024