Paper "low-bit quantization" Papers
2 papers found
Conference
ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization
Weibo Zhao, Yubin Shi, Xinyu Lyu et al.
AAAI 2025paperarXiv:2411.07762
3
citations
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
Kanghyun Choi, Hyeyoon Lee, Dain Kwon et al.
AAAI 2025paperarXiv:2407.20021
7
citations