Poster by XUCHEN Papers
4 papers found
Conference
MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods
Dawei Yang, Yuxuan Yue, Xing Hu et al.
ICLR 2025
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
Zhixuan Chen, Xing Hu, Dawei Yang et al.
ICML 2025arXiv:2505.03804
12
citations
OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting
Xing Hu, Yuan Cheng, Dawei Yang et al.
ICLR 2025arXiv:2501.13987
47
citations
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization
XUCHEN, Yuxuan Yue, Zukang Xu et al.
ICML 2025