Poster "weight quantization" Papers
7 papers found
Conference
Cauchy-Schwarz Regularizers
Sueda Taner, Ziyi Wang, Christoph Studer
ICLR 2025arXiv:2503.01639
Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression
Xi Zhang, Xiaolin Wu, Jiamang Wang et al.
NEURIPS 2025arXiv:2510.20984
S$^2$NN: Sub-bit Spiking Neural Networks
Wenjie Wei, Malu Zhang, Jieyuan (Eric) Zhang et al.
NEURIPS 2025arXiv:2509.24266
Training-Free Activation Sparsity in Large Language Models
James Liu, Pragaash Ponnusamy, Tianle Cai et al.
ICLR 2025arXiv:2408.14690
39
citations
A2Q+: Improving Accumulator-Aware Weight Quantization
Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig et al.
ICML 2024arXiv:2401.10432
10
citations
ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking
Wenshuo Li, Xinghao Chen, Han Shu et al.
ICML 2024arXiv:2406.11257
9
citations
Extreme Compression of Large Language Models via Additive Quantization
Vage Egiazarian, Andrei Panferov, Denis Kuznedelev et al.
ICML 2024arXiv:2401.06118
160
citations