Poster "activation quantization" Papers
3 papers found
Conference
COAT: Compressing Optimizer states and Activations for Memory-Efficient FP8 Training
Haocheng Xi, Han Cai, Ligeng Zhu et al.
ICLR 2025arXiv:2410.19313
19
citations
GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers
Guang Liang, Xinyao Liu, Jianxin Wu
NEURIPS 2025arXiv:2506.11784
4
citations
Instance-Aware Group Quantization for Vision Transformers
Jaehyeon Moon, Dohyung Kim, Jun Yong Cheon et al.
CVPR 2024arXiv:2404.00928
15
citations