"quantization" Papers
2 papers found
KVSink: Understanding and Enhancing the Preservation of Attention Sinks in KV Cache Quantization for LLMs
Zunhai Su, Kehong Yuan
COLM 2025 paper, arXiv:2508.04257
8 citations
Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity
Yiyue Chen, Haris Vikalo, Chianing Wang
AAAI 2024 paper, arXiv:2312.13380
13 citations