Poster "optimizer state compression" Papers
3 papers found
COAT: Compressing Optimizer states and Activations for Memory-Efficient FP8 Training
Haocheng Xi, Han Cai, Ligeng Zhu et al.
ICLR 2025, arXiv:2410.19313
19 citations
Irrational Complex Rotations Empower Low-bit Optimizers
Zhen Tian, Xin Zhao, Ji-Rong Wen
NeurIPS 2025, arXiv:2501.12896
ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking
Wenshuo Li, Xinghao Chen, Han Shu et al.
ICML 2024, arXiv:2406.11257
9 citations