"quantized training" Papers
2 papers found
HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs
Saleh Ashkboos, Mahdi Nikdan, Soroush Tabesh et al.
NeurIPS 2025, arXiv:2501.02625
6 citations
Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization
Haocheng Xi, Yuxiang Chen, Kang Zhao et al.
ICML 2024 (spotlight), arXiv:2403.12422
33 citations