Papers by Shuqing Luo
2 papers found
Conference
Mozart: Modularized and Efficient MoE Training on 3.5D Wafer-Scale Chiplet Architectures
Shuqing Luo, Ye Han, Pingzhi Li et al.
NeurIPS 2025, spotlight
Occult: Optimizing Collaborative Communications across Experts for Accelerated Parallel MoE Training and Inference
Shuqing Luo, Pingzhi Li, Jie Peng et al.
ICML 2025, arXiv:2505.13345
2 citations