Poster "expert load balancing" Papers
2 papers found
CryptoMoE: Privacy-Preserving and Scalable Mixture of Experts Inference via Balanced Expert Routing
Yifan Zhou, Tianshi Xu, Jue Hong et al.
NeurIPS 2025, arXiv:2511.01197
1 citation
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing
Ziteng Wang, Jun Zhu, Jianfei Chen
ICLR 2025, arXiv:2412.14711
31 citations