"sparse mixture-of-experts" Papers
3 papers found
Conference
Tight Clusters Make Specialized Experts
Stefan Nielsen, Rachel Teo, Laziz Abdullaev et al.
ICLR 2025arXiv:2502.15315
6
citations
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts
Byeongjun Park, Hyojun Go, Jin-Young Kim et al.
ECCV 2024arXiv:2403.09176
23
citations
Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts
Shengzhuang Chen, Jihoon Tack, Yunqiao Yang et al.
ICML 2024arXiv:2403.08477
4
citations