Poster "model scalability" Papers
5 papers found
Conference
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing
Ziteng Wang, Jun Zhu, Jianfei Chen
ICLR 2025arXiv:2412.14711
31
citations
Towards Neural Scaling Laws for Time Series Foundation Models
Qingren Yao, Chao-Han Huck Yang, Renhe Jiang et al.
ICLR 2025arXiv:2410.12360
26
citations
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
Peihao Wang, Ruisi Cai, Yuehao Wang et al.
ICLR 2025arXiv:2501.00658
7
citations
On the Embedding Collapse when Scaling up Recommendation Models
Xingzhuo Guo, Junwei Pan, Ximei Wang et al.
ICML 2024arXiv:2310.04400
47
citations
Sapiens: Foundation for Human Vision Models
Rawal Khirodkar, Timur Bagautdinov, Julieta Martinez et al.
ECCV 2024arXiv:2408.12569
179
citations