by Bingni Zhang Papers
3 papers found
Conference
Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining
Ping Guo, Yubing Ren, BINBINLIU et al.
NEURIPS 2025arXiv:2509.15556
1
citations
Frame-Voyager: Learning to Query Frames for Video Large Language Models
Sicheng Yu, CHENGKAI JIN, Huanyu Wang et al.
ICLR 2025arXiv:2410.03226
42
citations
MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining
Zhixun Chen, Ping Guo, Wenhan Han et al.
NEURIPS 2025arXiv:2507.01785