"language model pretraining" Papers
4 papers found
Conference
Analyzing Similarity Metrics for Data Selection for Language Model Pretraining
Dylan Sam, Ayan Chakrabarti, Afshin Rostamizadeh et al.
NEURIPS 2025arXiv:2502.02494
1
citations
Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining
Ping Guo, Yubing Ren, BINBINLIU et al.
NEURIPS 2025arXiv:2509.15556
1
citations
Group-Level Data Selection for Efficient Pretraining
Zichun Yu, Fei Peng, Jie Lei et al.
NEURIPS 2025arXiv:2502.14709
2
citations
Multi-Token Prediction Needs Registers
Anastasios Gerontopoulos, Spyridon Gidaris, Nikos Komodakis
NEURIPS 2025arXiv:2505.10518
4
citations