"pretraining efficiency" Papers
2 papers found
Conference
Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining
Daouda Sow, Herbert Woisetschläger, Saikiran Bulusu et al.
ICLR 2025arXiv:2502.06733
13
citations
Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models
Raviteja Vemulapalli, Hadi Pouransari, Fartash Faghri et al.
ICML 2024arXiv:2311.18237
13
citations