Dirk Groeneveld
5 papers, 283 total citations
Papers (5)
What's In My Big Data? (ICLR 2024, arXiv): 126 citations
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models (CVPR 2025, arXiv): 111 citations
Establishing Task Scaling Laws via Compute-Efficient Model Ladders (COLM 2025, arXiv): 22 citations
DataDecide: How to Predict Best Pretraining Data with Small Experiments (ICML 2025, arXiv): 18 citations
Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training (NeurIPS 2025, arXiv): 6 citations