"compute allocation" Papers
2 papers found
Conference
Predictable Scale (Part II) --- Farseer: A Refined Scaling Law in LLMs
Houyi Li, Wenzhen Zheng, Qiufeng Wang et al.
NEURIPS 2025spotlight
Navigating Scaling Laws: Compute Optimality in Adaptive Model Training
Sotiris Anagnostidis, Gregor Bachmann, Imanol Schlag et al.
ICML 2024spotlightarXiv:2311.03233
2
citations