"large-scale pretraining" Papers
2 papers found
Conference
HELM: Hyperbolic Large Language Models via Mixture-of-Curvature Experts
Neil He, Rishabh Anand, Hiren Madhu et al.
NEURIPS 2025arXiv:2505.24722
9
citations
Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining
Florian Tramer, Gautam Kamath, Nicholas Carlini
ICML 2024