"language model generalization" Papers
4 papers found
Conference
Characterizing the Expressivity of Fixed-Precision Transformer Language Models
Jiaoda Li, Ryan Cotterell
NEURIPS 2025oralarXiv:2505.23623
4
citations
Generalization v.s. Memorization: Tracing Language Models’ Capabilities Back to Pretraining Data
Xinyi Wang, Antonis Antoniades, Yanai Elazar et al.
ICLR 2025arXiv:2407.14985
80
citations
Mapping from Meaning: Addressing the Miscalibration of Prompt-Sensitive Language Models
Kyle Cox, Jiawei Xu, Yikun Han et al.
AAAI 2025paperarXiv:2510.17028
3
citations
To Code or Not To Code? Exploring Impact of Code in Pre-training
Viraat Aryabumi, Yixuan Su, Raymond Ma et al.
ICLR 2025arXiv:2408.10914
44
citations