"autoregressive language modeling" Papers
4 papers found
Deconstructing What Makes a Good Optimizer for Autoregressive Language Models
Rosie Zhao, Depen Morwani, David Brandfonbrener et al.
ICLR 2025
37 citations
STAR: Synthesis of Tailored Architectures
Armin Thomas, Rom Parnichkun, Alexander Amini et al.
ICLR 2025, arXiv:2411.17800
8 citations
Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension
Fan Yin, Jayanth Srinivasa, Kai-Wei Chang
ICML 2024, arXiv:2402.18048
40 citations
LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions
Victor Agostinelli III, Sanghyun Hong, Lizhong Chen
ICML 2024