"autoregressive language models" Papers
11 papers found
Conference
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
Jiacheng Ye, Jiahui Gao, Shansan Gong et al.
ICLR 2025arXiv:2410.14157
84
citations
Correlation Dimension of Autoregressive Large Language Models
Xin Du, Kumiko Tanaka-Ishii
NEURIPS 2025
Generating Physically Stable and Buildable Brick Structures from Text
Ava Pun, Kangle Deng, Ruixuan Liu et al.
ICCV 2025arXiv:2505.05469
8
citations
Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale
James Michaelov, Roger Levy, Benjamin Bergen
NEURIPS 2025oralarXiv:2510.24963
2
citations
Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation
Xinyu Yang, Yuwei An, Hongyi Liu et al.
NEURIPS 2025spotlightarXiv:2506.09991
21
citations
Repetition Improves Language Model Embeddings
Jacob Springer, Suhas Kotha, Daniel Fried et al.
ICLR 2025arXiv:2402.15449
60
citations
Unifying Autoregressive and Diffusion-Based Sequence Generation
Nima Fathi, Torsten Scholak, Pierre-Andre Noel
COLM 2025paperarXiv:2504.06416
11
citations
Arrows of Time for Large Language Models
Vassilis Papadopoulos, Jérémie Wenger, Clement Hongler
ICML 2024arXiv:2401.17505
14
citations
GiLOT: Interpreting Generative Language Models via Optimal Transport
Xuhong Li, Jiamin Chen, Yekun Chai et al.
ICML 2024
Quantifying and Analyzing Entity-Level Memorization in Large Language Models
Zhenhong Zhou, Jiuyang Xiang, Chaomeng Chen et al.
AAAI 2024paperarXiv:2308.15727
21
citations
Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models
Tanmay Gautam, Youngsuk Park, Hao Zhou et al.
ICML 2024arXiv:2404.08080
39
citations