"autoregressive language models" Papers

11 papers found

Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning

Jiacheng Ye, Jiahui Gao, Shansan Gong et al.

ICLR 2025, arXiv:2410.14157
84 citations

Correlation Dimension of Autoregressive Large Language Models

Xin Du, Kumiko Tanaka-Ishii

NEURIPS 2025

Generating Physically Stable and Buildable Brick Structures from Text

Ava Pun, Kangle Deng, Ruixuan Liu et al.

ICCV 2025, arXiv:2505.05469
8 citations

Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale

James Michaelov, Roger Levy, Benjamin Bergen

NEURIPS 2025 (oral), arXiv:2510.24963
2 citations

Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation

Xinyu Yang, Yuwei An, Hongyi Liu et al.

NEURIPS 2025 (spotlight), arXiv:2506.09991
21 citations

Repetition Improves Language Model Embeddings

Jacob Springer, Suhas Kotha, Daniel Fried et al.

ICLR 2025, arXiv:2402.15449
60 citations

Unifying Autoregressive and Diffusion-Based Sequence Generation

Nima Fathi, Torsten Scholak, Pierre-Andre Noel

COLM 2025, arXiv:2504.06416
11 citations

Arrows of Time for Large Language Models

Vassilis Papadopoulos, Jérémie Wenger, Clement Hongler

ICML 2024, arXiv:2401.17505
14 citations

GiLOT: Interpreting Generative Language Models via Optimal Transport

Xuhong Li, Jiamin Chen, Yekun Chai et al.

ICML 2024

Quantifying and Analyzing Entity-Level Memorization in Large Language Models

Zhenhong Zhou, Jiuyang Xiang, Chaomeng Chen et al.

AAAI 2024, arXiv:2308.15727
21 citations

Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models

Tanmay Gautam, Youngsuk Park, Hao Zhou et al.

ICML 2024, arXiv:2404.08080
39 citations