Poster "next-token prediction" Papers
16 papers found
Conference
Context-Aware Regularization with Markovian Integration for Attention-Based Nucleotide Analysis
Mohammad Saleh Refahi, Mahdi Abavisani, Bahrad Sokhansanj et al.
NEURIPS 2025arXiv:2507.09378
Correlation and Navigation in the Vocabulary Key Representation Space of Language Models
Letian Peng, Chenyang An, Jingbo Shang
ICLR 2025arXiv:2410.02284
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding
Chenxin Tao, Shiqian Su, Xizhou Zhu et al.
CVPR 2025arXiv:2412.16158
5
citations
Implicit Search via Discrete Diffusion: A Study on Chess
Jiacheng Ye, Zhenyu Wu, Jiahui Gao et al.
ICLR 2025arXiv:2502.19805
14
citations
Lines of Thought in Large Language Models
Raphaël Sarfati, Toni Liu, Nicolas Boulle et al.
ICLR 2025arXiv:2410.01545
3
citations
Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation
Xiuyu Yang, Shuhan Tan, Philipp Kraehenbuehl
ICCV 2025arXiv:2506.17213
3
citations
OmniGen-AR: AutoRegressive Any-to-Image Generation
Junke Wang, Xun Wang, Qiushan Guo et al.
NEURIPS 2025
On the Bias of Next-Token Predictors Toward Systematically Inefficient Reasoning: A Shortest-Path Case Study
Riccardo Alberghi, Elizaveta Demyanenko, Luca Biggio et al.
NEURIPS 2025arXiv:2507.05362
1
citations
Re-Thinking Inverse Graphics With Large Language Models
Haiwen Feng, Michael J Black, Weiyang Liu et al.
ICLR 2025arXiv:2404.15228
16
citations
Scaling Laws for Gradient Descent and Sign Descent for Linear Bigram Models under Zipf’s Law
Frederik Kunstner, Francis Bach
NEURIPS 2025arXiv:2505.19227
12
citations
VladVA: Discriminative Fine-tuning of LVLMs
Yassine Ouali, Adrian Bulat, ALEXANDROS XENOS et al.
CVPR 2025arXiv:2412.04378
11
citations
Auto-Regressive Next-Token Predictors are Universal Learners
Eran Malach
ICML 2024arXiv:2309.06979
55
citations
PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
Michael Dorkenwald, Nimrod Barazani, Cees G. M. Snoek et al.
CVPR 2024arXiv:2402.08657
15
citations
Tandem Transformers for Inference Efficient LLMs
Aishwarya P S, Pranav Nair, Yashas Samaga et al.
ICML 2024arXiv:2402.08644
10
citations
The Pitfalls of Next-Token Prediction
Gregor Bachmann, Vaishnavh Nagarajan
ICML 2024arXiv:2403.06963
139
citations
Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation
Xinyi Wang, Alfonso Amayuelas, Kexun Zhang et al.
ICML 2024arXiv:2402.03268
25
citations