"parallel token prediction" Papers
3 papers found
Conference
DINGO: Constrained Inference for Diffusion LLMs
Tarun Suresh, Debangshu Banerjee, Shubham Ugare et al.
NEURIPS 2025arXiv:2505.23061
4
citations
Neighboring Autoregressive Modeling for Efficient Visual Generation
Yefei He, Yuanyu He, Shaoxuan He et al.
ICCV 2025arXiv:2503.10696
19
citations
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Tianle Cai, Yuhong Li, Zhengyang Geng et al.
ICML 2024arXiv:2401.10774
549
citations