"speculative sampling" Papers
4 papers found
Conference
EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test
Yuhui Li, Fangyun Wei, Chao Zhang et al.
NeurIPS 2025, arXiv:2503.01840
115 citations
Multi-Draft Speculative Sampling: Canonical Decomposition and Theoretical Limits
Ashish Khisti, MohammadReza Ebrahimi, Hassan Dbouk et al.
ICLR 2025, arXiv:2410.18234
4 citations
Accelerated Speculative Sampling Based on Tree Monte Carlo
Zhengmian Hu, Heng Huang
ICML 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Yuhui Li, Fangyun Wei, Chao Zhang et al.
ICML 2024, arXiv:2401.15077
338 citations