"inference-time compute" Papers
2 papers found
Conference
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
Yinlam Chow, Guy Tennenholtz, Izzeddin Gur et al.
ICLR 2025arXiv:2412.15287
49
citations
SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning
Rui Pan, Yinwei Dai, Zhihao Zhang et al.
NEURIPS 2025arXiv:2504.07891
37
citations