Poster "test-time compute scaling" Papers
4 papers found
Conference
Chain-of-Retrieval Augmented Generation
Liang Wang, Haonan Chen, Nan Yang et al.
NEURIPS 2025arXiv:2501.14342
28
citations
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models
Yulei Qin, Gang Li, Zongyi Li et al.
NEURIPS 2025arXiv:2506.01413
5
citations
Preserving Diversity in Supervised Fine-Tuning of Large Language Models
Ziniu Li, Congliang Chen, Tian Xu et al.
ICLR 2025arXiv:2408.16673
37
citations
Value-Guided Search for Efficient Chain-of-Thought Reasoning
Kaiwen Wang, Jin Zhou, Jonathan Chang et al.
NEURIPS 2025arXiv:2505.17373
7
citations