"reasoning llms" Papers
3 papers found
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
Pranjal Aggarwal, Sean Welleck
COLM 2025 | arXiv:2503.04697
250 citations
LIFEBENCH: Evaluating Length Instruction Following in Large Language Models
Wei Zhang, Zhenhong Zhou, Kun Wang et al.
NeurIPS 2025 | arXiv:2505.16234
2 citations
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
Bingchen Zhao, Despoina Magka, Minqi Jiang et al.
NeurIPS 2025 | arXiv:2506.22419
2 citations