"reinforcement learning with verifiable rewards" Papers
2 papers found
Conference
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Yang Yue, Zhiqi Chen, Rui Lu et al.
NEURIPS 2025oralarXiv:2504.13837
540
citations
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models
Yulei Qin, Gang Li, Zongyi Li et al.
NEURIPS 2025arXiv:2506.01413
5
citations