Poster "verifiable rewards" Papers
4 papers found
Conference
Generalizing Verifiable Instruction Following
Valentina Pyatkin, Saumya Malik, Victoria Graf et al.
NEURIPS 2025arXiv:2507.02833
38
citations
Rethinking Verification for LLM Code Generation: From Generation to Testing
Zihan Ma, Taolin Zhang, Maosongcao et al.
NEURIPS 2025arXiv:2507.06920
7
citations
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
Junteng Liu, Yuanxiang Fan, Jiang Zhuo et al.
NEURIPS 2025arXiv:2505.19641
23
citations
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
Xiaoyuan Liu, Tian Liang, Zhiwei He et al.
NEURIPS 2025arXiv:2505.13445
18
citations