by Jiaze Chen Papers
2 papers found
Conference
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Qiying Yu, Zheng Zhang, Ruofei Zhu et al.
NEURIPS 2025arXiv:2503.14476
1213
citations
Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
Jiangjie Chen, Qianyu He, Siyu Yuan et al.
NEURIPS 2025spotlightarXiv:2505.19914
29
citations