by Zhenqing Ling Papers
2 papers found
Conference
Diversity as a Reward: Fine-Tuning LLMs on a Mixture of Domain-Undetermined Data
Zhenqing Ling, Daoyuan Chen, Liuyi Yao et al.
NEURIPS 2025arXiv:2502.04380
8
citations
MindGYM: What Matters in Question Synthesis for Thinking-Centric Fine-Tuning?
Zhe Xu, Daoyuan Chen, Zhenqing Ling et al.
NEURIPS 2025arXiv:2503.09499
5
citations