Papers by Shuibai Zhang
2 papers found
Conference
VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data
Thomas Zeng, Shuibai Zhang, Shutong Wu et al.
ICML 2025 (oral), arXiv:2502.06737
20 citations
Supervised Knowledge Makes Large Language Models Better In-context Learners
Linyi Yang, Shuibai Zhang, Zhuohao Yu et al.
ICLR 2024, arXiv:2312.15918
26 citations