"reward model evaluation" Papers
4 papers found
Conference
Agent-Oriented Planning in Multi-Agent Systems
Ao LI, Yuexiang Xie, Songze Li et al.
ICLR 2025arXiv:2410.02189
24
citations
How to Evaluate Reward Models for RLHF
Evan Frick, Tianle Li, Connor Chen et al.
ICLR 2025arXiv:2410.14872
58
citations
RMB: Comprehensively benchmarking reward models in LLM alignment
Enyu Zhou, Guodong Zheng, Binghai Wang et al.
ICLR 2025arXiv:2410.09893
47
citations
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
Hyungjoo Chae, Seonghwan Kim, Junhee Cho et al.
NEURIPS 2025spotlightarXiv:2505.15277
8
citations