Poster "reward model evaluation" Papers
3 papers found
Conference
Agent-Oriented Planning in Multi-Agent Systems
Ao Li, Yuexiang Xie, Songze Li et al.
ICLR 2025, arXiv:2410.02189
24 citations
How to Evaluate Reward Models for RLHF
Evan Frick, Tianle Li, Connor Chen et al.
ICLR 2025, arXiv:2410.14872
58 citations
RMB: Comprehensively Benchmarking Reward Models in LLM Alignment
Enyu Zhou, Guodong Zheng, Binghai Wang et al.
ICLR 2025, arXiv:2410.09893
47 citations