Papers matching "reward model training"
3 papers found
An Evaluation Framework for Product Images Background Inpainting Based on Human Feedback and Product Consistency
Yuqi Liang, Jun Luo, Xiaoxi Guo et al.
AAAI 2025, arXiv:2412.17504, 1 citation
Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations
Zilin Wang, Haolin Zhuang, Lu Li et al.
AAAI 2024, arXiv:2312.11442, 5 citations
RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting
Lei Shu, Liangchen Luo, Jayakumar Hoskere et al.
AAAI 2024, arXiv:2305.15685, 78 citations