"human feedback learning" Papers
2 papers found
Conference
Efficient Preference-Based Reinforcement Learning: Randomized Exploration meets Experimental Design
Andreas Schlaginhaufen, Reda Ouhamma, Maryam Kamgarpour
NEURIPS 2025arXiv:2506.09508
3
citations
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning
Heewoong Choi, Sangwon Jung, Hongjoon Ahn et al.
ICML 2024arXiv:2408.04190
11
citations