"preference-based learning" Papers
2 papers found
Conference
Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment
Yuang Cai, Yuyu Yuan, Jinsheng Shi et al.
AAAI 2025, arXiv:2411.09341
4 citations
Rating-Based Reinforcement Learning
Devin White, Mingkang Wu, Ellen Novoseller et al.
AAAI 2024, arXiv:2307.16348
14 citations