Spotlight "preference modeling" Papers
3 papers found
Conference
Beyond Scalar Rewards: An Axiomatic Framework for Lexicographic MDPs
Mehran Shakerinava, Siamak Ravanbakhsh, Adam Oberman
NEURIPS 2025spotlightarXiv:2505.12049
Generalized Top-k Mallows Model for Ranked Choices
Shahrzad Haddadan, Sara Ahmadian
NEURIPS 2025spotlightarXiv:2510.22040
Nash Learning from Human Feedback
REMI MUNOS, Michal Valko, Daniele Calandriello et al.
ICML 2024spotlightarXiv:2312.00886
195
citations