"user preference alignment" Papers
3 papers found
Conference
Causally Motivated Sycophancy Mitigation for Large Language Models
Haoxi Li, Xueyang Tang, Jie ZHANG et al.
ICLR 2025
8
citations
MTRec: Learning to Align with User Preferences via Mental Reward Models
Mengchen Zhao, Yifan Gao, Yaqing Hou et al.
NEURIPS 2025arXiv:2509.22807
Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization
Connor Dunlop, Matthew Zheng, Kavana Venkatesh et al.
NEURIPS 2025arXiv:2511.05616
1
citations