"preference-based optimization" Papers
2 papers found
Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
Sheryl Hsu, Omar Khattab, Chelsea Finn et al.
ICLR 2025, arXiv:2410.23214
16 citations
Pareto Set Learning for Multi-Objective Reinforcement Learning
Erlong Liu, Yu-Chang Wu, Xiaobin Huang et al.
AAAI 2025, arXiv:2501.06773
14 citations