"reinforcement learning with human feedback" Papers
2 papers found
Conference
Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse Reinforcement Learning
Jared Joselowitz, Ritam Majumdar, Arjun Jagota et al.
COLM 2025, arXiv:2410.12491
4 citations
Preference Optimization on Pareto Sets: On a Theory of Multi-Objective Optimization
Abhishek Roy, Geelon So, Yian Ma
NeurIPS 2025
11 citations