"reward redistribution" Papers
2 papers found
Conference
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
Yun Qu, Yuhang Jiang, Boyuan Wang et al.
AAAI 2025paperarXiv:2412.11120
24
citations
Dense Reward for Free in Reinforcement Learning from Human Feedback
Alexander Chan, Hao Sun, Samuel Holt et al.
ICML 2024arXiv:2402.00782
65
citations