"proxy reward function" Papers
2 papers found
Conference
Reinforcement Learning from Imperfect Corrective Actions and Proxy Rewards
Zhaohui JIANG, Xuening Feng, Paul Weng et al.
ICLR 2025, arXiv:2410.05782
3 citations
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward
Haoxin Lin, Hongqiu Wu, Jiaji Zhang et al.
AAAI 2024, arXiv:2312.10642
3 citations