"value functions" Papers
3 papers found
Conference
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu, Jiyuan Tu, Xi Chen et al.
NEURIPS 2025arXiv:2310.02581
5
citations
Preference Distillation via Value based Reinforcement Learning
Minchan Kwon, Junwon Ko, Kangil kim et al.
NEURIPS 2025arXiv:2509.16965
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Jesse Farebrother, Jordi Orbay, Quan Vuong et al.
ICML 2024arXiv:2403.03950
107
citations