"offline policy optimization" Papers
4 papers found
Conference
Neural Stochastic Differential Equations for Uncertainty-Aware Offline RL
Cevahir Koprulu, Franck Djeumou, ufuk topcu
ICLR 2025
Offline Actor-Critic for Average Reward MDPs
William Powell, Jeongyeol Kwon, Qiaomin Xie et al.
NEURIPS 2025
73
citations
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
Chen-Xiao Gao, Chenyang Wu, Mingjun Cao et al.
AAAI 2024paperarXiv:2309.05915
26
citations
Policy-conditioned Environment Models are More Generalizable
Ruifeng Chen, Xiong-Hui Chen, Yihao Sun et al.
ICML 2024