Poster "average-reward mdps" Papers
3 papers found
Conference
Offline Actor-Critic for Average Reward MDPs
William Powell, Jeongyeol Kwon, Qiaomin Xie et al.
NEURIPS 2025
73
citations
Optimal Non-Asymptotic Rates of Value Iteration for Average-Reward Markov Decision Processes
Jongmin Lee, Ernest Ryu
ICLR 2025arXiv:2504.09913
5
citations
Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach
Swetha Ganesh, Vaneet Aggarwal
NEURIPS 2025arXiv:2505.19986
3
citations