by Washim Mondal Papers
3 papers found
Conference
A Sharper Global Convergence Analysis for Average Reward Reinforcement Learning via an Actor-Critic Approach
Swetha Ganesh, Washim Mondal, Vaneet Aggarwal
ICML 2025arXiv:2407.18878
9
citations
Finite-Sample Analysis of Policy Evaluation for Robust Average Reward Reinforcement Learning
Yang Xu, Washim Mondal, Vaneet Aggarwal
NEURIPS 2025arXiv:2502.16816
8
citations
Global Convergence for Average Reward Constrained MDPs with Primal-Dual Actor Critic Algorithm
Yang Xu, Swetha Ganesh, Washim Mondal et al.
NEURIPS 2025arXiv:2505.15138
3
citations