"constrained markov decision processes" Papers
5 papers found
Conference
Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation
Toshinori Kitamura, Arnob Ghosh, Tadashi Kozuno et al.
NeurIPS 2025 · spotlight · arXiv:2502.10138
Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves
Martin Kurečka, Václav Nevyhoštěný, Petr Novotný et al.
AAAI 2025 · paper · arXiv:2412.13962
1 citation
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri, Rahul Jain, Haipeng Luo
ICML 2024 · arXiv:2302.00808
2 citations
Online Learning in CMDPs: Handling Stochastic and Adversarial Constraints
Francesco Emanuele Stradi, Jacopo Germano, Gianmarco Genalti et al.
ICML 2024
Truly No-Regret Learning in Constrained MDPs
Adrian Müller, Pragnya Alatur, Volkan Cevher et al.
ICML 2024 · spotlight · arXiv:2402.15776
16 citations