Poster "episodic reinforcement learning" Papers
2 papers found
Sharp Gap-Dependent Variance-Aware Regret Bounds for Tabular MDPs
Shulun Chen, Runlong Zhou, Zihan Zhang et al.
NeurIPS 2025, arXiv:2506.06521 (1 citation)
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
Xutong Liu, Siwei Wang, Jinhang Zuo et al.
ICML 2024, arXiv:2406.01386 (8 citations)