Poster "q-learning algorithms" Papers
2 papers found
Conference
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition
Zhong Zheng, Haochen Zhang, Lingzhou Xue
ICLR 2025, arXiv:2410.07574
9 citations
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices
Jiin Woo, Laixi Shi, Gauri Joshi et al.
ICML 2024, arXiv:2402.05876
9 citations