"reference-advantage decomposition" Papers
3 papers found
Conference
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition
Zhong Zheng, Haochen Zhang, Lingzhou Xue
ICLR 2025arXiv:2410.07574
9
citations
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games
Songtao Feng, Ming Yin, Yu-Xiang Wang et al.
ICML 2024arXiv:2308.08858
2
citations
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Dake Zhang, Boxiang Lyu, Shuang Qiu et al.
ICML 2024spotlightarXiv:2407.07631
1
citations