Paper "offline reinforcement learning" Papers
11 papers found
Conference
Active Reinforcement Learning Strategies for Offline Policy Improvement
Ambedkar Dukkipati, Ranga Shaarad Ayyagari, Bodhisattwa Dasgupta et al.
AAAI 2025paperarXiv:2412.13106
3
citations
Are Expressive Models Truly Necessary for Offline RL?
Guan Wang, Haoyi Niu, Jianxiong Li et al.
AAAI 2025paperarXiv:2412.11253
6
citations
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
Zongkai Liu, Qian Lin, Chao Yu et al.
AAAI 2025paperarXiv:2412.07639
8
citations
SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch
Shengyu Feng, Yiming Yang
AAAI 2025paperarXiv:2412.15534
5
citations
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Yinmin Zhang, Jie Liu, Chuming Li et al.
AAAI 2024paperarXiv:2312.07685
25
citations
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning
Jinxin Liu, Ziqi Zhang, Zhenyu Wei et al.
AAAI 2024paperarXiv:2306.12755
27
citations
CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning
Chenyu Sun, Hangwei Qian, Chunyan Miao
AAAI 2024paperarXiv:2312.12191
1
citations
Neural Network Approximation for Pessimistic Offline Reinforcement Learning
Di Wu, Yuling Jiao, Li Shen et al.
AAAI 2024paperarXiv:2312.11863
1
citations
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Yuanzhao Zhai, Yiying Li, Zijian Gao et al.
AAAI 2024paperarXiv:2401.05899
3
citations
Reinforcement Learning and Data
Generation for Syntax-Guided Synthesis
AAAI 2024paperarXiv:2210.09241
26
citations
Stitching Sub-trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
Sungyoon Kim, Yunseon Choi, Daiki Matsunaga et al.
AAAI 2024paperarXiv:2402.07226
18
citations