Paper "offline reinforcement learning" Papers

11 papers found

Active Reinforcement Learning Strategies for Offline Policy Improvement

Ambedkar Dukkipati, Ranga Shaarad Ayyagari, Bodhisattwa Dasgupta et al.

AAAI 2025paperarXiv:2412.13106
3
citations

Are Expressive Models Truly Necessary for Offline RL?

Guan Wang, Haoyi Niu, Jianxiong Li et al.

AAAI 2025paperarXiv:2412.11253
6
citations

Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization

Zongkai Liu, Qian Lin, Chao Yu et al.

AAAI 2025paperarXiv:2412.07639
8
citations

SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch

Shengyu Feng, Yiming Yang

AAAI 2025paperarXiv:2412.15534
5
citations

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

Yinmin Zhang, Jie Liu, Chuming Li et al.

AAAI 2024paperarXiv:2312.07685
25
citations

Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning

Jinxin Liu, Ziqi Zhang, Zhenyu Wei et al.

AAAI 2024paperarXiv:2306.12755
27
citations

CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning

Chenyu Sun, Hangwei Qian, Chunyan Miao

AAAI 2024paperarXiv:2312.12191
1
citations

Neural Network Approximation for Pessimistic Offline Reinforcement Learning

Di Wu, Yuling Jiao, Li Shen et al.

AAAI 2024paperarXiv:2312.11863
1
citations

Optimistic Model Rollouts for Pessimistic Offline Policy Optimization

Yuanzhao Zhai, Yiying Li, Zijian Gao et al.

AAAI 2024paperarXiv:2401.05899
3
citations

Reinforcement Learning and Data

Generation for Syntax-Guided Synthesis

AAAI 2024paperarXiv:2210.09241
26
citations

Stitching Sub-trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL

Sungyoon Kim, Yunseon Choi, Daiki Matsunaga et al.

AAAI 2024paperarXiv:2402.07226
18
citations