"out-of-distribution actions" Papers
3 papers found
Conference
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Yixiu Mao, Yun Qu, Qi Wang et al.
NEURIPS 2025spotlightarXiv:2511.02567
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
Zongkai Liu, Qian Lin, Chao Yu et al.
AAAI 2025paperarXiv:2412.07639
8
citations
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Tenglong Liu, Yang Li, Yixing Lan et al.
ICML 2024arXiv:2405.19909
13
citations