"offline data" Papers
6 papers found
Conference
Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data
Shilong Deng, Zetao Zheng, Hongcai He et al.
AAAI 2025 | paper | arXiv:2501.07346
Offline-to-Online Hyperparameter Transfer for Stochastic Bandits
Dravyansh Sharma, Arun Suggala
AAAI 2025 | paper | arXiv:2501.02926
8 citations
Efficient Policy Evaluation with Offline Data Informed Behavior Policy Design
Shuze Liu, Shangtong Zhang
ICML 2024 | arXiv:2301.13734
7 citations
Foundation Policies with Hilbert Representations
Seohong Park, Tobias Kreiman, Sergey Levine
ICML 2024 | oral | arXiv:2402.15567
59 citations
Leveraging (Biased) Information: Multi-armed Bandits with Offline Data
Wang Chi Cheung, Lixing Lyu
ICML 2024 | spotlight
Robustly Improving Bandit Algorithms with Confounded and Selection Biased Offline Data: A Causal Approach
Wen Huang, Xintao Wu
AAAI 2024 | paper | arXiv:2312.12731
2 citations