"offline learning" Papers
14 papers found
Contextual Thompson Sampling via Generation of Missing Data
Kelly W Zhang, Tianhui Cai, Hongseok Namkoong et al.
NEURIPS 2025, arXiv:2502.07064
2 citations
Efficient Reinforcement Learning with Large Language Model Priors
Xue Yan, Yan Song, Xidong Feng et al.
ICLR 2025, arXiv:2410.07927
21 citations
Improved Confidence Regions and Optimal Algorithms for Online and Offline Linear MNL Bandits
Yuxuan Han, Jose Blanchet, Zhengyuan Zhou
NEURIPS 2025
Learning to Reuse Policies in State Evolvable Environments
Ziqian Zhang, Bohan Yang, Lihe Li et al.
ICML 2025 (oral)
Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning
Rishabh Agrawal, Nathan Dahlin, Rahul Jain et al.
AAAI 2025 (paper), arXiv:2408.09125
1 citation
Mixture of Online and Offline Experts for Non-Stationary Time Series
Zhilin Zhao, Longbing Cao, Yuanyu Wan
AAAI 2025 (paper), arXiv:2202.05996
Modelling the control of offline processing with reinforcement learning
Eleanor Spens, Neil Burgess, Tim Behrens
NEURIPS 2025
Prevalence of Negative Transfer in Continual Reinforcement Learning: Analyses and a Simple Baseline
Hongjoon Ahn, Jinu Hyeon, Youngmin Oh et al.
ICLR 2025
2 citations
Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach
Chenbei Lu, Zaiwei Chen, Tongxin Li et al.
NEURIPS 2025 (spotlight), arXiv:2510.18687
1 citation
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
Kihyun Kim, Jiawei Zhang, Asuman Ozdaglar et al.
ICML 2024, arXiv:2405.12421
2 citations
Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Nikhil Vyas, Depen Morwani, Rosie Zhao et al.
ICML 2024 (spotlight), arXiv:2306.08590
7 citations
Learning Constraints from Offline Demonstrations via Superior Distribution Correction Estimation
Guorui Quan, Zhiqiang Xu, Guiliang Liu
ICML 2024
Offline Inverse RL: New Solution Concepts and Provably Efficient Algorithms
Filippo Lazzati, Mirco Mutti, Alberto Maria Metelli
ICML 2024, arXiv:2402.15392
7 citations
Parameterized Projected Bellman Operator
Théo Vincent, Alberto Maria Metelli, Boris Belousov et al.
AAAI 2024 (paper), arXiv:2312.12869
4 citations