"offline learning" Papers

14 papers found

Contextual Thompson Sampling via Generation of Missing Data

Kelly W Zhang, Tianhui Cai, Hongseok Namkoong et al.

NEURIPS 2025arXiv:2502.07064
2
citations

Efficient Reinforcement Learning with Large Language Model Priors

Xue Yan, Yan Song, Xidong Feng et al.

ICLR 2025arXiv:2410.07927
21
citations

Improved Confidence Regions and Optimal Algorithms for Online and Offline Linear MNL Bandits

Yuxuan Han, Jose Blanchet, Zhengyuan Zhou

NEURIPS 2025

Learning to Reuse Policies in State Evolvable Environments

Ziqian Zhang, Bohan Yang, Lihe Li et al.

ICML 2025oral

Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning

Rishabh Agrawal, Nathan Dahlin, Rahul Jain et al.

AAAI 2025paperarXiv:2408.09125
1
citations

Mixture of Online and Offline Experts for Non-Stationary Time Series

Zhilin Zhao, Longbing Cao, Yuanyu Wan

AAAI 2025paperarXiv:2202.05996

Modelling the control of offline processing with reinforcement learning

Eleanor Spens, Neil Burgess, Tim Behrens

NEURIPS 2025

Prevalence of Negative Transfer in Continual Reinforcement Learning: Analyses and a Simple Baseline

Hongjoon Ahn, Jinu Hyeon, Youngmin Oh et al.

ICLR 2025
2
citations

Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach

Chenbei Lu, Zaiwei Chen, Tongxin Li et al.

NEURIPS 2025spotlightarXiv:2510.18687
1
citations

A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback

Kihyun Kim, Jiawei Zhang, Asuman Ozdaglar et al.

ICML 2024arXiv:2405.12421
2
citations

Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning

Nikhil Vyas, Depen Morwani, Rosie Zhao et al.

ICML 2024spotlightarXiv:2306.08590
7
citations

Learning Constraints from Offline Demonstrations via Superior Distribution Correction Estimation

Guorui Quan, Zhiqiang Xu, Guiliang Liu

ICML 2024

Offline Inverse RL: New Solution Concepts and Provably Efficient Algorithms

Filippo Lazzati, Mirco Mutti, Alberto Maria Metelli

ICML 2024arXiv:2402.15392
7
citations

Parameterized Projected Bellman Operator

Théo Vincent, Alberto Maria Metelli, Boris Belousov et al.

AAAI 2024paperarXiv:2312.12869
4
citations