"offline policy evaluation" Papers
3 papers found
Conference
Deployment Efficient Reward-Free Exploration with Linear Function Approximation
Zihan Zhang, Yuxin Chen, Jason Lee et al.
NEURIPS 2025
Online Optimization for Offline Safe Reinforcement Learning
Yassine Chemingui, Aryan Deshwal, Alan Fern et al.
NEURIPS 2025arXiv:2510.22027
A Fine-grained Analysis of Fitted Q-evaluation: Beyond Parametric Models
Jiayi Wang, Zhengling Qi, Raymond K. W. Wong
ICML 2024arXiv:2406.10438