"off-policy evaluation" Papers
13 papers found
Conference
Breaking the Order Barrier: Off-Policy Evaluation for Confounded POMDPs
Qi Kuang, Jiayi Wang, Fan Zhou et al.
NEURIPS 2025
Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation
Feichen Gan, Lu Youcun, Yingying Zhang et al.
NEURIPS 2025oralarXiv:2510.26026
Cross-Domain Off-Policy Evaluation and Learning for Contextual Bandits
Yuta Natsubori, Masataka Ushiku, Yuta Saito
ICLR 2025
Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings
Houssam Zenati, Bariscan Bozkurt, Arthur Gretton
NEURIPS 2025arXiv:2506.02793
1
citations
Efficient Multi-Policy Evaluation for Reinforcement Learning
Shuze Daniel Liu, Claire Chen, Shangtong Zhang
AAAI 2025paperarXiv:2408.08706
2
citations
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
Pai Liu, Lingfeng Zhao, Shivangi Agarwal et al.
NEURIPS 2025arXiv:2502.08021
4
citations
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
Hossein Goli, Michael Gimelfarb, Nathan de Lara et al.
NEURIPS 2025spotlightarXiv:2505.20781
2
citations
Model-based Reinforcement Learning for Confounded POMDPs
Mao Hong, Zhengling Qi, Yanxun Xu
ICML 2024
Offline Transition Modeling via Contrastive Energy Learning
Ruifeng Chen, Chengxing Jia, Zefang Huang et al.
ICML 2024
Off-policy Evaluation Beyond Overlap: Sharp Partial Identification Under Smoothness
Samir Khan, Martin Saveski, Johan Ugander
ICML 2024
Policy-conditioned Environment Models are More Generalizable
Ruifeng Chen, Xiong-Hui Chen, Yihao Sun et al.
ICML 2024
Predictive Performance Comparison of Decision Policies Under Confounding
Luke Guerdan, Amanda Coston, Ken Holstein et al.
ICML 2024arXiv:2404.00848
1
citations
Probabilistic Offline Policy Ranking with Approximate Bayesian Computation
Longchao Da, Porter Jenkins, Trevor Schwantes et al.
AAAI 2024paperarXiv:2312.11551
3
citations