"off-policy evaluation" Papers

13 papers found

Breaking the Order Barrier: Off-Policy Evaluation for Confounded POMDPs

Qi Kuang, Jiayi Wang, Fan Zhou et al.

NEURIPS 2025

Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation

Feichen Gan, Lu Youcun, Yingying Zhang et al.

NEURIPS 2025 (oral), arXiv:2510.26026

Cross-Domain Off-Policy Evaluation and Learning for Contextual Bandits

Yuta Natsubori, Masataka Ushiku, Yuta Saito

ICLR 2025

Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings

Houssam Zenati, Bariscan Bozkurt, Arthur Gretton

NEURIPS 2025, arXiv:2506.02793
1 citation

Efficient Multi-Policy Evaluation for Reinforcement Learning

Shuze Daniel Liu, Claire Chen, Shangtong Zhang

AAAI 2025, arXiv:2408.08706
2 citations

Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol

Pai Liu, Lingfeng Zhao, Shivangi Agarwal et al.

NEURIPS 2025, arXiv:2502.08021
4 citations

STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation

Hossein Goli, Michael Gimelfarb, Nathan de Lara et al.

NEURIPS 2025 (spotlight), arXiv:2505.20781
2 citations

Model-based Reinforcement Learning for Confounded POMDPs

Mao Hong, Zhengling Qi, Yanxun Xu

ICML 2024

Offline Transition Modeling via Contrastive Energy Learning

Ruifeng Chen, Chengxing Jia, Zefang Huang et al.

ICML 2024

Off-policy Evaluation Beyond Overlap: Sharp Partial Identification Under Smoothness

Samir Khan, Martin Saveski, Johan Ugander

ICML 2024

Policy-conditioned Environment Models are More Generalizable

Ruifeng Chen, Xiong-Hui Chen, Yihao Sun et al.

ICML 2024

Predictive Performance Comparison of Decision Policies Under Confounding

Luke Guerdan, Amanda Coston, Ken Holstein et al.

ICML 2024, arXiv:2404.00848
1 citation

Probabilistic Offline Policy Ranking with Approximate Bayesian Computation

Longchao Da, Porter Jenkins, Trevor Schwantes et al.

AAAI 2024, arXiv:2312.11551
3 citations