"off-policy evaluation" Papers

13 papers found

Breaking the Order Barrier: Off-Policy Evaluation for Confounded POMDPs

Qi Kuang, Jiayi Wang, Fan Zhou et al.

NEURIPS 2025

Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation

Feichen Gan, Lu Youcun, Yingying Zhang et al.

NEURIPS 2025, oral, arXiv:2510.26026

Cross-Domain Off-Policy Evaluation and Learning for Contextual Bandits

Yuta Natsubori, Masataka Ushiku, Yuta Saito

ICLR 2025

Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings

Houssam Zenati, Bariscan Bozkurt, Arthur Gretton

NEURIPS 2025, arXiv:2506.02793

Efficient Multi-Policy Evaluation for Reinforcement Learning

Shuze Daniel Liu, Claire Chen, Shangtong Zhang

AAAI 2025, paper, arXiv:2408.08706

Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol

Pai Liu, Lingfeng Zhao, Shivangi Agarwal et al.

NEURIPS 2025, arXiv:2502.08021

STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation

Hossein Goli, Michael Gimelfarb, Nathan de Lara et al.

NEURIPS 2025, spotlight, arXiv:2505.20781

Model-based Reinforcement Learning for Confounded POMDPs

Mao Hong, Zhengling Qi, Yanxun Xu

ICML 2024

Offline Transition Modeling via Contrastive Energy Learning

Ruifeng Chen, Chengxing Jia, Zefang Huang et al.

ICML 2024

Off-policy Evaluation Beyond Overlap: Sharp Partial Identification Under Smoothness

Samir Khan, Martin Saveski, Johan Ugander

ICML 2024

Policy-conditioned Environment Models are More Generalizable

Ruifeng Chen, Xiong-Hui Chen, Yihao Sun et al.

ICML 2024

Predictive Performance Comparison of Decision Policies Under Confounding

Luke Guerdan, Amanda Coston, Ken Holstein et al.

ICML 2024, arXiv:2404.00848

Probabilistic Offline Policy Ranking with Approximate Bayesian Computation

Longchao Da, Porter Jenkins, Trevor Schwantes et al.

AAAI 2024, paper, arXiv:2312.11551
