Masatoshi Uehara
Affiliations (1): Harvard
11 papers, 307 total citations
papers (11)
Off-Policy Evaluation and Learning for External Validity under a Covariate Shift
NEURIPS 2020 | arXiv | 56 citations

Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
ICLR 2025 | arXiv | 45 citations

Feedback Efficient Online Fine-Tuning of Diffusion Models
ICML 2024 | arXiv | 44 citations

Provable Offline Preference-Based Reinforcement Learning
ICLR 2024 | arXiv | 43 citations

Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems
NEURIPS 2022 | arXiv | 43 citations

Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
NEURIPS 2023 | arXiv | 24 citations

Doubly Robust Off-Policy Value and Gradient Estimation for Deterministic Policies
NEURIPS 2020 | arXiv | 16 citations

Provable Reward-Agnostic Preference-Based Reinforcement Learning
ICLR 2024 | arXiv | 14 citations

Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design
ICML 2025 | arXiv | 14 citations

Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage
NEURIPS 2023 | arXiv | 8 citations

Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage
NEURIPS 2021 | 0 citations