α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Alekh Agarwal
Alekh Agarwal
14
papers
1,099
total citations
papers (14)
Bellman-consistent Pessimism for Offline Reinforcement Learning
NEURIPS 2021
arXiv
308
citations
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs
NEURIPS 2020
arXiv
249
citations
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
ICML 2024
arXiv
139
citations
PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
NEURIPS 2020
arXiv
120
citations
Safe Reinforcement Learning via Curriculum Induction
NEURIPS 2020
arXiv
99
citations
Theoretical guarantees on the best-of-n alignment policy
ICML 2025
arXiv
95
citations
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL
NEURIPS 2022
arXiv
29
citations
Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity
NEURIPS 2022
arXiv
28
citations
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
ICML 2024
arXiv
17
citations
Policy Improvement via Imitation of Multiple Oracles
NEURIPS 2020
arXiv
7
citations
Design Considerations in Offline Preference-based RL
ICML 2025
arXiv
4
citations
Ordering-based Conditions for Global Convergence of Policy Gradient Methods
NEURIPS 2023
arXiv
4
citations
The Non-linear $F$-Design and Applications to Interactive Learning
ICML 2024
0
citations
Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration
NEURIPS 2020
0
citations