Jiantao Jiao
13 papers, 689 total citations
papers (13)
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism (NeurIPS 2021, arXiv), 318 citations
Toward the Fundamental Limits of Imitation Learning (NeurIPS 2020, arXiv), 108 citations
How to Evaluate Reward Models for RLHF (ICLR 2025, arXiv), 58 citations
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning (ICML 2025, arXiv), 52 citations
MADE: Exploration via Maximizing Deviation from Explored Regions (NeurIPS 2021, arXiv), 50 citations
Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF (ICML 2024, arXiv), 48 citations
Minimax Optimal Online Imitation Learning via Replay Estimation (NeurIPS 2022, arXiv), 23 citations
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning (NeurIPS 2023, arXiv), 20 citations
SLIP: Learning to predict in unknown dynamical systems with long-term memory (NeurIPS 2020, arXiv), 12 citations
On the Value of Interaction and Function Approximation in Imitation Learning (NeurIPS 2021), 0 citations
Beyond the Best: Distribution Functional Estimation in Infinite-Armed Bandits (NeurIPS 2022), 0 citations
Towards Optimal Caching and Model Selection for Large Model Inference (NeurIPS 2023), 0 citations
Doubly-Robust Self-Training (NeurIPS 2023), 0 citations