α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Wenhao Zhan
Wenhao Zhan
5
papers
104
total citations
papers (5)
Provable Offline Preference-Based Reinforcement Learning
ICLR 2024
arXiv
43
citations
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
ICLR 2025
arXiv
18
citations
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning
NEURIPS 2023
arXiv
17
citations
Provable Reward-Agnostic Preference-Based Reinforcement Learning
ICLR 2024
arXiv
14
citations
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
NEURIPS 2025
arXiv
12
citations