α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Zhaolin Gao
Zhaolin Gao
5
papers
49
total citations
papers (5)
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
ICLR 2025
arXiv
18
citations
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
NEURIPS 2025
arXiv
12
citations
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
NEURIPS 2025
arXiv
12
citations
Value-Guided Search for Efficient Chain-of-Thought Reasoning
NEURIPS 2025
arXiv
7
citations
Shoestring: Graph-Based Semi-Supervised Classification With Severely Limited Labeled Data
CVPR 2020
0
citations