α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Heyang Zhao
Heyang Zhao
2
papers
34
total citations
papers (2)
Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits
ICLR 2024
arXiv
18
citations
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
NEURIPS 2025
arXiv
16
citations