Yunhao Tang
10 papers, 601 total citations
papers (10)
Nash Learning from Human Feedback
ICML 2024, arXiv, 195 citations

Generalized Preference Optimization: A Unified Approach to Offline Alignment
ICML 2024, arXiv, 150 citations

BYOL-Explore: Exploration by Bootstrapped Prediction
NeurIPS 2022, arXiv, 88 citations

Human Alignment of Large Language Models through Online Preference Optimisation
ICML 2024, arXiv, 88 citations

Self-Imitation Learning via Generalized Lower Bound Q-learning
NeurIPS 2020, arXiv, 29 citations

Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards
NeurIPS 2025, arXiv, 17 citations

The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning
NeurIPS 2022, arXiv, 12 citations

A Distributional Analogue to the Successor Representation
ICML 2024, arXiv, 10 citations

Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation
NeurIPS 2021, arXiv, 9 citations

Learning Uncertainty-Aware Temporally-Extended Actions
AAAI 2024, arXiv, 3 citations