ResearchAlpha Leak

Yunhao Tang

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 15, 2026, 1:13 AM AMS

10 papers · 601 total citations

Papers (10)

Nash Learning from Human Feedback

Generalized Preference Optimization: A Unified Approach to Offline Alignment

BYOL-Explore: Exploration by Bootstrapped Prediction

NeurIPS 2022 · arXiv

Human Alignment of Large Language Models through Online Preference Optimisation

Self-Imitation Learning via Generalized Lower Bound Q-learning

NeurIPS 2020 · arXiv

Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards

NeurIPS 2025 · arXiv

The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning

NeurIPS 2022 · arXiv

A Distributional Analogue to the Successor Representation

Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation

NeurIPS 2021 · arXiv

Learning Uncertainty-Aware Temporally-Extended Actions