Hanyang Zhao
3 papers, 52 total citations
Papers (3)
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
ICLR 2025, arXiv, 19 citations
Score as Action: Fine Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
ICML 2025, arXiv, 18 citations
MallowsPO: Fine-Tune Your LLM with Preference Dispersions
ICLR 2025, arXiv, 15 citations