Junkang Wu
1 paper
14 total citations
Papers (1)
AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
ICML 2025
arXiv
14 citations