Wenpin Tang
4 papers, 85 total citations
papers (4)
Policy Optimization for Continuous Reinforcement Learning
NeurIPS 2023, arXiv, 33 citations
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
ICLR 2025, arXiv, 19 citations
Score as Action: Fine Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
ICML 2025, arXiv, 18 citations
MallowsPO: Fine-Tune Your LLM with Preference Dispersions
ICLR 2025, arXiv, 15 citations