α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Wei Fu
Wei Fu
3
papers
267
total citations
papers (3)
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
ICML 2024
arXiv
253
citations
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
ICLR 2024
arXiv
9
citations
Iteratively Learn Diverse Strategies with State Distance Information
NEURIPS 2023
arXiv
5
citations