α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Jianshu Chen
Jianshu Chen
1
papers
1
total citations
papers (1)
Self-Rewarding PPO: Aligning Large Language Models with Demonstrations Only
COLM 2025
arXiv
1
citations