α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Johan Ferret
Johan Ferret
4
papers
596
total citations
papers (4)
RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
ICML 2024
arXiv
527
citations
BOND: Aligning LLMs with Best-of-N Distillation
ICLR 2025
arXiv
53
citations
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
NEURIPS 2021
arXiv
16
citations
WARM: On the Benefits of Weight Averaged Reward Models
ICML 2024
0
citations