ResearchAlpha Leak


Johan Ferret

Data sourced from Semantic Scholar

Built: Feb 15, 2026, 7:26 AM AMS

4 papers · 596 total citations

Papers (4)

RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

BOND: Aligning LLMs with Best-of-N Distillation

There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning

NeurIPS 2021 · arXiv

WARM: On the Benefits of Weight Averaged Reward Models