α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Rafael Rafailov
Rafael Rafailov
6
papers
8,534
total citations
papers (6)
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
NEURIPS 2023
arXiv
7,188
citations
Diffusion Model Alignment Using Direct Preference Optimization
CVPR 2024
arXiv
561
citations
COMBO: Conservative Offline Model-Based Policy Optimization
NEURIPS 2021
arXiv
488
citations
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
ICML 2024
arXiv
179
citations
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
NEURIPS 2025
arXiv
60
citations
Visual Adversarial Imitation Learning using Variational Models
NEURIPS 2021
arXiv
58
citations