α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Bilal Piot
Bilal Piot
8
papers
8,621
total citations
papers (8)
Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning
NEURIPS 2020
arXiv
8,033
citations
Nash Learning from Human Feedback
ICML 2024
arXiv
195
citations
Generalized Preference Optimization: A Unified Approach to Offline Alignment
ICML 2024
arXiv
150
citations
Human Alignment of Large Language Models through Online Preference Optimisation
ICML 2024
arXiv
88
citations
BYOL-Explore: Exploration by Bootstrapped Prediction
NEURIPS 2022
arXiv
88
citations
RRM: Robust Reward Model Training Mitigates Reward Hacking
ICLR 2025
arXiv
50
citations
Unlocking the Power of Representations in Long-term Novelty-based Exploration
ICLR 2024
arXiv
9
citations
Learning from negative feedback, or positive feedback or both
ICLR 2025
arXiv
8
citations