Daniele Calandriello
11 papers, 647 total citations
papers (11)
Nash Learning from Human Feedback (ICML 2024): 195 citations
Generalized Preference Optimization: A Unified Approach to Offline Alignment (ICML 2024): 150 citations
Human Alignment of Large Language Models through Online Preference Optimisation (ICML 2024): 88 citations
BYOL-Explore: Exploration by Bootstrapped Prediction (NeurIPS 2022): 88 citations
Decoding-time Realignment of Language Models (ICML 2024): 59 citations
Sampling from a k-DPP without looking at all items (NeurIPS 2020): 29 citations
Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees (NeurIPS 2022): 12 citations
Unlocking the Power of Representations in Long-term Novelty-based Exploration (ICLR 2024): 9 citations
ParK: Sound and Efficient Kernel Ridge Regression by Feature Space Partitions (NeurIPS 2021): 7 citations
Model-free Posterior Sampling via Learning Rate Randomization (NeurIPS 2023): 5 citations
Demonstration-Regularized RL (ICLR 2024): 5 citations