Diffusion Models Meet Contextual Bandits

5citations

arXiv:2402.10028

citations

#1136

in NEURIPS 2025

of 5858 papers

Top Authors

Data Points

Top Authors

Imad Aouali

Abstract

Efficient online decision-making in contextual bandits is challenging, as methods without informative priors often suffer from computational or statistical inefficiencies. In this work, we leverage pre-trained diffusion models as expressive priors to capture complex action dependencies and develop a practical algorithm that efficiently approximates posteriors under such priors, enabling both fast updates and sampling. Empirical results demonstrate the effectiveness and versatility of our approach across diverse contextual bandit settings.

Citation History

Jan 25, 2026

Jan 27, 2026

Jan 28, 2026

Feb 13, 2026

5+5

Feb 13, 2026