Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling

2citations

arXiv:2507.00518 Project

citations

#1626

in ICML 2025

of 3340 papers

Top Authors

Data Points

Top Authors

Walid Bendada Guillaume Salha-Galvan Romain Hennequin Théo Bontempelli Thomas Bouabca Tristan Cazenave

Abstract

This paper introduces von Mises-Fisher exploration (vMF-exp), a scalable method for exploring large action sets in reinforcement learning problems where hyperspherical embedding vectors represent these actions. vMF-exp involves initially sampling a state embedding representation using a von Mises-Fisher distribution, then exploring this representation's nearest neighbors, which scales to virtually unlimited numbers of candidate actions.We show that, under theoretical assumptions, vMF-exp asymptotically maintains the same probability of exploring each action as Boltzmann Exploration (B-exp), a popular alternative that, nonetheless, suffers from scalability issues as it requires computing softmax values for each action.Consequently, vMF-exp serves as a scalable alternative to B-exp for exploring large action sets with hyperspherical embeddings. Experiments on simulated data, real-world public data, and the successful large-scale deployment of vMF-exp on the recommender system of a global music streaming service empirically validate the key properties of the proposed method.

Citation History

Jan 28, 2026

Feb 13, 2026

2+2

Feb 13, 2026