Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits

2citations

arXiv:2301.11442

citations

#1499

in AAAI 2024

of 2289 papers

Top Authors

Data Points

Top Authors

Nikolai Karpov Qin Zhang

Topics

multi-armed bandits collaborative learning regret minimization communication efficiency multi-agent systems parallelism tradeoffs

Abstract

In this paper, we study the collaborative learning model, which concerns the tradeoff between parallelism and communication overhead in multi-agent multi-armed bandits. For regret minimization in multi-armed bandits, we present the first set of tradeoffs between the number of rounds of communication among the agents and the regret of the collaborative learning process.

Citation History

Jan 27, 2026

Feb 13, 2026