Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits
2citations
arXiv:2301.114422
citations
#1499
in AAAI 2024
of 2289 papers
2
Top Authors
5
Data Points
Top Authors
Topics
Abstract
In this paper, we study the collaborative learning model, which concerns the tradeoff between parallelism and communication overhead in multi-agent multi-armed bandits. For regret minimization in multi-armed bandits, we present the first set of tradeoffs between the number of rounds of communication among the agents and the regret of the collaborative learning process.
Citation History
Jan 27, 2026
2
Feb 13, 2026
2
Feb 13, 2026
2
Feb 13, 2026
2
Feb 13, 2026
2