Paper "regret minimization" Papers
5 papers found
Conference
Heterogeneous Multi-Agent Bandits with Parsimonious Hints
Amirmahdi Mirfakhar, Xuchuang Wang, Jinhang Zuo et al.
AAAI 2025paperarXiv:2502.16128
3
citations
Neural Combinatorial Clustered Bandits for Recommendation Systems
Baran Atalar, Carlee Joe-Wong
AAAI 2025paperarXiv:2410.14586
3
citations
Scenario-Based Robust Optimization of Tree Structures
Spyros Angelopoulos, Christoph Dürr, Alex Elenter et al.
AAAI 2025paperarXiv:2408.11422
Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits
Nikolai Karpov, Qin Zhang
AAAI 2024paperarXiv:2301.11442
2
citations
Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs
Tianyuan Jin, Hao-Lun Hsu, William Chang et al.
AAAI 2024paperarXiv:2312.15549
3
citations