Paper "regret bounds" Papers
6 papers found
Conference
Delay as Payoff in MAB
Ofir Schlisselberg, Ido Cohen, Tal Lancewicki et al.
AAAI 2025paperarXiv:2408.15158
4
citations
Improved Regret Bounds for Online Fair Division with Bandit Learning
Benjamin Schiffer, Shirley Zhang
AAAI 2025paperarXiv:2501.07022
5
citations
Mixture of Online and Offline Experts for Non-Stationary Time Series
Zhilin Zhao, Longbing Cao, Yuanyu Wan
AAAI 2025paperarXiv:2202.05996
Online Nonsubmodular Optimization with Delayed Feedback in the Bandit Setting
Sifan Yang, Yuanyu Wan, Lijun Zhang
AAAI 2025paperarXiv:2508.00523
1
citations
p-Mean Regret for Stochastic Bandits
Anand Krishna, Philips George John, Adarsh Barik et al.
AAAI 2025paperarXiv:2412.10751
5
citations
Revisiting Projection-Free Online Learning with Time-Varying Constraints
Yibo Wang, Yuanyu Wan, Lijun Zhang
AAAI 2025paperarXiv:2501.16046
5
citations