Paper "bandit feedback" Papers
3 papers found
Conference
Last-iterate Convergence in Regularized Graphon Mean Field Game
Jing Dong, Baoxiang Wang, Yaoliang Yu
AAAI 2025paperarXiv:2410.08746
2
citations
Online Nonsubmodular Optimization with Delayed Feedback in the Bandit Setting
Sifan Yang, Yuanyu Wan, Lijun Zhang
AAAI 2025paperarXiv:2508.00523
1
citations
Revisiting Projection-Free Online Learning with Time-Varying Constraints
Yibo Wang, Yuanyu Wan, Lijun Zhang
AAAI 2025paperarXiv:2501.16046
5
citations