Poster "bandit feedback" Papers

12 papers found

Comparing Uniform Price and Discriminatory Multi-Unit Auctions through Regret Minimization

Marius Potfer, Vianney Perchet

NEURIPS 2025arXiv:2510.19591

No-Regret Learning Under Adversarial Resource Constraints: A Spending Plan Is All You Need!

Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.

NEURIPS 2025arXiv:2506.13244
2
citations

No-Regret Online Autobidding Algorithms in First-price Auctions

Yilin LI, Yuan Deng, Wei Tang et al.

NEURIPS 2025arXiv:2510.16869
1
citations

Uniform Wrappers: Bridging Concave to Quadratizable Functions in Online Optimization

Mohammad Pedramfar, Christopher Quinn, Vaneet Aggarwal

NEURIPS 2025

Efficient Online Set-valued Classification with Bandit Feedback

Zhou Wang, Xingye Qiao

ICML 2024arXiv:2405.04393
1
citations

Federated Combinatorial Multi-Agent Multi-Armed Bandits

Fares Fourati, Mohamed-Slim Alouini, Vaneet Aggarwal

ICML 2024arXiv:2405.05950
8
citations

On Interpolating Experts and Multi-Armed Bandits

Houshuang Chen, Yuchen He, Chihao Zhang

ICML 2024arXiv:2307.07264
5
citations

Performative Prediction with Bandit Feedback: Learning through Reparameterization

Yatong Chen, Wei Tang, Chien-Ju Ho et al.

ICML 2024arXiv:2305.01094
12
citations

Projection-Free Online Convex Optimization with Time-Varying Constraints

Dan Garber, Ben Kretzu

ICML 2024arXiv:2402.08799
5
citations

Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback

GUOJUN XIONG, Jian Li

ICML 2024arXiv:2405.00950
1
citations

Quantum Algorithm for Online Exp-concave Optimization

Jianhao He, Chengchang Liu, Xutong Liu et al.

ICML 2024arXiv:2410.19688
3
citations

Rate-Optimal Policy Optimization for Linear Markov Decision Processes

Uri Sherman, Alon Cohen, Tomer Koren et al.

ICML 2024arXiv:2308.14642
9
citations