Poster "regret analysis" Papers

15 papers found

ADAM Optimization with Adaptive Batch Selection

Gyu Yeol Kim, Min-hwan Oh

ICLR 2025arXiv:2512.06795
2
citations

MALinZero: Efficient Low-Dimensional Search for Mastering Complex Multi-Agent Planning

Sizhe Tang, Jiayu Chen, Tian Lan

NEURIPS 2025arXiv:2511.06142
4
citations

Online Two-Stage Submodular Maximization

Iasonas Nikolaou, Miltiadis Stouras, Stratis Ioannidis et al.

NEURIPS 2025arXiv:2510.19480

Pareto Optimal Risk-Agnostic Distributional Bandits with Heavy-Tail Rewards

Kyungjae Lee, Dohyeong Kim, Taehyun Cho et al.

NEURIPS 2025

Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach

Swetha Ganesh, Vaneet Aggarwal

NEURIPS 2025arXiv:2505.19986
3
citations

Second Order Bounds for Contextual Bandits with Function Approximation

Aldo Pacchiano

ICLR 2025arXiv:2409.16197
7
citations

Toward Understanding In-context vs. In-weight Learning

Bryan Chan, Xinyi Chen, Andras Gyorgy et al.

ICLR 2025arXiv:2410.23042
15
citations

True Impact of Cascade Length in Contextual Cascading Bandits

Hyun-jun Choi, Joongkyu Lee, Min-hwan Oh

NEURIPS 2025

A General Online Algorithm for Optimizing Complex Performance Metrics

Wojciech Kotlowski, Marek Wydmuch, Erik Schultheis et al.

ICML 2024

High-dimensional Linear Bandits with Knapsacks

Wanteng Ma, Dong Xia, Jiashuo Jiang

ICML 2024arXiv:2311.01327

Matroid Semi-Bandits in Sublinear Time

Ruo-Chun Tzeng, Naoto Ohsaka, Kaito Ariu

ICML 2024arXiv:2405.17968
1
citations

Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization

Kwang-Sung Jun, Jungtaek Kim

ICML 2024arXiv:2402.07341
4
citations

On Multi-Armed Bandit with Impatient Arms

Yuming Shao, Zhixuan Fang

ICML 2024

Provable Interactive Learning with Hindsight Instruction Feedback

Dipendra Misra, Aldo Pacchiano, Robert Schapire

ICML 2024arXiv:2404.09123
1
citations

Provably Efficient Partially Observable Risk-sensitive Reinforcement Learning with Hindsight Observation

Tonghe Zhang, Yu Chen, Longbo Huang

ICML 2024arXiv:2402.18149