"regret analysis" Papers

20 papers found

ADAM Optimization with Adaptive Batch Selection

Gyu Yeol Kim, Min-hwan Oh

ICLR 2025arXiv:2512.06795
2
citations

MALinZero: Efficient Low-Dimensional Search for Mastering Complex Multi-Agent Planning

Sizhe Tang, Jiayu Chen, Tian Lan

NEURIPS 2025arXiv:2511.06142
4
citations

Online Two-Stage Submodular Maximization

Iasonas Nikolaou, Miltiadis Stouras, Stratis Ioannidis et al.

NEURIPS 2025arXiv:2510.19480

Pareto Optimal Risk-Agnostic Distributional Bandits with Heavy-Tail Rewards

Kyungjae Lee, Dohyeong Kim, Taehyun Cho et al.

NEURIPS 2025

Precise Asymptotics and Refined Regret of Variance-Aware UCB

Yingying Fan, Yuxuan Han, Jinchi Lv et al.

NEURIPS 2025spotlightarXiv:2412.08843
1
citations

Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach

Swetha Ganesh, Vaneet Aggarwal

NEURIPS 2025arXiv:2505.19986
3
citations

Regret Analysis of Multi-task Representation Learning for Linear-Quadratic Adaptive Control

Bruce D. Lee, Leonardo F. Toso, Thomas T. Zhang et al.

AAAI 2025paperarXiv:2407.05781
4
citations

Second Order Bounds for Contextual Bandits with Function Approximation

Aldo Pacchiano

ICLR 2025arXiv:2409.16197
7
citations

Toward Understanding In-context vs. In-weight Learning

Bryan Chan, Xinyi Chen, Andras Gyorgy et al.

ICLR 2025arXiv:2410.23042
15
citations

True Impact of Cascade Length in Contextual Cascading Bandits

Hyun-jun Choi, Joongkyu Lee, Min-hwan Oh

NEURIPS 2025

A General Online Algorithm for Optimizing Complex Performance Metrics

Wojciech Kotlowski, Marek Wydmuch, Erik Schultheis et al.

ICML 2024

High-dimensional Linear Bandits with Knapsacks

Wanteng Ma, Dong Xia, Jiashuo Jiang

ICML 2024arXiv:2311.01327

Matroid Semi-Bandits in Sublinear Time

Ruo-Chun Tzeng, Naoto Ohsaka, Kaito Ariu

ICML 2024arXiv:2405.17968
1
citations

Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization

Kwang-Sung Jun, Jungtaek Kim

ICML 2024arXiv:2402.07341
4
citations

On Multi-Armed Bandit with Impatient Arms

Yuming Shao, Zhixuan Fang

ICML 2024

Provable Interactive Learning with Hindsight Instruction Feedback

Dipendra Misra, Aldo Pacchiano, Robert Schapire

ICML 2024arXiv:2404.09123
1
citations

Provably Efficient Partially Observable Risk-sensitive Reinforcement Learning with Hindsight Observation

Tonghe Zhang, Yu Chen, Longbo Huang

ICML 2024arXiv:2402.18149

Regret Analysis of Repeated Delegated Choice

Suho Shin, Keivan Rezaei, Mohammad Hajiaghayi et al.

AAAI 2024paperarXiv:2310.04884
8
citations

Robustly Improving Bandit Algorithms with Confounded and Selection Biased Offline Data: A Causal Approach

Wen Huang, Xintao Wu

AAAI 2024paperarXiv:2312.12731
2
citations

Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge

Meshal Alharbi, Mardavij Roozbehani, Munther Dahleh

AAAI 2024paperarXiv:2312.12558
4
citations