Poster "thompson sampling" Papers

17 papers found

Adaptive Variance Inflation in Thompson Sampling: Efficiency, Safety, Robustness, and Beyond

Feng Zhu, David Simchi-Levi

NEURIPS 2025

Contextual Thompson Sampling via Generation of Missing Data

Kelly W Zhang, Tianhui Cai, Hongseok Namkoong et al.

NEURIPS 2025arXiv:2502.07064
2
citations

Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling

Jasmine Bayrooti, Carl Ek, Amanda Prorok

ICLR 2025arXiv:2410.04988
3
citations

FedRTS: Federated Robust Pruning via Combinatorial Thompson Sampling

Hong Huang, Jinhai Yang, Yuan Chen et al.

NEURIPS 2025arXiv:2501.19122
4
citations

Feel-Good Thompson Sampling for Contextual Bandits: a Markov Chain Monte Carlo Showdown

Emile Anand, Sarah Liaw

NEURIPS 2025arXiv:2507.15290
3
citations

Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions

Marc Brooks, Gabriel Durham, Kihyuk Hong et al.

NEURIPS 2025arXiv:2505.16311
1
citations

Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits

Yuwei Luo, Mohsen Bayati

ICLR 2025arXiv:2306.14872
2
citations

LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search

Pengyi Li, Hongyao Tang, Jinbin Qiao et al.

NEURIPS 2025

Thompson Sampling in Function Spaces via Neural Operators

Rafael Oliveira, Xuesong Wang, Kian Ming Chai et al.

NEURIPS 2025arXiv:2506.21894

Variance-Aware Feel-Good Thompson Sampling for Contextual Bandits

Xuheng Li, Quanquan Gu

NEURIPS 2025arXiv:2511.02123
1
citations

$\mathtt{VITS}$ : Variational Inference Thompson Sampling for contextual bandits

Pierre Clavier, Tom Huix, Alain Oliviero Durmus

ICML 2024

A Bayesian Approach to Online Planning

Nir Greshler, David Ben Eli, Carmel Rabinovitz et al.

ICML 2024arXiv:2406.02103
1
citations

Efficient Exploration for LLMs

Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao et al.

ICML 2024arXiv:2402.00396
37
citations

Feel-Good Thompson Sampling for Contextual Dueling Bandits

Xuheng Li, Heyang Zhao, Quanquan Gu

ICML 2024arXiv:2404.06013
17
citations

Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret Bounds

Shion Takeno, Yu Inatsu, Masayuki Karasuyama et al.

ICML 2024arXiv:2311.03760
9
citations

ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages

Andrew Jesson, Christopher Lu, Gunshi Gupta et al.

ICML 2024arXiv:2306.01460
10
citations

Two-sided Competing Matching Recommendation Markets With Quota and Complementary Preferences Constraints

Yuantong Li, Guang Cheng, Xiaowu Dai

ICML 2024arXiv:2301.10230
8
citations