"exploration-exploitation tradeoff" Papers

12 papers found

Bayesian Optimization for Unknown Cost-Varying Variable Subsets with No-Regret Costs

Vu Viet Hoang, Quoc Anh Hoang Nguyen, Hung The Tran

AAAI 2025 · paper · arXiv:2412.15863

Breaking the $\log(1/\Delta_2)$ Barrier: Better Batched Best Arm Identification with Adaptive Grids

Tianyuan Jin, Qin Zhang, Dongruo Zhou

ICLR 2025

Feel-Good Thompson Sampling for Contextual Bandits: a Markov Chain Monte Carlo Showdown

Emile Anand, Sarah Liaw

NeurIPS 2025 · arXiv:2507.15290

Geometry Meets Incentives: Sample-Efficient Incentivized Exploration with Linear Contexts

Ben Schiffer, Mark Sellke

NeurIPS 2025 · spotlight · arXiv:2506.01685

LASeR: Towards Diversified and Generalizable Robot Design with Large Language Models

Junru Song, Yang Yang, Huan Xiao et al.

ICLR 2025

Learning to price with resource constraints: from full information to machine-learned prices

Ruicheng Ao, Jiashuo Jiang, David Simchi-Levi

NeurIPS 2025 · arXiv:2501.14155

Offline-to-Online Hyperparameter Transfer for Stochastic Bandits

Dravyansh Sharma, Arun Suggala

AAAI 2025 · paper · arXiv:2501.02926

Online Feedback Efficient Active Target Discovery in Partially Observable Environments

Anindya Sarkar, Binglin Ji, Yevgeniy Vorobeychik

NeurIPS 2025 · arXiv:2505.06535

PlanU: Large Language Model Reasoning through Planning under Uncertainty

Ziwei Deng, Mian Deng, Chenjing Liang et al.

NeurIPS 2025 · arXiv:2510.18442

Entropy-Reinforced Planning with Large Language Models for Drug Discovery

Xuefeng Liu, Chih-chan Tien, Peng Ding et al.

ICML 2024 · arXiv:2406.07025

Optimal Batched Linear Bandits

Xuanfei Ren, Tianyuan Jin, Pan Xu

ICML 2024 · arXiv:2406.04137

Stochastic Bandits with ReLU Neural Networks

Kan Xu, Hamsa Bastani, Surbhi Goel et al.

ICML 2024 · arXiv:2405.07331
