Poster "multi-armed bandit" Papers
3 papers found
Conference
Beyond Greedy Exits: Improved Early Exit Decisions for Risk Control and Reliability
Divya Jyoti Bajpai, Manjesh Kumar Hanawal
NEURIPS 2025arXiv:2509.23666
1
citations
ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning
Mingqi Yuan, Bo Li, Xin Jin et al.
ICCV 2025arXiv:2503.06101
1
citations
On Multi-Armed Bandit with Impatient Arms
Yuming Shao, Zhixuan Fang
ICML 2024