"best-of-both-worlds algorithms" Papers
2 papers found
Conference
Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
Shinji Ito, Kevin Jamieson, Haipeng Luo et al.
NEURIPS 2025arXiv:2510.17103
2
citations
Exploration by Optimization with Hybrid Regularizers: Logarithmic Regret with Adversarial Robustness in Partial Monitoring
Taira Tsuchiya, Shinji Ito, Junya Honda
ICML 2024arXiv:2402.08321
3
citations