"dueling bandits" Papers
5 papers found
Conference
Learning Across the Gap: Hybrid Multi-armed Bandits with Heterogeneous Offline and Online Data
Qijia He, Minghan Wang, Xutong Liu et al.
NEURIPS 2025
Non-Stationary Dueling Bandits Under a Weighted Borda Criterion
Joe Suk, Arpit Agarwal
ICLR 2025arXiv:2403.12950
2
citations
Borda Regret Minimization for Generalized Linear Dueling Bandits
Yue Wu, Tao Jin, Qiwei Di et al.
ICML 2024arXiv:2303.08816
15
citations
Eliciting Kemeny Rankings
Anne-Marie George, Christos Dimitrakakis
AAAI 2024paperarXiv:2312.11663
1
citations
Generating Chain-of-Thoughts with a Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought
Zhen-Yu Zhang, Siwei Han, Huaxiu Yao et al.
ICML 2024arXiv:2402.06918
4
citations