"count-based exploration" Papers
2 papers found
Conference
Online Preference Alignment for Language Models via Count-based Exploration
Chenjia Bai, Yang Zhang, Shuang Qiu et al.
ICLR 2025arXiv:2501.12735
20
citations
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization
Liam Schramm, Abdeslam Boularias
ICML 2024arXiv:2407.05511
1
citations