"online exploration" Papers
3 papers found
Conference
Self-Improvement in Language Models: The Sharpening Mechanism
Audrey Huang, Adam Block, Dylan Foster et al.
ICLR 2025, arXiv:2412.01951
60 citations
Meta-Reinforcement Learning Robust to Distributional Shift Via Performing Lifelong In-Context Learning
TengYe Xu, Zihao Li, Qinyuan Ren
ICML 2024
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret
Han Zhong, Jiachen Hu, Yecheng Xue et al.
ICML 2024, arXiv:2302.10796
11 citations