"sparse reward settings" Papers
2 papers found
Conference
Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control
Georgios Papoudakis, Thomas Coste, Jianye Hao et al.
NEURIPS 2025arXiv:2509.01720
Probabilistic Offline Policy Ranking with Approximate Bayesian Computation
Longchao Da, Porter Jenkins, Trevor Schwantes et al.
AAAI 2024paperarXiv:2312.11551
3
citations