"stochastic environments" Papers
5 papers found
Conference
PlanU: Large Language Model Reasoning through Planning under Uncertainty
Ziwei Deng, Mian Deng, Chenjing Liang et al.
NEURIPS 2025arXiv:2510.18442
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Baiting Luo, Ava Pettet, Aron Laszka et al.
ICLR 2025oralarXiv:2502.21186
3
citations
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
Qining Zhang, Lei Ying
ICLR 2025arXiv:2409.17401
10
citations
Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays
Qingyuan Wu, Simon Zhan, Yixuan Wang et al.
ICML 2024arXiv:2402.03141
4
citations
To the Max: Reinventing Reward in Reinforcement Learning
Grigorii Veviurko, Wendelin Boehmer, Mathijs de Weerdt
ICML 2024arXiv:2402.01361
11
citations