"sparse-reward environments" Papers
2 papers found
Conference
PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
Chang Chen, Junyeob Baek, Fei Deng et al.
ICML 2024arXiv:2406.06793
4
citations
When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions
Zhening Li, Gabriel Poesia, Armando Solar-Lezama
ICML 2024oralarXiv:2406.07897
1
citations