Poster "credit assignment problem" Papers
2 papers found
Conference
Improving Regret Approximation for Unsupervised Dynamic Environment Generation
Harry Mead, Bruno Lacerda, Jakob Foerster et al.
NEURIPS 2025arXiv:2601.14957
PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
Chang Chen, Junyeob Baek, Fei Deng et al.
ICML 2024arXiv:2406.06793
4
citations