"credit assignment problem" Papers
4 papers found
Conference
Improving Regret Approximation for Unsupervised Dynamic Environment Generation
Harry Mead, Bruno Lacerda, Jakob Foerster et al.
NEURIPS 2025arXiv:2601.14957
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Yekun Chai, Haoran Sun, Huang Fang et al.
ICLR 2025oralarXiv:2410.02743
9
citations
SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning
Xu Wan, Chao Yang, Cheng Yang et al.
AAAI 2025paperarXiv:2503.01458
1
citations
PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
Chang Chen, Junyeob Baek, Fei Deng et al.
ICML 2024arXiv:2406.06793
4
citations