Poster "sequential decision problems" Papers
2 papers found
Conference
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Michael Matthews, Michael Beukman, Chris Lu et al.
ICLR 2025arXiv:2410.23208
21
citations
Factored-Reward Bandits with Intermediate Observations
Marco Mussi, Simone Drago, Marcello Restelli et al.
ICML 2024