by Ulrich Armel Mbou Sob Papers
2 papers found
Conference
Breaking the Performance Ceiling in Reinforcement Learning requires Inference Strategies
Felix Chalumeau, Daniel Rajaonarivonivelomanantsoa, Ruan John de Kock et al.
NEURIPS 2025oralarXiv:2505.21236
2
citations
Oryx: a Scalable Sequence Model for Many-Agent Coordination in Offline MARL
Juan Formanek, Omayma Mahjoub, Louay Nessir et al.
NEURIPS 2025oralarXiv:2505.22151