Papers by Juan Duque
3 papers found
Conference
Advantage Alignment Algorithms
Juan Duque, Milad Aghajohari, Timotheus Cooijmans et al.
ICLR 2025, arXiv:2406.14662
6 citations
Self-Play $Q$-Learners Can Provably Collude in the Iterated Prisoner's Dilemma
Quentin Bertrand, Juan Duque, Emilio Calvano et al.
ICML 2025, arXiv:2312.08484
1 citation
LOQA: Learning with Opponent Q-Learning Awareness
Milad Aghajohari, Juan Duque, Timotheus Cooijmans et al.
ICLR 2024, arXiv:2405.01035
7 citations