"multi-agent reinforcement learning" Papers

65 papers found • Page 1 of 2

Adaptive Context Length Optimization with Low-Frequency Truncation for Multi-Agent Reinforcement Learning

Wenchang Duan, Yaoliang Yu, Jiwan He et al.

NEURIPS 2025oralarXiv:2510.26389

Advantage Alignment Algorithms

Juan Duque, Milad Aghajohari, Timotheus Cooijmans et al.

ICLR 2025arXiv:2406.14662
6
citations

A Generalist Hanabi Agent

Arjun V Sudhakar, Hadi Nekoei, Mathieu Reymond et al.

ICLR 2025arXiv:2503.14555
2
citations

AgentMixer: Multi-Agent Correlated Policy Factorization

Zhiyuan Li, Wenshuai Zhao, Lijun Wu et al.

AAAI 2025paperarXiv:2401.08728
4
citations

A Policy-Gradient Approach to Solving Imperfect-Information Games with Best-Iterate Convergence

Mingyang Liu, Gabriele Farina, Asuman Ozdaglar

ICLR 2025arXiv:2408.00751
3
citations

A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning

Anjie Liu, Jianhong Wang, Samuel Kaski et al.

NEURIPS 2025arXiv:2510.17697

Breaking the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning

Laixi Shi, Jingchu Gai, Eric Mazumdar et al.

ICML 2025oralarXiv:2409.20067
4
citations

Bridging Training and Execution via Dynamic Directed Graph-Based Communication in Cooperative Multi-Agent Systems

Zhuohui Zhang, Bin He, Bin Cheng et al.

AAAI 2025paperarXiv:2408.07397
6
citations

Distributionally Robust Multi-Agent Reinforcement Learning for Dynamic Chute Mapping

Guangyi Liu, Suzan Iloglu, Michael Caldara et al.

ICML 2025arXiv:2503.09755
3
citations

DoF: A Diffusion Factorization Framework for Offline Multi-Agent Reinforcement Learning

Chao Li, Ziwei Deng, Chenxing Lin et al.

ICLR 2025
7
citations

Efficient Multi-agent Offline Coordination via Diffusion-based Trajectory Stitching

Lei Yuan, Yuqi Bian, Lihe Li et al.

ICLR 2025oral
5
citations

Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning

Simin Li, Zihao Mao, Hanxiao Li et al.

NEURIPS 2025arXiv:2510.11824

eQMARL: Entangled Quantum Multi-Agent Reinforcement Learning for Distributed Cooperation over Quantum Channels

Alexander DeRieux, Walid Saad

ICLR 2025arXiv:2405.17486
5
citations

Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning

Xinran Li, Xiaolu Wang, Chenjia Bai et al.

ICLR 2025arXiv:2502.19717
5
citations

FlickerFusion: Intra-trajectory Domain Generalizing Multi-agent Reinforcement Learning

Woosung Koh, Wonbeen Oh, Siyeol Kim et al.

ICLR 2025

GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL

Lang Qin, Ziming Wang, Runhao Jiang et al.

AAAI 2025paperarXiv:2404.15597
3
citations

High-order Interactions Modeling for Interpretable Multi-Agent Q-Learning

Qinyu Xu, Yuanyang Zhu, Xuefei Wu et al.

NEURIPS 2025arXiv:2510.20218
1
citations

Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models

Logan Cross, Violet Xiang, Agam Bhatia et al.

ICLR 2025arXiv:2407.07086
24
citations

Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning

Yiqun Chen, Lingyong Yan, Weiwei Sun et al.

NEURIPS 2025arXiv:2501.15228
29
citations

InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma

Xiaoxuan Hou, Jiayi Yuan, Joel Z Leibo et al.

ICLR 2025oralarXiv:2411.09856
2
citations

Investigating Relational State Abstraction in Collaborative MARL

Sharlin Utke, Jeremie Houssineau, Giovanni Montana

AAAI 2025paperarXiv:2412.15388
1
citations

Learn How to Query from Unlabeled Data Streams in Federated Learning

Yuchang Sun, Xinran Li, Tao Lin et al.

AAAI 2025paperarXiv:2412.08138
1
citations

LOPT: Learning Optimal Pigovian Tax in Sequential Social Dilemmas

Yun Hua, Shang Gao, Wenhao Li et al.

NEURIPS 2025

MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures

Elena Zamaraeva, Christopher Collins, George Darling et al.

NEURIPS 2025arXiv:2506.04195
1
citations

MALinZero: Efficient Low-Dimensional Search for Mastering Complex Multi-Agent Planning

Sizhe Tang, Jiayu Chen, Tian Lan

NEURIPS 2025arXiv:2511.06142
4
citations

Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning

Emile Anand, Ishani Karmarkar, Guannan Qu

NEURIPS 2025spotlightarXiv:2412.00661
5
citations

MOSDT: Self-Distillation-Based Decision Transformer for Multi-Agent Offline Safe Reinforcement Learning

Yuchen Xia, Yunjian Xu

NEURIPS 2025

Multi-Agent Reinforcement Learning with Communication-Constrained Priors

Guang Yang, Jingwen Qiao, Tianpei Yang et al.

NEURIPS 2025arXiv:2512.03528

OPHR: Mastering Volatility Trading with Multi-Agent Deep Reinforcement Learning

Zeting Chen, Xinyu Cai, Molei Qin et al.

NEURIPS 2025

Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Observation Delays

Songchen Fu, Siang Chen, Shaojing Zhao et al.

NEURIPS 2025

ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning

Ziyu Wan, Yunxiang Li, Xiaoyu Wen et al.

NEURIPS 2025arXiv:2503.09501
40
citations

Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective

Yang Zhang, Xinran Li, Jianing Ye et al.

NEURIPS 2025arXiv:2505.20922
5
citations

Sequential Multi-Agent Dynamic Algorithm Configuration

Chen Lu, Ke Xue, Lei Yuan et al.

NEURIPS 2025arXiv:2510.23535
1
citations

SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning

Xu Wan, Chao Yang, Cheng Yang et al.

AAAI 2025paperarXiv:2503.01458
1
citations

Toward Efficient Multi-Agent Exploration With Trajectory Entropy Maximization

Tianxu Li, Kun Zhu

ICLR 2025
2
citations

Trajectory-Class-Aware Multi-Agent Reinforcement Learning

Hyungho Na, Kwanghyeon Lee, Sumin Lee et al.

ICLR 2025arXiv:2503.01440
1
citations

Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning

Jianming Chen, Yawen Wang, Junjie Wang et al.

AAAI 2025paperarXiv:2412.15619
7
citations

Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning

Hao Ma, Shijie Wang, Zhiqiang Pu et al.

AAAI 2025paperarXiv:2502.13430

Wolfpack Adversarial Attack for Robust Multi-Agent Reinforcement Learning

Sunwoo Lee, Jaebak Hwang, Yonghyeon Jo et al.

ICML 2025arXiv:2502.02844
1
citations

Wonder Wins Ways: Curiosity-Driven Exploration through Multi-Agent Contextual Calibration

Yiyuan Pan, Zhe Liu, Hesheng Wang

NEURIPS 2025arXiv:2509.20648

Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent Reinforcement Learning

Yanwen Ba, Xuan Liu, Xinning Chen et al.

AAAI 2024paperarXiv:2312.12095
5
citations

ConcaveQ: Non-monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning

Huiqun Li, Hanhan Zhou, Yifei Zou et al.

AAAI 2024paperarXiv:2312.15555
12
citations

Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning

Matteo Bettini, Ryan Kortvelesy, Amanda Prorok

ICML 2024oralarXiv:2405.15054
9
citations

Detecting Influence Structures in Multi-Agent Reinforcement Learning

Fabian Raoul Pieroth, Katherine Fitch, Lenz Belzner

ICML 2024

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning

Yizhe Huang, Anji Liu, Fanqi Kong et al.

ICML 2024arXiv:2406.08002
5
citations

FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning

Wenzhe Li, Zihan Ding, Seth Karten et al.

ICML 2024arXiv:2406.02081
10
citations

FoX: Formation-Aware Exploration in Multi-Agent Reinforcement Learning

Yonghyeon Jo, Sunwoo Lee, Junghyuk Yum et al.

AAAI 2024paperarXiv:2308.11272
16
citations

HGAP: Boosting Permutation Invariant and Permutation Equivariant in Multi-Agent Reinforcement Learning via Graph Attention Network

Bor Jiun Lin, Chun-Yi Lee

ICML 2024

Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning

Zeyang Liu, Lipeng Wan, Xinrui Yang et al.

AAAI 2024paperarXiv:2402.17978
7
citations

Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games

Songtao Feng, Ming Yin, Yu-Xiang Wang et al.

ICML 2024arXiv:2308.08858
2
citations
PreviousNext