MADiff: Offline Multi-agent Learning with Diffusion Models

Zhu, Zhengbang; Liu, Minghuan; Mao, Liyuan; Kang, Bingyi; Xu, Minkai; Yu, Yong; Ermon, Stefano; Zhang, Weinan

Computer Science > Artificial Intelligence

arXiv:2305.17330 (cs)

[Submitted on 27 May 2023 (v1), last revised 25 May 2024 (this version, v4)]

Title:MADiff: Offline Multi-agent Learning with Diffusion Models

Authors:Zhengbang Zhu, Minghuan Liu, Liyuan Mao, Bingyi Kang, Minkai Xu, Yong Yu, Stefano Ermon, Weinan Zhang

View PDF HTML (experimental)

Abstract:Diffusion model (DM) recently achieved huge success in various scenarios including offline reinforcement learning, where the diffusion planner learn to generate desired trajectories during online evaluations. However, despite the effectiveness in single-agent learning, it remains unclear how DMs can operate in multi-agent problems, where agents can hardly complete teamwork without good coordination by independently modeling each agent's trajectories. In this paper, we propose MADiff, a novel generative multi-agent learning framework to tackle this problem. MADiff is realized with an attention-based diffusion model to model the complex coordination among behaviors of multiple agents. To the best of our knowledge, MADiff is the first diffusion-based multi-agent learning framework, which behaves as both a decentralized policy and a centralized controller. During decentralized executions, MADiff simultaneously performs teammate modeling, and the centralized controller can also be applied in multi-agent trajectory predictions. Our experiments show the superior performance of MADiff compared to baseline algorithms in a wide range of multi-agent learning tasks, which emphasizes the effectiveness of MADiff in modeling complex multi-agent interactions. Our code is available at this https URL.

Comments:	19 pages, 10 figures, 7 tables. The first two authors contributed equally to the work
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2305.17330 [cs.AI]
	(or arXiv:2305.17330v4 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2305.17330

Submission history

From: Zhengbang Zhu [view email]
[v1] Sat, 27 May 2023 02:14:09 UTC (2,609 KB)
[v2] Mon, 14 Aug 2023 13:48:38 UTC (2,609 KB)
[v3] Wed, 20 Dec 2023 14:54:15 UTC (3,074 KB)
[v4] Sat, 25 May 2024 13:02:09 UTC (4,013 KB)

Computer Science > Artificial Intelligence

Title:MADiff: Offline Multi-agent Learning with Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:MADiff: Offline Multi-agent Learning with Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators