Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems

Hoppe, Heiko; Enders, Tobias; Cappart, Quentin; Schiffer, Maximilian

Computer Science > Machine Learning

arXiv:2312.08884 (cs)

[Submitted on 14 Dec 2023 (v1), last revised 19 May 2024 (this version, v2)]

Title:Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems

Authors:Heiko Hoppe, Tobias Enders, Quentin Cappart, Maximilian Schiffer

View PDF

Abstract:We study vehicle dispatching in autonomous mobility on demand (AMoD) systems, where a central operator assigns vehicles to customer requests or rejects these with the aim of maximizing its total profit. Recent approaches use multi-agent deep reinforcement learning (MADRL) to realize scalable yet performant algorithms, but train agents based on local rewards, which distorts the reward signal with respect to the system-wide profit, leading to lower performance. We therefore propose a novel global-rewards-based MADRL algorithm for vehicle dispatching in AMoD systems, which resolves so far existing goal conflicts between the trained agents and the operator by assigning rewards to agents leveraging a counterfactual baseline. Our algorithm shows statistically significant improvements across various settings on real-world data compared to state-of-the-art MADRL algorithms with local rewards. We further provide a structural analysis which shows that the utilization of global rewards can improve implicit vehicle balancing and demand forecasting abilities. Our code is available at this https URL.

Comments:	22 pages, 6 figures, extended version of paper accepted at the 6th Learning for Dynamics & Control Conference (L4DC 2024)
Subjects:	Machine Learning (cs.LG); Multiagent Systems (cs.MA); Systems and Control (eess.SY)
Cite as:	arXiv:2312.08884 [cs.LG]
	(or arXiv:2312.08884v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2312.08884

Submission history

From: Heiko Hoppe [view email]
[v1] Thu, 14 Dec 2023 12:47:33 UTC (275 KB)
[v2] Sun, 19 May 2024 08:09:08 UTC (403 KB)

Computer Science > Machine Learning

Title:Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators