Multi-Agent Reinforcement Learning for Dynamic Ocean Monitoring by a Swarm of Buoys

Kouzehgar, Maryam; Meghjani, Malika; Bouffanais, Roland

doi:10.1109/IEEECONF38699.2020.9389128

arXiv:2012.11641 (cs)

[Submitted on 21 Dec 2020]

Abstract: The autonomous marine environmental monitoring problem traditionally encompasses an area coverage problem that can only be carried out effectively by a multi-robot system. In this paper, we focus on robotic swarms, which are typically operated and controlled by means of simple swarming behaviors obtained from a subtle yet ad hoc combination of bio-inspired strategies. We propose a novel and structured approach to area coverage using multi-agent reinforcement learning (MARL) that effectively deals with the non-stationarity of environmental features. Specifically, we propose two dynamic area coverage approaches: (1) swarm-based MARL and (2) coverage-range-based MARL. The former is trained using the multi-agent deep deterministic policy gradient (MADDPG) approach, whereas a modified version of MADDPG is introduced for the latter, with a reward function that intrinsically leads to a collective behavior. Both methods are tested and validated on regions of different geometric shapes with equal surface area (square vs. rectangle), yielding acceptable area coverage and benefiting from structured learning in non-stationary environments. Both approaches are advantageous compared to a naïve swarming method. However, coverage-range-based MARL outperforms swarm-based MARL, with stronger convergence features in the learning criteria and a higher spreading of agents for area coverage.
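
To make the notion of area coverage over equal-area regions concrete, the following is a minimal sketch of one way a coverage fraction could be estimated for agents with a circular coverage range. It is not the paper's metric or reward function. The function name `coverage_fraction`, the grid-sampling scheme, the region sizes, the agent positions, and the range value are all assumptions chosen for illustration.

```python
import math


def coverage_fraction(agents, width, height, radius, cells_per_side=50):
    """Estimate the fraction of a width x height rectangle that lies
    within `radius` of at least one agent, by sampling cell centers
    on a regular grid. Illustrative only; not taken from the paper."""
    covered = 0
    total = cells_per_side * cells_per_side
    for i in range(cells_per_side):
        for j in range(cells_per_side):
            # Center of grid cell (i, j) in region coordinates.
            x = (i + 0.5) * width / cells_per_side
            y = (j + 0.5) * height / cells_per_side
            if any(math.hypot(x - ax, y - ay) <= radius for ax, ay in agents):
                covered += 1
    return covered / total


# Two hypothetical regions with equal surface area (16 square units).
square_agents = [(1.0, 1.0), (3.0, 1.0), (1.0, 3.0), (3.0, 3.0)]
rect_agents = [(1.0, 1.0), (3.0, 1.0), (5.0, 1.0), (7.0, 1.0)]

square_cov = coverage_fraction(square_agents, 4.0, 4.0, radius=1.0)
rect_cov = coverage_fraction(rect_agents, 8.0, 2.0, radius=1.0)
print(f"square: {square_cov:.3f}, rectangle: {rect_cov:.3f}")
```

In both hypothetical layouts the four coverage disks are non-overlapping and lie fully inside the region, so each estimate should be close to π/4. A clustered swarm would score lower, which is the intuition behind preferring a higher spreading of agents.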

Comments:	Accepted for Publication at IEEE/MTS OCEANS 2020
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2012.11641 [cs.RO]
	(or arXiv:2012.11641v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2012.11641
Journal reference:	in Proceedings of Global Oceans 2020: Singapore-US Gulf Coast (pp. 1-8). IEEE
Related DOI:	https://doi.org/10.1109/IEEECONF38699.2020.9389128

Submission history

From: Roland Bouffanais
[v1] Mon, 21 Dec 2020 19:11:23 UTC (1,749 KB)