A Regulation Enforcement Solution for Multi-agent Reinforcement Learning

Sun, Fan-Yun; Chang, Yen-Yu; Wu, Yueh-Hua; Lin, Shou-De

Computer Science > Computer Science and Game Theory

arXiv:1901.10059 (cs)

[Submitted on 29 Jan 2019 (v1), last revised 25 Oct 2019 (this version, v5)]

Title:A Regulation Enforcement Solution for Multi-agent Reinforcement Learning

Authors:Fan-Yun Sun, Yen-Yu Chang, Yueh-Hua Wu, Shou-De Lin

View PDF

Abstract:Human behaviors are regularized by a variety of norms or regulations, either to maintain orders or to enhance social welfare. If artificially intelligent (AI) agents make decisions on behalf of human beings, we would hope they can also follow established regulations while interacting with humans or other AI agents. However, it is possible that an AI agent can opt to disobey the regulations (being defective) for self-interests. In this paper, we aim to answer the following question: Consider a multi-agent decentralized environment. Agents make decisions in complete isolation of other agents. Each agent knows the state of its own MDP and its own actions but it does not know the states and the actions taken by other players. There is a set of regulations for all agents to follow. Although most agents are benign and will comply to regulations but not all agents are compliant at first, can we develop a framework such that it is in the self-interest of non-compliant agents to comply after all?. We first introduce the problem as Regulation Enforcement and formulate it using reinforcement learning and game theory under the scenario where agents make decisions in complete isolation of other agents. We then propose a solution based on the key idea that although we could not alter how defective agents choose to behave, we can, however, leverage the aggregated power of compliant agents to boycott the defective ones. We conducted simulated experiments on two scenarios: Replenishing Resource Management Dilemma and Diminishing Reward Shaping Enforcement, using deep multi-agent reinforcement learning algorithms. We further use empirical game-theoretic analysis to show that the method alters the resulting empirical payoff matrices in a way that promotes compliance (making mutual compliant a Nash Equilibrium).

Subjects:	Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
Cite as:	arXiv:1901.10059 [cs.GT]
	(or arXiv:1901.10059v5 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.1901.10059

Submission history

From: Fan-Yun Sun [view email]
[v1] Tue, 29 Jan 2019 01:12:25 UTC (430 KB)
[v2] Sat, 8 Jun 2019 15:44:47 UTC (410 KB)
[v3] Sat, 15 Jun 2019 17:56:43 UTC (410 KB)
[v4] Wed, 23 Oct 2019 16:24:17 UTC (410 KB)
[v5] Fri, 25 Oct 2019 13:37:09 UTC (418 KB)

Computer Science > Computer Science and Game Theory

Title:A Regulation Enforcement Solution for Multi-agent Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Science and Game Theory

Title:A Regulation Enforcement Solution for Multi-agent Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators