Counter-Factual Reinforcement Learning: How to Model Decision-Makers That Anticipate The Future

Lee, Ritchie; Wolpert, David H.; Bono, James; Backhaus, Scott; Bent, Russell; Tracey, Brendan

Computer Science > Multiagent Systems

arXiv:1207.0852 (cs)

[Submitted on 3 Jul 2012]

Title:Counter-Factual Reinforcement Learning: How to Model Decision-Makers That Anticipate The Future

Authors:Ritchie Lee, David H. Wolpert, James Bono, Scott Backhaus, Russell Bent, Brendan Tracey

View PDF

Abstract:This paper introduces a novel framework for modeling interacting humans in a multi-stage game. This "iterated semi network-form game" framework has the following desirable characteristics: (1) Bounded rational players, (2) strategic players (i.e., players account for one another's reward functions when predicting one another's behavior), and (3) computational tractability even on real-world systems. We achieve these benefits by combining concepts from game theory and reinforcement learning. To be precise, we extend the bounded rational "level-K reasoning" model to apply to games over multiple stages. Our extension allows the decomposition of the overall modeling problem into a series of smaller ones, each of which can be solved by standard reinforcement learning algorithms. We call this hybrid approach "level-K reinforcement learning". We investigate these ideas in a cyber battle scenario over a smart power grid and discuss the relationship between the behavior predicted by our model and what one might expect of real human defenders and attackers.

Comments:	Decision Making with Multiple Imperfect Decision Makers; Springer. 29 Pages, 6 Figures
Subjects:	Multiagent Systems (cs.MA); Computer Science and Game Theory (cs.GT)
Cite as:	arXiv:1207.0852 [cs.MA]
	(or arXiv:1207.0852v1 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.1207.0852

Submission history

From: Ritchie Lee [view email]
[v1] Tue, 3 Jul 2012 22:30:34 UTC (3,980 KB)

Computer Science > Multiagent Systems

Title:Counter-Factual Reinforcement Learning: How to Model Decision-Makers That Anticipate The Future

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multiagent Systems

Title:Counter-Factual Reinforcement Learning: How to Model Decision-Makers That Anticipate The Future

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators