Unifying Count-Based Exploration and Intrinsic Motivation

Bellemare, Marc G.; Srinivasan, Sriram; Ostrovski, Georg; Schaul, Tom; Saxton, David; Munos, Remi

arXiv:1606.01868 (cs)

[Submitted on 6 Jun 2016 (v1), last revised 7 Nov 2016 (this version, v2)]

Abstract: We consider an agent's uncertainty about its environment and the problem of generalizing this uncertainty across observations. Specifically, we focus on the problem of exploration in non-tabular reinforcement learning. Drawing inspiration from the intrinsic motivation literature, we use density models to measure uncertainty, and propose a novel algorithm for deriving a pseudo-count from an arbitrary density model. This technique enables us to generalize count-based exploration algorithms to the non-tabular case. We apply our ideas to Atari 2600 games, providing sensible pseudo-counts from raw pixels. We transform these pseudo-counts into intrinsic rewards and obtain significantly improved exploration in a number of hard games, including the infamously difficult Montezuma's Revenge.
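
The abstract does not spell out how a pseudo-count is derived, so the following is only a minimal sketch of the idea, not the authors' implementation. It assumes the pseudo-count is computed from a density model's probability of an observation before the update (rho) and after the update (rho_prime) as N_hat = rho * (1 - rho_prime) / (rho_prime - rho). The `EmpiricalDensity` class, the `observe` helper, and the bonus constants `beta` and `eps` are hypothetical names and values chosen for illustration; a tabular model stands in for the pixel-level density model used on Atari.

```python
import math
from collections import Counter


class EmpiricalDensity:
    """Hypothetical tabular density model: rho(x) = N(x) / n."""

    def __init__(self):
        self.counts = Counter()
        self.total = 0

    def prob(self, x):
        # Probability assigned to x given everything observed so far.
        return self.counts[x] / self.total if self.total else 0.0

    def update(self, x):
        self.counts[x] += 1
        self.total += 1


def pseudo_count(rho, rho_prime):
    """Pseudo-count from the probability of x before (rho) and after
    (rho_prime) the model is updated on one more occurrence of x."""
    if rho_prime <= rho:
        # The formula needs the model's probability of x to increase.
        raise ValueError("rho_prime must be strictly greater than rho")
    return rho * (1.0 - rho_prime) / (rho_prime - rho)


def observe(model, x):
    """Update the model on x and return the pseudo-count of x
    as it stood before this observation."""
    rho = model.prob(x)
    model.update(x)
    rho_prime = model.prob(x)
    return pseudo_count(rho, rho_prime)


def exploration_bonus(n_hat, beta=0.05, eps=0.01):
    """Intrinsic reward that shrinks as the pseudo-count grows.
    beta and eps are illustrative values (assumptions)."""
    return beta / math.sqrt(n_hat + eps)


if __name__ == "__main__":
    model = EmpiricalDensity()
    for obs in ["a", "b", "a", "a"]:
        n_hat = observe(model, obs)
        print(obs, round(n_hat, 6), round(exploration_bonus(n_hat), 4))
```

With this tabular model the pseudo-count equals the true visit count of the observation before the update, which is a useful sanity check. Any model that assigns probabilities and can be updated online could replace `EmpiricalDensity`, provided its probability of an observation increases after training on it.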

Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1606.01868 [cs.AI] (or arXiv:1606.01868v2 [cs.AI] for this version)
DOI: https://doi.org/10.48550/arXiv.1606.01868

Submission history

From: Marc G. Bellemare
[v1] Mon, 6 Jun 2016 19:21:32 UTC (2,153 KB)
[v2] Mon, 7 Nov 2016 21:16:21 UTC (2,091 KB)
