Exploration by Random Network Distillation

Burda, Yuri; Edwards, Harrison; Storkey, Amos; Klimov, Oleg

Computer Science > Machine Learning

arXiv:1810.12894 (cs)

[Submitted on 30 Oct 2018]

Title:Exploration by Random Network Distillation

Authors:Yuri Burda, Harrison Edwards, Amos Storkey, Oleg Klimov

View PDF

Abstract:We introduce an exploration bonus for deep reinforcement learning methods that is easy to implement and adds minimal overhead to the computation performed. The bonus is the error of a neural network predicting features of the observations given by a fixed randomly initialized neural network. We also introduce a method to flexibly combine intrinsic and extrinsic rewards. We find that the random network distillation (RND) bonus combined with this increased flexibility enables significant progress on several hard exploration Atari games. In particular we establish state of the art performance on Montezuma's Revenge, a game famously difficult for deep reinforcement learning methods. To the best of our knowledge, this is the first method that achieves better than average human performance on this game without using demonstrations or having access to the underlying state of the game, and occasionally completes the first level.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1810.12894 [cs.LG]
	(or arXiv:1810.12894v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.12894

Submission history

From: Yuri Burda [view email]
[v1] Tue, 30 Oct 2018 17:44:42 UTC (3,449 KB)

Computer Science > Machine Learning

Title:Exploration by Random Network Distillation

Submission history

Access Paper:

References & Citations

3 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Exploration by Random Network Distillation

Submission history

Access Paper:

References & Citations

3 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators