Generative Adversarial Self-Imitation Learning

Guo, Yijie; Oh, Junhyuk; Singh, Satinder; Lee, Honglak

Computer Science > Machine Learning

arXiv:1812.00950 (cs)

[Submitted on 3 Dec 2018]

Title:Generative Adversarial Self-Imitation Learning

Authors:Yijie Guo, Junhyuk Oh, Satinder Singh, Honglak Lee

View PDF

Abstract:This paper explores a simple regularizer for reinforcement learning by proposing Generative Adversarial Self-Imitation Learning (GASIL), which encourages the agent to imitate past good trajectories via generative adversarial imitation learning framework. Instead of directly maximizing rewards, GASIL focuses on reproducing past good trajectories, which can potentially make long-term credit assignment easier when rewards are sparse and delayed. GASIL can be easily combined with any policy gradient objective by using GASIL as a learned shaped reward function. Our experimental results show that GASIL improves the performance of proximal policy optimization on 2D Point Mass and MuJoCo environments with delayed reward and stochastic dynamics.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1812.00950 [cs.LG]
	(or arXiv:1812.00950v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1812.00950

Submission history

From: Yijie Guo [view email]
[v1] Mon, 3 Dec 2018 18:21:18 UTC (4,015 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-12

Change to browse by:

cs
cs.AI
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yijie Guo
Junhyuk Oh
Satinder Singh
Honglak Lee

export BibTeX citation

Computer Science > Machine Learning

Title:Generative Adversarial Self-Imitation Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Generative Adversarial Self-Imitation Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators