Soft Hindsight Experience Replay

He, Qiwei; Zhuang, Liansheng; Li, Houqiang

Computer Science > Artificial Intelligence

arXiv:2002.02089 (cs)

[Submitted on 6 Feb 2020]

Title:Soft Hindsight Experience Replay

Authors:Qiwei He, Liansheng Zhuang, Houqiang Li

View PDF

Abstract:Efficient learning in the environment with sparse rewards is one of the most important challenges in Deep Reinforcement Learning (DRL). In continuous DRL environments such as robotic arms control, Hindsight Experience Replay (HER) has been shown an effective solution. However, due to the brittleness of deterministic methods, HER and its variants typically suffer from a major challenge for stability and convergence, which significantly affects the final performance. This challenge severely limits the applicability of such methods to complex real-world domains. To tackle this challenge, in this paper, we propose Soft Hindsight Experience Replay (SHER), a novel approach based on HER and Maximum Entropy Reinforcement Learning (MERL), combining the failed experiences reuse and maximum entropy probabilistic inference model. We evaluate SHER on Open AI Robotic manipulation tasks with sparse rewards. Experimental results show that, in contrast to HER and its variants, our proposed SHER achieves state-of-the-art performance, especially in the difficult HandManipulation tasks. Furthermore, our SHER method is more stable, achieving very similar performance across different random seeds.

Comments:	7 pages, 5 figures, 1 table, submitted to IJCAI2020
Subjects:	Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2002.02089 [cs.AI]
	(or arXiv:2002.02089v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2002.02089

Submission history

From: Qiwei He [view email]
[v1] Thu, 6 Feb 2020 03:57:04 UTC (534 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2020-02

Change to browse by:

cs
cs.RO

References & Citations

DBLP - CS Bibliography

listing | bibtex

Liansheng Zhuang
Houqiang Li

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Soft Hindsight Experience Replay

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Soft Hindsight Experience Replay

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators