Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization

Zhou, Qi; Li, Houqiang; Wang, Jie

arXiv:1911.12574 (cs)

[Submitted on 28 Nov 2019]

Abstract:Model-based reinforcement learning algorithms tend to achieve higher sample efficiency than model-free methods. However, due to the inevitable errors of learned models, model-based methods struggle to achieve the same asymptotic performance as model-free methods.
In this paper, we propose a Policy Optimization method with Model-Based Uncertainty (POMBU), a novel model-based approach that can effectively improve the asymptotic performance using the uncertainty in Q-values. We derive an upper bound of the uncertainty, based on which we can approximate the uncertainty accurately and efficiently for model-based methods. We further propose an uncertainty-aware policy optimization algorithm that optimizes the policy conservatively to encourage performance improvement with high probability. This can significantly alleviate the overfitting of the policy to inaccurate models.
Experiments show POMBU can outperform existing state-of-the-art policy optimization algorithms in terms of sample efficiency and asymptotic performance. Moreover, the experiments demonstrate the excellent robustness of POMBU compared to previous model-based approaches.
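
The abstract does not state the uncertainty bound or the policy objective, so the following is only a minimal illustrative sketch of the general idea of conservative, uncertainty-penalized policy evaluation, not POMBU itself. It assumes (this is not from the abstract) that Q-value uncertainty is approximated by the disagreement among Q estimates from an ensemble of learned models; the function name `conservative_advantage` and the `penalty` parameter are hypothetical.

```python
from statistics import mean, pstdev


def conservative_advantage(q_samples, baseline, penalty=1.0):
    """Uncertainty-penalized advantage for one (state, action) pair.

    q_samples: Q-value estimates, one per model in a hypothetical ensemble.
    baseline:  a state-value estimate subtracted to form the advantage.
    penalty:   how strongly disagreement among the models is penalized.

    Assumption: the population standard deviation across the ensemble is
    used as a stand-in for Q-value uncertainty. The paper derives its own
    upper bound, which is not reproduced here.
    """
    q_mean = mean(q_samples)
    q_uncertainty = pstdev(q_samples)
    return (q_mean - baseline) - penalty * q_uncertainty


# Action A: the models agree. Action B: higher mean, but the models disagree.
q_a = [1.0, 1.1, 0.9]
q_b = [3.0, -1.0, 1.6]

adv_a = conservative_advantage(q_a, baseline=0.0)
adv_b = conservative_advantage(q_b, baseline=0.0)
preferred = "A" if adv_a > adv_b else "B"
```

In this toy case action B has the higher mean estimate, but the penalty makes action A the preferred one, which illustrates how a conservative objective can avoid exploiting actions whose value is high only under some of the learned models.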

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1911.12574 [cs.LG]
	(or arXiv:1911.12574v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1911.12574

Submission history

From: Jie Wang
[v1] Thu, 28 Nov 2019 07:56:00 UTC (3,540 KB)
