Learning High-Level Policies for Model Predictive Control

Song, Yunlong; Scaramuzza, Davide

doi:10.1109/IROS45743.2020.9340823

Computer Science > Robotics

arXiv:2007.10284 (cs)

[Submitted on 20 Jul 2020 (v1), last revised 9 May 2021 (this version, v2)]

Title:Learning High-Level Policies for Model Predictive Control

Authors:Yunlong Song, Davide Scaramuzza

View PDF

Abstract:The combination of policy search and deep neural networks holds the promise of automating a variety of decision-making tasks. Model Predictive Control (MPC) provides robust solutions to robot control tasks by making use of a dynamical model of the system and solving an optimization problem online over a short planning horizon. In this work, we leverage probabilistic decision-making approaches and the generalization capability of artificial neural networks to the powerful online optimization by learning a deep high-level policy for the MPC (High-MPC). Conditioning on robot's local observations, the trained neural network policy is capable of adaptively selecting high-level decision variables for the low-level MPC controller, which then generates optimal control commands for the robot. First, we formulate the search of high-level decision variables for MPC as a policy search problem, specifically, a probabilistic inference problem. The problem can be solved in a closed-form solution. Second, we propose a self-supervised learning algorithm for learning a neural network high-level policy, which is useful for online hyperparameter adaptations in highly dynamic environments. We demonstrate the importance of incorporating the online adaption into autonomous robots by using the proposed method to solve a challenging control problem, where the task is to control a simulated quadrotor to fly through a swinging gate. We show that our approach can handle situations that are difficult for standard MPC.

Comments:	Accepted for Publication at the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG)
Cite as:	arXiv:2007.10284 [cs.RO]
	(or arXiv:2007.10284v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2007.10284
Journal reference:	IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, 2020
Related DOI:	https://doi.org/10.1109/IROS45743.2020.9340823

Submission history

From: Yunlong Song [view email]
[v1] Mon, 20 Jul 2020 17:12:34 UTC (1,523 KB)
[v2] Sun, 9 May 2021 16:47:53 UTC (1,523 KB)

Computer Science > Robotics

Title:Learning High-Level Policies for Model Predictive Control

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Learning High-Level Policies for Model Predictive Control

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators