Efficient Deep Reinforcement Learning via Adaptive Policy Transfer

Yang, Tianpei; Hao, Jianye; Meng, Zhaopeng; Zhang, Zongzhang; Hu, Yujing; Cheng, Yingfeng; Fan, Changjie; Wang, Weixun; Liu, Wulong; Wang, Zhaodong; Peng, Jiajie

arXiv:2002.08037 (cs)

[Submitted on 19 Feb 2020 (v1), last revised 25 May 2020 (this version, v3)]

Abstract: Transfer Learning (TL) has shown great potential to accelerate Reinforcement Learning (RL) by leveraging prior knowledge from previously learned policies of relevant tasks. Existing transfer approaches either explicitly compute the similarity between tasks or select appropriate source policies to provide guided exploration for the target task. However, a method that directly optimizes the target policy by alternately utilizing knowledge from appropriate source policies, without explicitly measuring similarity, is currently missing. In this paper, we propose a novel Policy Transfer Framework (PTF) to accelerate RL by taking advantage of this idea. Our framework learns when to reuse a source policy, which source policy is best to reuse for the target policy, and when to terminate it, by modeling multi-policy transfer as an option learning problem. PTF can be easily combined with existing deep RL approaches. Experimental results show that it significantly accelerates the learning process and surpasses state-of-the-art policy transfer methods in terms of learning efficiency and final performance in both discrete and continuous action spaces.
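
To make the "option learning" view in the abstract more concrete, the following is a minimal tabular sketch of the generic call-and-return option mechanism, in which each source policy is treated as one option. It is not the authors' PTF implementation: the class name `OptionSelector`, its parameters, and the constant termination probability are all assumptions made for illustration (the abstract states that PTF learns when to terminate, whereas this sketch fixes that probability).

```python
import random


class OptionSelector:
    """Hypothetical sketch: choose among source policies as options."""

    def __init__(self, source_policies, termination_prob=0.2,
                 epsilon=0.1, lr=0.5, gamma=0.9, seed=0):
        self.policies = source_policies   # callables: state -> action
        self.beta = termination_prob      # assumed constant, not learned here
        self.epsilon = epsilon            # exploration rate over options
        self.lr = lr                      # learning rate
        self.gamma = gamma                # discount factor
        self.q = {}                       # (state, option) -> option value
        self.current = None               # index of the active option
        self.rng = random.Random(seed)

    def value(self, state, option):
        return self.q.get((state, option), 0.0)

    def best_option(self, state):
        options = range(len(self.policies))
        return max(options, key=lambda o: self.value(state, o))

    def act(self, state):
        # Re-select when no option is active or the active one terminates.
        if self.current is None or self.rng.random() < self.beta:
            if self.rng.random() < self.epsilon:
                self.current = self.rng.randrange(len(self.policies))
            else:
                self.current = self.best_option(state)
        return self.current, self.policies[self.current](state)

    def update(self, state, option, reward, next_state, done):
        # Target mixes "keep the option" and "switch to the best option",
        # weighted by the termination probability.
        if done:
            target = reward
        else:
            keep = self.value(next_state, option)
            switch = self.value(next_state, self.best_option(next_state))
            target = reward + self.gamma * (
                (1 - self.beta) * keep + self.beta * switch)
        old = self.value(state, option)
        self.q[(state, option)] = old + self.lr * (target - old)
```

In use, an agent would call `act(state)` to obtain the active option and the action its source policy suggests, step the environment, and then call `update(...)` with the observed transition. How the suggested action then shapes the target policy is specific to PTF and is not covered by this sketch.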

Comments:	Accepted by IJCAI'2020
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2002.08037 [cs.LG]
	(or arXiv:2002.08037v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2002.08037

Submission history

From: Tianpei Yang
[v1] Wed, 19 Feb 2020 07:30:57 UTC (1,365 KB)
[v2] Wed, 13 May 2020 08:41:30 UTC (6,043 KB)
[v3] Mon, 25 May 2020 10:21:28 UTC (3,021 KB)
