Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Oh, Junhyuk; Singh, Satinder; Lee, Honglak; Kohli, Pushmeet

Computer Science > Artificial Intelligence

arXiv:1706.05064 (cs)

[Submitted on 15 Jun 2017 (v1), last revised 7 Nov 2017 (this version, v2)]

Title:Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Authors:Junhyuk Oh, Satinder Singh, Honglak Lee, Pushmeet Kohli

View PDF

Abstract:As a step towards developing zero-shot task generalization capabilities in reinforcement learning (RL), we introduce a new RL problem where the agent should learn to execute sequences of instructions after learning useful skills that solve subtasks. In this problem, we consider two types of generalizations: to previously unseen instructions and to longer sequences of instructions. For generalization over unseen instructions, we propose a new objective which encourages learning correspondences between similar subtasks by making analogies. For generalization over sequential instructions, we present a hierarchical architecture where a meta controller learns to use the acquired skills for executing the instructions. To deal with delayed reward, we propose a new neural architecture in the meta controller that learns when to update the subtask, which makes learning more efficient. Experimental results on a stochastic 3D domain show that the proposed ideas are crucial for generalization to longer instructions as well as unseen instructions.

Comments:	ICML 2017
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1706.05064 [cs.AI]
	(or arXiv:1706.05064v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1706.05064

Submission history

From: Junhyuk Oh [view email]
[v1] Thu, 15 Jun 2017 20:04:35 UTC (3,869 KB)
[v2] Tue, 7 Nov 2017 00:37:51 UTC (3,869 KB)

Computer Science > Artificial Intelligence

Title:Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators