Efficient Architecture Search for Continual Learning

Gao, Qiang; Luo, Zhipeng; Klabjan, Diego

Computer Science > Machine Learning

arXiv:2006.04027 (cs)

[Submitted on 7 Jun 2020 (v1), last revised 9 Jun 2020 (this version, v2)]

Abstract: Continual learning with neural networks is an important learning framework in AI that aims to learn a sequence of tasks well. However, it often faces three challenges: (1) overcoming catastrophic forgetting, (2) adapting the current network to new tasks, and (3) controlling model complexity. To address these challenges, we propose a novel approach named Continual Learning with Efficient Architecture Search, or CLEAS for short. CLEAS builds on neural architecture search (NAS), which uses reinforcement learning to search for the neural architecture that best fits a new task. In particular, we design a neuron-level NAS controller that decides which old neurons from previous tasks should be reused (knowledge transfer) and which new neurons should be added (to learn new knowledge). Such a fine-grained controller makes it possible to find a very concise architecture that fits each new task well. Meanwhile, since we do not alter the weights of the reused neurons, the knowledge learned from previous tasks is fully preserved. We evaluate CLEAS on numerous sequential classification tasks, and the results demonstrate that CLEAS outperforms other state-of-the-art methods, achieving higher classification accuracy while using simpler neural architectures.
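
The neuron-level decision described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names (`sample_action`, `expand_layer`, `forward`), the single dense ReLU layer, and the random sampling that stands in for the reinforcement-learning controller are all assumptions made for illustration. The sketch only shows the structural idea that old neurons are either reused or skipped with their weights left untouched, while new neurons are appended and are the only trainable ones.

```python
import random

def sample_action(num_old, max_new, rng):
    """Placeholder for the controller (assumption: random sampling instead of
    a trained RL policy). Returns a reuse mask over old neurons and the
    number of new neurons to add."""
    reuse_mask = [rng.random() < 0.5 for _ in range(num_old)]
    num_new = rng.randint(0, max_new)
    return reuse_mask, num_new

def forward(x, weights, biases, active):
    """Dense ReLU layer evaluated only on the neurons listed in `active`."""
    out = []
    for j in active:
        pre = sum(w * xi for w, xi in zip(weights[j], x)) + biases[j]
        out.append(max(0.0, pre))
    return out

def expand_layer(weights, biases, reuse_mask, num_new, in_dim, rng):
    """Build the layer for a new task: old rows are copied unchanged,
    new rows are appended, and only the new rows are marked trainable."""
    active = [j for j, keep in enumerate(reuse_mask) if keep]
    new_w = [list(row) for row in weights]
    new_b = list(biases)
    for _ in range(num_new):
        new_w.append([rng.uniform(-0.1, 0.1) for _ in range(in_dim)])
        new_b.append(0.0)
        active.append(len(new_w) - 1)
    trainable = list(range(len(weights), len(new_w)))
    return new_w, new_b, active, trainable

rng = random.Random(0)
in_dim = 3
old_w = [[rng.uniform(-1, 1) for _ in range(in_dim)] for _ in range(4)]
old_b = [0.0] * 4
old_active = [0, 1, 2, 3]           # neurons used by the previous task
x = [0.5, -1.0, 2.0]

before = forward(x, old_w, old_b, old_active)
mask, n_new = sample_action(len(old_w), 3, rng)
w2, b2, active2, trainable = expand_layer(old_w, old_b, mask, n_new, in_dim, rng)
after = forward(x, w2, b2, old_active)   # old task evaluated on the grown layer
```

Because the old rows are never modified, `before` and `after` are identical, which is the sense in which reusing neurons without altering their weights preserves earlier tasks. A real controller would be trained rather than sampled at random.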

Comments:	12 pages, 11 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2006.04027 [cs.LG]
	(or arXiv:2006.04027v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.04027

Submission history

From: Qiang Gao
[v1] Sun, 7 Jun 2020 02:59:29 UTC (2,833 KB)
[v2] Tue, 9 Jun 2020 04:54:11 UTC (984 KB)