MetAdapt: Meta-Learned Task-Adaptive Architecture for Few-Shot Classification

Doveh, Sivan; Schwartz, Eli; Xue, Chao; Feris, Rogerio; Bronstein, Alex; Giryes, Raja; Karlinsky, Leonid

Computer Science > Computer Vision and Pattern Recognition

arXiv:1912.00412 (cs)

[Submitted on 1 Dec 2019 (v1), last revised 9 Mar 2020 (this version, v3)]

Title:MetAdapt: Meta-Learned Task-Adaptive Architecture for Few-Shot Classification

Authors:Sivan Doveh, Eli Schwartz, Chao Xue, Rogerio Feris, Alex Bronstein, Raja Giryes, Leonid Karlinsky

View PDF

Abstract:Few-Shot Learning (FSL) is a topic of rapidly growing interest. Typically, in FSL a model is trained on a dataset consisting of many small tasks (meta-tasks) and learns to adapt to novel tasks that it will encounter during test time. This is also referred to as meta-learning. Another topic closely related to meta-learning with a lot of interest in the community is Neural Architecture Search (NAS), automatically finding optimal architecture instead of engineering it manually. In this work, we combine these two aspects of meta-learning. So far, meta-learning FSL methods have focused on optimizing parameters of pre-defined network architectures, in order to make them easily adaptable to novel tasks. Moreover, it was observed that, in general, larger architectures perform better than smaller ones up to a certain saturation point (where they start to degrade due to over-fitting). However, little attention has been given to explicitly optimizing the architectures for FSL, nor to an adaptation of the architecture at test time to particular novel tasks. In this work, we propose to employ tools inspired by the Differentiable Neural Architecture Search (D-NAS) literature in order to optimize the architecture for FSL without over-fitting. Additionally, to make the architecture task adaptive, we propose the concept of `MetAdapt Controller' modules. These modules are added to the model and are meta-trained to predict the optimal network connections for a given novel task. Using the proposed approach we observe state-of-the-art results on two popular few-shot benchmarks: miniImageNet and FC100.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1912.00412 [cs.CV]
	(or arXiv:1912.00412v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.00412

Submission history

From: Eli Schwartz [view email]
[v1] Sun, 1 Dec 2019 14:04:34 UTC (205 KB)
[v2] Tue, 3 Dec 2019 09:27:25 UTC (203 KB)
[v3] Mon, 9 Mar 2020 11:44:12 UTC (289 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MetAdapt: Meta-Learned Task-Adaptive Architecture for Few-Shot Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MetAdapt: Meta-Learned Task-Adaptive Architecture for Few-Shot Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators