Pooling is neither necessary nor sufficient for appropriate deformation stability in CNNs

Ruderman, Avraham; Rabinowitz, Neil C.; Morcos, Ari S.; Zoran, Daniel

Computer Science > Computer Vision and Pattern Recognition

arXiv:1804.04438 (cs)

[Submitted on 12 Apr 2018 (v1), last revised 25 May 2018 (this version, v2)]

Title:Pooling is neither necessary nor sufficient for appropriate deformation stability in CNNs

Authors:Avraham Ruderman, Neil C. Rabinowitz, Ari S. Morcos, Daniel Zoran

View PDF

Abstract:Many of our core assumptions about how neural networks operate remain empirically untested. One common assumption is that convolutional neural networks need to be stable to small translations and deformations to solve image recognition tasks. For many years, this stability was baked into CNN architectures by incorporating interleaved pooling layers. Recently, however, interleaved pooling has largely been abandoned. This raises a number of questions: Are our intuitions about deformation stability right at all? Is it important? Is pooling necessary for deformation invariance? If not, how is deformation invariance achieved in its absence? In this work, we rigorously test these questions, and find that deformation stability in convolutional networks is more nuanced than it first appears: (1) Deformation invariance is not a binary property, but rather that different tasks require different degrees of deformation stability at different layers. (2) Deformation stability is not a fixed property of a network and is heavily adjusted over the course of training, largely through the smoothness of the convolutional filters. (3) Interleaved pooling layers are neither necessary nor sufficient for achieving the optimal form of deformation stability for natural image classification. (4) Pooling confers too much deformation stability for image classification at initialization, and during training, networks have to learn to counteract this inductive bias. Together, these findings provide new insights into the role of interleaved pooling and deformation invariance in CNNs, and demonstrate the importance of rigorous empirical testing of even our most basic assumptions about the working of neural networks.

Comments:	NIPS 2018 submission
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1804.04438 [cs.CV]
	(or arXiv:1804.04438v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1804.04438

Submission history

From: Ari Morcos [view email]
[v1] Thu, 12 Apr 2018 11:44:05 UTC (2,336 KB)
[v2] Fri, 25 May 2018 13:03:50 UTC (6,482 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Pooling is neither necessary nor sufficient for appropriate deformation stability in CNNs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Pooling is neither necessary nor sufficient for appropriate deformation stability in CNNs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators