Compressing 3DCNNs Based on Tensor Train Decomposition

Wang, Dingheng; Zhao, Guangshe; Li, Guoqi; Deng, Lei; Wu, Yang

doi:10.1016/j.neunet.2020.07.028

arXiv:1912.03647 (cs)

[Submitted on 8 Dec 2019 (v1), last revised 11 Aug 2020 (this version, v2)]

Abstract: Three-dimensional convolutional neural networks (3DCNNs) have been applied to many tasks, e.g., video and 3D point cloud recognition. However, because their convolutional kernels have a higher dimension, the space complexity of 3DCNNs is generally larger than that of traditional two-dimensional convolutional neural networks (2DCNNs). Neural network compression is a promising approach to miniaturizing 3DCNNs for deployment in constrained environments such as embedded devices. In this work, we adopt tensor train (TT) decomposition, a straightforward and simple in situ training compression method, to shrink 3DCNN models. By tensorizing 3D convolutional kernels in TT format, we investigate how to select appropriate TT ranks to achieve a higher compression ratio. We also discuss the redundancy of 3D convolutional kernels for compression, the core significance and future directions of this work, and the theoretical computational complexity versus the practical execution time of convolution in TT format. Based on multiple comparative experiments on the VIVA challenge, UCF11, and UCF101 datasets, we conclude that TT decomposition can compress 3DCNNs by around one hundred times without significant accuracy loss, which will enable their application in a wide range of real-world scenarios.
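To make the TT format mentioned in the abstract concrete, the following is a minimal NumPy sketch of the standard TT-SVD procedure and of how TT ranks control the parameter count. It is not the authors' implementation: the kernel shape (64 output channels, 64 input channels, 3x3x3 window), the way the kernel is reshaped into five modes, the rank cap of 4, and the helper names `tt_svd`, `tt_to_full`, and `tt_param_count` are all assumptions chosen for illustration. The abstract describes training directly in TT format, whereas this sketch decomposes an existing tensor only to show what the format looks like.

```python
import numpy as np

def tt_svd(tensor, max_rank):
    """Split `tensor` into TT cores of shape (r_prev, n_k, r_next),
    capping every TT rank at `max_rank` via truncated SVD."""
    dims = tensor.shape
    cores = []
    rank = 1
    mat = tensor.reshape(rank * dims[0], -1)
    for k in range(len(dims) - 1):
        u, s, vt = np.linalg.svd(mat, full_matrices=False)
        new_rank = min(max_rank, len(s))
        cores.append(u[:, :new_rank].reshape(rank, dims[k], new_rank))
        # Carry the remaining factor (singular values times V^T) forward.
        mat = s[:new_rank, None] * vt[:new_rank]
        rank = new_rank
        if k < len(dims) - 2:
            mat = mat.reshape(rank * dims[k + 1], -1)
    cores.append(mat.reshape(rank, dims[-1], 1))
    return cores

def tt_to_full(cores):
    """Contract the TT cores back into a dense tensor."""
    full = cores[0]
    for core in cores[1:]:
        # Contract the trailing rank axis with the next core's leading rank axis.
        full = np.tensordot(full, core, axes=1)
    return full.reshape([core.shape[1] for core in cores])

def tt_param_count(cores):
    """Total number of stored values across all TT cores."""
    return sum(core.size for core in cores)

# Hypothetical 3D convolutional kernel: (out, in, time, height, width).
rng = np.random.default_rng(0)
kernel = rng.standard_normal((64, 64, 3, 3, 3))

# Assumed tensorization: merge the 3x3x3 window into one mode of size 27
# and factor each channel dimension as 64 = 8 * 8.
tensorized = kernel.transpose(2, 3, 4, 0, 1).reshape(27, 8, 8, 8, 8)

cores = tt_svd(tensorized, max_rank=4)
full_params = kernel.size
tt_params = tt_param_count(cores)
ratio = full_params / tt_params

print("core shapes:", [core.shape for core in cores])
print("dense parameters:", full_params)
print("TT parameters:", tt_params)
print("compression ratio: %.1f" % ratio)
```

A random kernel is not low rank, so truncating it this aggressively gives a poor approximation; the example only shows how the storage cost depends on the chosen TT ranks. Raising `max_rank` increases the parameter count and lowers the compression ratio, which is the trade-off behind the rank selection question the abstract raises.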

Comments:	Accepted by Neural Networks. Please see the final version at the DOI below.
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1912.03647 [cs.CV]
	(or arXiv:1912.03647v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.03647
Related DOI:	https://doi.org/10.1016/j.neunet.2020.07.028

Submission history

From: Dingheng Wang
[v1] Sun, 8 Dec 2019 09:51:08 UTC (538 KB)
[v2] Tue, 11 Aug 2020 03:42:21 UTC (751 KB)
