Net2Net: Accelerating Learning via Knowledge Transfer

Chen, Tianqi; Goodfellow, Ian; Shlens, Jonathon

Computer Science > Machine Learning

arXiv:1511.05641 (cs)

[Submitted on 18 Nov 2015 (v1), last revised 23 Apr 2016 (this version, v4)]

Title:Net2Net: Accelerating Learning via Knowledge Transfer

Authors:Tianqi Chen, Ian Goodfellow, Jonathon Shlens

View PDF

Abstract:We introduce techniques for rapidly transferring the information stored in one neural net into another neural net. The main purpose is to accelerate the training of a significantly larger neural net. During real-world workflows, one often trains very many different neural networks during the experimentation and design process. This is a wasteful process in which each new model is trained from scratch. Our Net2Net technique accelerates the experimentation process by instantaneously transferring the knowledge from a previous network to each new deeper or wider network. Our techniques are based on the concept of function-preserving transformations between neural network specifications. This differs from previous approaches to pre-training that altered the function represented by a neural net when adding layers to it. Using our knowledge transfer mechanism to add depth to Inception modules, we demonstrate a new state of the art accuracy rating on the ImageNet dataset.

Comments:	ICLR 2016 submission
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1511.05641 [cs.LG]
	(or arXiv:1511.05641v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1511.05641

Submission history

From: Tianqi Chen [view email]
[v1] Wed, 18 Nov 2015 02:09:20 UTC (209 KB)
[v2] Thu, 19 Nov 2015 19:07:40 UTC (172 KB)
[v3] Thu, 7 Jan 2016 22:54:48 UTC (255 KB)
[v4] Sat, 23 Apr 2016 23:14:39 UTC (391 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2015-11

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Tianqi Chen
Ian J. Goodfellow
Jonathon Shlens

export BibTeX citation

Computer Science > Machine Learning

Title:Net2Net: Accelerating Learning via Knowledge Transfer

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Net2Net: Accelerating Learning via Knowledge Transfer

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators