Modular Networks: Learning to Decompose Neural Computation

Kirsch, Louis; Kunze, Julius; Barber, David

Computer Science > Machine Learning

arXiv:1811.05249 (cs)

[Submitted on 13 Nov 2018]

Title:Modular Networks: Learning to Decompose Neural Computation

Authors:Louis Kirsch, Julius Kunze, David Barber

View PDF

Abstract:Scaling model capacity has been vital in the success of deep learning. For a typical network, necessary compute resources and training time grow dramatically with model size. Conditional computation is a promising way to increase the number of parameters with a relatively small increase in resources. We propose a training algorithm that flexibly chooses neural modules based on the data to be processed. Both the decomposition and modules are learned end-to-end. In contrast to existing approaches, training does not rely on regularization to enforce diversity in module use. We apply modular networks both to image recognition and language modeling tasks, where we achieve superior performance compared to several baselines. Introspection reveals that modules specialize in interpretable contexts.

Comments:	NIPS 2018
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1811.05249 [cs.LG]
	(or arXiv:1811.05249v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1811.05249

Submission history

From: Louis Kirsch [view email]
[v1] Tue, 13 Nov 2018 12:24:23 UTC (668 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-11

Change to browse by:

cs
cs.AI
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Louis Kirsch
Julius Kunze
David Barber

export BibTeX citation

Computer Science > Machine Learning

Title:Modular Networks: Learning to Decompose Neural Computation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Modular Networks: Learning to Decompose Neural Computation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators