Deep Joint Task Learning for Generic Object Extraction

Wang, Xiaolong; Zhang, Liliang; Lin, Liang; Liang, Zhujin; Zuo, Wangmeng

Computer Science > Computer Vision and Pattern Recognition

arXiv:1502.00743 (cs)

[Submitted on 3 Feb 2015]

Title:Deep Joint Task Learning for Generic Object Extraction

Authors:Xiaolong Wang, Liliang Zhang, Liang Lin, Zhujin Liang, Wangmeng Zuo

View PDF

Abstract:This paper investigates how to extract objects-of-interest without relying on hand-craft features and sliding windows approaches, that aims to jointly solve two sub-tasks: (i) rapidly localizing salient objects from images, and (ii) accurately segmenting the objects based on the localizations. We present a general joint task learning framework, in which each task (either object localization or object segmentation) is tackled via a multi-layer convolutional neural network, and the two networks work collaboratively to boost performance. In particular, we propose to incorporate latent variables bridging the two networks in a joint optimization manner. The first network directly predicts the positions and scales of salient objects from raw images, and the latent variables adjust the object localizations to feed the second network that produces pixelwise object masks. An EM-type method is presented for the optimization, iterating with two steps: (i) by using the two networks, it estimates the latent variables by employing an MCMC-based sampling method; (ii) it optimizes the parameters of the two networks unitedly via back propagation, with the fixed latent variables. Extensive experiments suggest that our framework significantly outperforms other state-of-the-art approaches in both accuracy and efficiency (e.g. 1000 times faster than competing approaches).

Comments:	9 pages, 4 figures, NIPS 2014
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
MSC classes:	68U01
Cite as:	arXiv:1502.00743 [cs.CV]
	(or arXiv:1502.00743v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1502.00743
Journal reference:	Advances in Neural Information Processing Systems (pp. 523-531), 2014

Submission history

From: Liang Lin [view email]
[v1] Tue, 3 Feb 2015 05:35:09 UTC (309 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Joint Task Learning for Generic Object Extraction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Joint Task Learning for Generic Object Extraction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators