Deep Supervision with Intermediate Concepts

Li, Chi; Zia, M. Zeeshan; Tran, Quoc-Huy; Yu, Xiang; Hager, Gregory D.; Chandraker, Manmohan

Computer Science > Computer Vision and Pattern Recognition

arXiv:1801.03399 (cs)

[Submitted on 8 Jan 2018 (v1), last revised 20 Jul 2018 (this version, v2)]

Title:Deep Supervision with Intermediate Concepts

Authors:Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D.Hager, Manmohan Chandraker

View PDF

Abstract:Recent data-driven approaches to scene interpretation predominantly pose inference as an end-to-end black-box mapping, commonly performed by a Convolutional Neural Network (CNN). However, decades of work on perceptual organization in both human and machine vision suggests that there are often intermediate representations that are intrinsic to an inference task, and which provide essential structure to improve generalization. In this work, we explore an approach for injecting prior domain structure into neural network training by supervising hidden layers of a CNN with intermediate concepts that normally are not observed in practice. We formulate a probabilistic framework which formalizes these notions and predicts improved generalization via this deep supervision method. One advantage of this approach is that we are able to train only from synthetic CAD renderings of cluttered scenes, where concept values can be extracted, but apply the results to real images. Our implementation achieves the state-of-the-art performance of 2D/3D keypoint localization and image classification on real image benchmarks, including KITTI, PASCAL VOC, PASCAL3D+, IKEA, and CIFAR100. We provide additional evidence that our approach outperforms alternative forms of supervision, such as multi-task networks.

Comments:	Submitted to TPAMI, first revision. arXiv admin note: text overlap with arXiv:1612.02699
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1801.03399 [cs.CV]
	(or arXiv:1801.03399v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1801.03399

Submission history

From: Chi Li [view email]
[v1] Mon, 8 Jan 2018 22:00:48 UTC (6,810 KB)
[v2] Fri, 20 Jul 2018 04:10:35 UTC (6,810 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Supervision with Intermediate Concepts

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Supervision with Intermediate Concepts

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators