Pri3D: Can 3D Priors Help 2D Representation Learning?

Hou, Ji; Xie, Saining; Graham, Benjamin; Dai, Angela; Nießner, Matthias

Computer Science > Computer Vision and Pattern Recognition

arXiv:2104.11225 (cs)

[Submitted on 22 Apr 2021 (v1), last revised 18 Dec 2021 (this version, v3)]

Title:Pri3D: Can 3D Priors Help 2D Representation Learning?

Authors:Ji Hou, Saining Xie, Benjamin Graham, Angela Dai, Matthias Nießner

View PDF

Abstract:Recent advances in 3D perception have shown impressive progress in understanding geometric structures of 3Dshapes and even scenes. Inspired by these advances in geometric understanding, we aim to imbue image-based perception with representations learned under geometric constraints. We introduce an approach to learn view-invariant,geometry-aware representations for network pre-training, based on multi-view RGB-D data, that can then be effectively transferred to downstream 2D tasks. We propose to employ contrastive learning under both multi-view im-age constraints and image-geometry constraints to encode3D priors into learned 2D representations. This results not only in improvement over 2D-only representation learning on the image-based tasks of semantic segmentation, instance segmentation, and object detection on real-world in-door datasets, but moreover, provides significant improvement in the low data regime. We show a significant improvement of 6.0% on semantic segmentation on full data as well as 11.9% on 20% data against baselines on ScanNet.

Comments:	ICCV 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2104.11225 [cs.CV]
	(or arXiv:2104.11225v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2104.11225

Submission history

From: Ji Hou [view email]
[v1] Thu, 22 Apr 2021 17:59:30 UTC (4,406 KB)
[v2] Tue, 24 Aug 2021 18:01:36 UTC (2,900 KB)
[v3] Sat, 18 Dec 2021 17:15:57 UTC (4,510 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2021-04

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Ji Hou
Saining Xie
Benjamin Graham
Angela Dai
Matthias Nießner

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Pri3D: Can 3D Priors Help 2D Representation Learning?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Pri3D: Can 3D Priors Help 2D Representation Learning?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators