Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles

Srivastava, Siddharth; Jurie, Frederic; Sharma, Gaurav

arXiv:1904.08494 (cs)

[Submitted on 27 Mar 2019 (v1), last revised 11 Oct 2019 (this version, v2)]

Abstract: We address the problem of 3D object detection from 2D monocular images in autonomous driving scenarios. We propose to lift the 2D images to 3D representations using learned neural networks and to leverage existing networks that work directly on 3D data to perform 3D object detection and localization. We show that, with a carefully designed training mechanism and automatically selected minimally noisy data, such a method is not only feasible but also gives better results than many methods working on actual 3D inputs acquired from physical sensors. On the challenging KITTI benchmark, we show that our 2D to 3D lifted method outperforms many recent competitive 3D networks while significantly outperforming the previous state of the art for 3D detection from monocular images. We also show that a late fusion of the output of the network trained on generated 3D images with that of the network trained on real 3D images improves performance. We find the results very interesting and argue that such a method could serve as a highly reliable backup in case of malfunction of expensive 3D sensors, if not potentially making them redundant, at least in low human injury risk autonomous navigation scenarios such as warehouse automation.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1904.08494 [cs.CV]
	(or arXiv:1904.08494v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1904.08494

Submission history

From: Siddharth Srivastava
[v1] Wed, 27 Mar 2019 14:59:40 UTC (3,804 KB)
[v2] Fri, 11 Oct 2019 07:02:34 UTC (5,081 KB)
