Driving in the Matrix: Can Virtual Worlds Replace Human-Generated Annotations for Real World Tasks?

Johnson-Roberson, Matthew; Barto, Charles; Mehta, Rounak; Sridhar, Sharath Nittur; Rosaen, Karl; Vasudevan, Ram

arXiv:1610.01983 (cs)

[Submitted on 6 Oct 2016 (v1), last revised 25 Feb 2017 (this version, v2)]

Abstract: Deep learning has rapidly transformed the state-of-the-art algorithms used to address a variety of problems in computer vision and robotics. These breakthroughs have relied upon massive amounts of human-annotated training data. This time-consuming annotation process has begun impeding the progress of these deep learning efforts. This paper describes a method to incorporate photo-realistic computer images from a simulation engine to rapidly generate annotated data that can be used for the training of machine learning algorithms. We demonstrate that a state-of-the-art architecture, which is trained only using these synthetic annotations, performs better than the identical architecture trained on human-annotated real-world data, when tested on the KITTI data set for vehicle detection. By training machine learning algorithms on a rich virtual world, real objects in real scenes can be learned and classified using synthetic data. This approach offers the possibility of accelerating deep learning's application to sensor-based classification problems like those that appear in self-driving cars. The source code and data to train and validate the networks described in this paper are made available for researchers.

Comments:	Proceedings of International Conference on Robotics and Automation (ICRA) 2017, 8 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:1610.01983 [cs.CV]
	(or arXiv:1610.01983v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1610.01983

Submission history

From: Matthew Johnson-Roberson
[v1] Thu, 6 Oct 2016 18:26:43 UTC (8,066 KB)
[v2] Sat, 25 Feb 2017 13:20:49 UTC (4,064 KB)
