Single image based three-dimensional scene reconstruction using semantic and geometric priors

GJ Yoon, J Song, YJ Hong, SM Yoon - Neural Processing Letters, 2022 - Springer
GJ Yoon, J Song, YJ Hong, SM Yoon
Neural Processing Letters, 2022Springer
Single image based three-dimensional (3D) scene reconstruction has become an important
research topic for computer vision and computer graphics fields to provide machine vision
systems with near human visual perception. Previous approaches for 3D scene
reconstruction and depth estimation from single images required many factors including
motion parallax, stereoscopic parallels, and various monocular depth cues adopted from
known geometric priors. Deep learning based depth estimation techniques have advanced …
Abstract
Single image based three-dimensional (3D) scene reconstruction has become an important research topic for computer vision and computer graphics fields to provide machine vision systems with near human visual perception. Previous approaches for 3D scene reconstruction and depth estimation from single images required many factors including motion parallax, stereoscopic parallels, and various monocular depth cues adopted from known geometric priors. Deep learning based depth estimation techniques have advanced single image based depth estimation by aggregating various complexity information from RGB depth image datasets for training images to drive the process. This paper proposes an effective 3D scene estimation methodology by automatically extracting vanishing point and semantic information including 3D geometric characteristics without prior assumptions. The vanishing point is extracted from line segments and minimum spanning tree clustering to remove spurious noisy edges. Retracting geometric and semantic information from a given image is achieved by a generative adversarial network trained on the created training set. We verified the proposed approach’s efficiency and effectiveness experimentally with a large database created by directly recovering 3D scenes from from an input image.
Springer
Showing the best result for this search. See all results