Multilevel feature fusion and edge optimization network for self-supervised monocular depth estimation

Guohua Liu; Shuqing Niu

doi:10.1117/1.JEI.31.3.033027

Journal of Electronic Imaging, Vol. 31, Issue 3, 033027 (June 2022). https://doi.org/10.1117/1.JEI.31.3.033027

Abstract

Monocular depth estimation is an essential step in understanding scene geometry. However, depth maps predicted by existing methods tend to lose the details of small targets and to blur object edges. To address this, we propose a monocular depth estimation method based on multilevel feature fusion and edge optimization that produces depth maps with rich detail and precise edges. First, we improve the encoder so that it is better adapted to the depth estimation task. Second, we fill the U-shaped network with a dense feature fusion layer (Dense-FL) to capture global context information and combine it with the proposed structure attention module to further enhance the extracted features, so that the depth map contains more detail. Finally, we design an edge optimization module that incorporates edge features into the training process, and we constrain the network with the proposed reweighted loss and image edge detail loss to further improve the model's ability to learn object edges. Experimental results on the KITTI dataset show that our method performs better on small targets and at object edges, and that the predicted object structures are more complete. Generalization experiments on the Cityscapes and Sceneflow datasets further confirm the effectiveness and superiority of the proposed method.
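The abstract does not define the image edge detail loss, so the following is only a generic illustration of how an edge-based penalty can be formed, not the authors' formulation. It assumes 3x3 Sobel filtering and a mean absolute difference between edge maps; the function names `sobel_magnitude` and `edge_l1_loss` are hypothetical.

```python
import numpy as np

def sobel_magnitude(img):
    """Gradient magnitude of a 2-D array using 3x3 Sobel kernels (edge-padded)."""
    p = np.pad(np.asarray(img, dtype=np.float64), 1, mode="edge")
    # Horizontal derivative: right neighbors minus left neighbors, weighted 1-2-1.
    gx = (p[:-2, 2:] + 2 * p[1:-1, 2:] + p[2:, 2:]) \
       - (p[:-2, :-2] + 2 * p[1:-1, :-2] + p[2:, :-2])
    # Vertical derivative: lower neighbors minus upper neighbors, weighted 1-2-1.
    gy = (p[2:, :-2] + 2 * p[2:, 1:-1] + p[2:, 2:]) \
       - (p[:-2, :-2] + 2 * p[:-2, 1:-1] + p[:-2, 2:])
    return np.sqrt(gx ** 2 + gy ** 2)

def edge_l1_loss(pred, target):
    """Mean absolute difference between the Sobel edge maps of two arrays.

    Assumption: this stands in for an edge-aware penalty in general;
    it is not the loss proposed in the paper.
    """
    return float(np.mean(np.abs(sobel_magnitude(pred) - sobel_magnitude(target))))

# Toy example: a vertical step edge versus a flat map.
step = np.zeros((4, 4))
step[:, 2:] = 1.0
flat = np.zeros((4, 4))
loss_same = edge_l1_loss(step, step)   # identical edge maps
loss_diff = edge_l1_loss(step, flat)   # edge present in one map only
```

A penalty of this kind is zero when both maps have the same edge structure and grows when an edge appears in only one of them, which is the general behavior an edge-focused training constraint relies on.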

Citation

Guohua Liu and Shuqing Niu "Multilevel feature fusion and edge optimization network for self-supervised monocular depth estimation," Journal of Electronic Imaging 31(3), 033027 (6 June 2022). https://doi.org/10.1117/1.JEI.31.3.033027

Received: 13 January 2022; Accepted: 23 May 2022; Published: 6 June 2022

KEYWORDS

Convolution

Image fusion

Image processing

Computer programming

Cameras

Image restoration

RGB color model
