DesNet: Decomposed Scale-Consistent Network for Unsupervised Depth Completion

Authors

  • Zhiqiang Yan Nanjing University of Science and Tenchnology
  • Kun Wang Nanjing University of Science and Technology
  • Xiang Li Nanjing University of Science and Technology
  • Zhenyu Zhang Nanjing University of Science and Technology
  • Jun Li Nanjing University of Science and Technology
  • Jian Yang Nanjing University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v37i3.25415

Keywords:

CV: Multi-modal Vision, CV: Vision for Robotics & Autonomous Driving

Abstract

Unsupervised depth completion aims to recover dense depth from the sparse one without using the ground-truth annotation. Although depth measurement obtained from LiDAR is usually sparse, it contains valid and real distance information, i.e., scale-consistent absolute depth values. Meanwhile, scale-agnostic counterparts seek to estimate relative depth and have achieved impressive performance. To leverage both the inherent characteristics, we thus suggest to model scale-consistent depth upon unsupervised scale-agnostic frameworks. Specifically, we propose the decomposed scale-consistent learning (DSCL) strategy, which disintegrates the absolute depth into relative depth prediction and global scale estimation, contributing to individual learning benefits. But unfortunately, most existing unsupervised scale-agnostic frameworks heavily suffer from depth holes due to the extremely sparse depth input and weak supervisory signal. To tackle this issue, we introduce the global depth guidance (GDG) module, which attentively propagates dense depth reference into the sparse target via novel dense-to-sparse attention. Extensive experiments show the superiority of our method on outdoor KITTI, ranking 1st and outperforming the best KBNet more than 12% in RMSE. Additionally, our approach achieves state-of-the-art performance on indoor NYUv2 benchmark as well.

Downloads

Published

2023-06-26

How to Cite

Yan, Z., Wang, K., Li, X., Zhang, Z., Li, J., & Yang, J. (2023). DesNet: Decomposed Scale-Consistent Network for Unsupervised Depth Completion. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 3109-3117. https://doi.org/10.1609/aaai.v37i3.25415

Issue

Section

AAAI Technical Track on Computer Vision III