Expediting Building Footprint Extraction from High-resolution Remote Sensing Images via progressive lenient supervision

Guo, Haonan; Du, Bo; Wu, Chen; Su, Xin; Zhang, Liangpei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2307.12220 (cs)

[Submitted on 23 Jul 2023 (v1), last revised 10 Apr 2024 (this version, v2)]

Title:Expediting Building Footprint Extraction from High-resolution Remote Sensing Images via progressive lenient supervision

Authors:Haonan Guo, Bo Du, Chen Wu, Xin Su, Liangpei Zhang

View PDF

Abstract:The efficacy of building footprint segmentation from remotely sensed images has been hindered by model transfer effectiveness. Many existing building segmentation methods were developed upon the encoder-decoder architecture of U-Net, in which the encoder is finetuned from the newly developed backbone networks that are pre-trained on ImageNet. However, the heavy computational burden of the existing decoder designs hampers the successful transfer of these modern encoder networks to remote sensing tasks. Even the widely-adopted deep supervision strategy fails to mitigate these challenges due to its invalid loss in hybrid regions where foreground and background pixels are intermixed. In this paper, we conduct a comprehensive evaluation of existing decoder network designs for building footprint segmentation and propose an efficient framework denoted as BFSeg to enhance learning efficiency and effectiveness. Specifically, a densely-connected coarse-to-fine feature fusion decoder network that facilitates easy and fast feature fusion across scales is proposed. Moreover, considering the invalidity of hybrid regions in the down-sampled ground truth during the deep supervision process, we present a lenient deep supervision and distillation strategy that enables the network to learn proper knowledge from deep supervision. Building upon these advancements, we have developed a new family of building segmentation networks, which consistently surpass prior works with outstanding performance and efficiency across a wide range of newly developed encoder networks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2307.12220 [cs.CV]
	(or arXiv:2307.12220v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2307.12220

Submission history

From: Haonan Guo [view email]
[v1] Sun, 23 Jul 2023 03:55:13 UTC (1,728 KB)
[v2] Wed, 10 Apr 2024 13:15:41 UTC (1,824 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Expediting Building Footprint Extraction from High-resolution Remote Sensing Images via progressive lenient supervision

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Expediting Building Footprint Extraction from High-resolution Remote Sensing Images via progressive lenient supervision

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators