Next Article in Journal
Optimization of a Biomass-Based Power and Fresh Water-Generation System by Machine Learning Using Thermoeconomic Assessment
Previous Article in Journal
An Optimization Strategy for EV-Integrated Microgrids Considering Peer-to-Peer Transactions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Investigation, Evaluation, and Dynamic Monitoring of Traditional Chinese Village Buildings Based on Unmanned Aerial Vehicle Images and Deep Learning Methods

1
School of Civil Engineering and Architecture, University of Jinan, Jinan 250022, China
2
Institute of Regional Science, Karlsruhe Institute of Technology, 76131 Karlsruhe, Germany
3
School of Information Science and Engineering, University of Jinan, Jinan 250022, China
*
Author to whom correspondence should be addressed.
Sustainability 2024, 16(20), 8954; https://doi.org/10.3390/su16208954
Submission received: 2 September 2024 / Revised: 23 September 2024 / Accepted: 9 October 2024 / Published: 16 October 2024
(This article belongs to the Section Sustainable Urban and Rural Development)

Abstract

:
The investigation, evaluation, and dynamic monitoring of traditional village buildings are crucial for the protection and inheritance of their architectural styles. This study takes traditional villages in Shandong Province, China, as an example, employing UAV images and deep learning technology. Utilizing the YOLOv8 instance segmentation model, it introduces three key features reflecting the condition of traditional village buildings: roof status, roof form, and courtyard vegetation coverage. By extracting feature data on the condition of traditional village buildings and constructing a transition matrix for building condition changes, combined with corresponding manual judgment assistance, the study classifies, counts, and visually outputs the conditions and changes of buildings. This approach enables the investigation, evaluation, and dynamic monitoring of traditional village buildings. The results show that deep learning technology significantly enhances the efficiency and accuracy of traditional village architectural investigation and evaluations, and it performs well in dynamic monitoring of building condition changes. The “UAV image + deep learning” technical system, with its simplicity, accuracy, efficiency, and low cost, can provide further data and technical support for the planning, protection supervision, and development strategy formulation of traditional Chinese villages.

1. Introduction

1.1. Background

Traditional Chinese villages harbor a wealth of historical information and cultural landscapes. Spatial features serve as the spatial carriers that display and bear the historical and cultural characteristics of these villages. Protecting and inheriting the spatial features and the cultural factors they contain is essential to preserve the roots of culture and better achieve the synergy of conservation and development [1]. Architecture is the most direct reflection of the spatial features and regional characteristics of traditional villages and is also the spatial element that changes most rapidly and is most susceptible to external influences. Currently, the integrity and sustainability of architectural styles in traditional Chinese villages are prominent issues, mainly manifested in two aspects: first, a large number of traditional buildings suffer from continuous damage due to long-term vacancy and lack of maintenance; second, unorganized and unconstrained construction activities in villages cause constructive destruction to the architectural style. Investigating and evaluating traditional village buildings and dynamically monitoring changes are crucial prerequisites and means for the protection and inheritance of their architectural styles. Current traditional methods of building investigation, evaluation, and dynamic monitoring rely heavily on manual labor, which not only consumes a significant amount of time and economic costs but also has considerable issues with data accuracy and the continuity of monitoring. How to improve the timeliness of traditional village building information acquisition and reduce costs to achieve continuous, dynamic, and real-time assessment and change monitoring is a question worth exploring.
With technological advancements, artificial intelligence is increasingly applied to the field of cultural heritage protection. Image recognition technology based on deep learning provides an effective method for identifying and extracting spatial features of traditional villages [2,3]. On one hand, data mining, identifying, and extracting traditional village spatial feature information that does not rely on subjective judgment, and analyzing the underlying patterns behind the spatial forms of traditional villages, can provide scientific data support for their protection. On the other hand, convenient long-term acquisition of spatial feature data can provide an effective means for monitoring traditional village protection activities and construction behaviors.
This study, combined with the practice of traditional village protection and utilization, takes unmanned aerial vehicle (UAV) images as the data foundation and applies deep learning technology to the identification of traditional village architectural features. Through deep learning model training, it extracts key feature information that reflects the condition of traditional village buildings, constructing an efficient, low-cost technical method for the investigation, evaluation, and dynamic monitoring of traditional village buildings, further enriching and deepening the connotation of research on traditional Chinese villages.

1.2. Related Work

The architectural investigation, evaluation, and change monitoring of traditional villages involve assessing the condition of buildings and capturing their changes through continuous observation to determine the extent of these changes and their impact on the protection and development of traditional villages. Western countries began attempting to monitor and plan for the protection of cultural heritage in the 1970s [4], while China only started incorporating construction development monitoring into urban planning and design information systems at the beginning of the 21st century [5]. With the application of digital technology in urban construction [6], heritage conservation [7], monitoring, and early warning [8], digital monitoring has become a major focus in research and practice. Current research primarily uses high-resolution remote sensing satellite images [9], terrestrial 3D laser scanning [10], digital low-altitude photogrammetry [11], and UAV sensors [12]; it is applied in fields such as national spatial development and ecological protection [13], cultural heritage site conservation [14], resource and environmental carrying capacity assessment [15], structural safety monitoring of architectural heritage [16], and disaster prevention [17]. Currently, there is a scarcity of research in China specifically dedicated to the spatial survey, evaluation, and change monitoring of traditional villages. The existing studies on the spatial survey and evaluation of traditional villages predominantly focus on cultural landscapes [18], heritage resources [19], and specific landscape elements such as soundscapes [20] and color landscapes [21]. These studies typically employ traditional techniques and methods, including survey statistics, spatial syntax, and topological analysis, which are deficient in providing a systematic and quantitative analysis of traditional village spaces using modern tools. The monitoring research is even more limited, with the few available research outcomes concentrating on the monitoring and early warning of the impact of tourism development on the protection of traditional village spaces [22] and the construction of monitoring and early warning systems for traditional villages [23,24]. The field of dynamic monitoring research aimed at the protection and development of traditional villages still lacks a novel holistic perspective and intelligent technologies.
The application of deep learning technology in the protection of traditional villages is becoming increasingly widespread, with researchers conducting in-depth studies on the classification of traditional village landscapes [25], landscape evaluation [26], spatial texture recognition [27], and classification and quantification of residential buildings [28]. Current research on the identification of architectural features in traditional Chinese villages mainly focuses on three aspects: First, the optimization of deep learning algorithm models and the improvement of recognition accuracy. Hang Xue et al. [29] introduced a multi-scale fusion and detail enhancement network MAD-UNet for the effective extraction of rural buildings. Feng Fan et al. [30] enhanced the extraction capability of multi-scale buildings based on a multi-input multi-output and multi-feature fusion fully convolutional network. Chen Xuejiao et al. [31] studied methods for extracting buildings from high-resolution remote sensing images using a densely connected feature pyramid fusion network. By continuously optimizing algorithm models, the accuracy and efficiency of building identification are improved. Second, research on the selection of extracted features explores common architectural feature recognition indicators, including color [32], spectrum [33], texture [34], shape [35], shadow [36], and semantics, designing effective recognition feature systems to better describe buildings. Third, the specific application of semantic segmentation and instance segmentation algorithms is explored, with studies investigating the extraction of buildings from remote sensing and UAV images using improved U-NET [37], HF-FCN [38], Mask R-CNN [39], and Mask Scoring R-CNN [40] semantic segmentation and instance segmentation algorithm models, with instance segmentation algorithm models showing better applicability for the identification of buildings in rural areas.
Deep learning, with its capacity for leveraging vast amounts of data and computational power, has demonstrated significant technological advantages. Utilizing high-resolution satellite remote sensing imagery and UAV imagery, deep learning techniques have been employed by researchers for extensive studies on rapid detection of urban building targets [41], precise identification and extraction of buildings [42], automatic recognition of urban unauthorized constructions [43], post-earthquake damage identification of buildings [44,45], structural classification of buildings [46], and analysis of architectural features [47]. On this basis, researchers have also explored the application of deep learning technology in the field of urban community spatial planning [48], establishing it as a vital technical instrument and research direction in urban and rural spatial planning. Conversely, the application of deep learning technology in the domain of traditional Chinese villages remains relatively scarce, with only preliminary discussions on the intelligent recognition of rural buildings [49] and the automatic classification of rural architectural features [50]. Research on the identification of spatial characteristics and dynamic monitoring of traditional villages is essentially non-existent. Given the vast number of traditional Chinese villages and the urgent tasks of protection and development they face, it is imperative to undertake adaptive and innovative explorations based on existing research and practices. This study focuses on the application pathways of deep learning in the protection and development of traditional Chinese villages, aiming to further expand the practical application domains of deep learning.

1.3. Characteristics for Architectural Condition Recognition and Assessment

From the perspective of image recognition, the identification of architectural features based on UAV orthoimages primarily encompasses three aspects: the condition of building roofs (intact and damaged), roof forms, and the coverage status of courtyard vegetation [51]. Through summarization and refinement, in the traditional villages of the Shandong region, the forms of building roofs are categorized into two major types: double-sloped roofs and flat roofs. The double-sloped roofs can be further refined into three subtypes: red tile, gray tile, and colored steel tile (as shown in Figure 1). Additionally, influenced by traditional concepts such as Feng Shui, evergreen coniferous species are rarely planted in rural courtyards in the Shandong region, with a preference for more practical fruit trees. The courtyard vegetation exhibits distinct coverage characteristics during the growing season. Based on the aforementioned classification criteria, the building conditions can be further divided into six image recognition features, which serve as the basis for deep learning-based image recognition (see the category division in Figure 2).

1.4. Dynamic Monitoring of Building Conditions and Assessment Criteria

Through extensive field research combined with relevant literature [52,53] and protection and utilization requirements [54,55], the conditions and changes of traditional village buildings primarily include forms such as abandonment, damage, restoration, rehabilitation, new construction, demolition, and remaining unchanged. These forms constitute the main content of the dynamic monitoring of changes in the conditions of traditional village buildings. Based on the summary of research data from traditional villages in Shandong Province, the aforementioned monitoring content can be discerned through the changes in roof condition, roof form and color, courtyard vegetation coverage, and their interconversion relationships.
The characteristics and criteria for distinguishing the changes in the conditions of traditional village buildings are as follows:
  • Abandonment: The residential status shifts from occupied to unoccupied. This can be identified by observing the condition of courtyard vegetation and its changes; unoccupied courtyards exhibit irregular, large areas covered with weeds and vines.
  • Damage: Mainly manifested by partial or complete destruction of the roof, which can be identified by changes in the roof’s condition.
  • Restoration: Undertaken to preserve the existing condition of a building or to appropriately restore it to its original state, this involves repair, reinforcement, maintenance, and improvement works with an emphasis on maintaining and restoring the building’s original appearance and historical characteristics. In traditional village architecture, restoration focuses on reviving the traditional materials and styles of the building’s roof and facade. Therefore, it can be identified through observations of changes in the roof’s condition, form, and color.
  • Rehabilitation: Conducted to enable a building to be reused or to exhibit its original functions and value, this work emphasizes the restoration of the building’s original functionality and utility. Traditional building materials and styles may not be used. It can be identified through observations of changes in the roof’s condition, form, and color.
  • New construction: Construction activities on vacant land during the monitoring period, which can be identified by the emergence of a roof where there was none before.
  • Demolition: The removal of existing buildings to create vacant land, which can be identified by the disappearance of a roof where there was one before.
  • Unchanged: Well-preserved buildings that have not undergone significant changes in their condition during the monitoring period, which can be identified by the stability of the roof’s condition.

2. Materials and Methods

The overall workflow and main procedures are illustrated in Figure 2. It contains three major steps: deep learning model training, architectural feature recognition and extraction, and dynamic monitoring and visualization output.
Initially, fundamental data was amassed via remote sensing images, field investigations, and UAV images, followed by the development of deep learning models. Subsequently, these models were deployed to identify and extract architectural features from the sample villages, leveraging image feature extraction to inform subsequent dynamic monitoring. Utilizing the extracted data, in conjunction with manual discrimination, the state of buildings and their transformations were categorized, statistically analyzed, and visually presented, facilitating the assessment and dynamic monitoring of traditional village buildings.

2.1. Study Area and Data Collection

2.1.1. Study Area

The construction of deep learning model data in this study took traditional villages in Shandong Province, China, as samples. Shandong Province currently boasts 168 traditional Chinese villages. The geographical features of the coexistence of land and sea, as well as the rich terrain and landforms in Shandong Province, and the profound cultural heritage formed by the integration of diverse cultures, such as Qilu culture and marine culture, have nurtured traditional villages with diverse types and distinct regional characteristics. These villages are categorized into five spatial typologies: the Eastern Shandong Peninsula, Central Shandong Mountainous Region, Southern Shandong Hills, Southwest Shandong Plain, and Northern Shandong Plain, making the province a significant distribution area and typical examples of traditional Chinese villages.

2.1.2. UAV Photography

UAV images offer a higher planar data acquisition rate for upper areas such as building roofs, enabling the collection of multi-directional data information, and serving as an effective method for the investigation, evaluation, and change monitoring of traditional village architecture. The research team conducted field investigations of traditional villages within the jurisdiction of Shandong Province. Considering the limitations of the accuracy of remotely sensed images available to the public and the complexity of the traditional village architectural environment, this study utilized UAV photography to obtain orthoimages of 168 traditional Chinese villages. These images are used as the data source for the training set of the deep learning model, and the image data acquisition process is depicted in Figure 3.

2.2. Deep Learning-Based Feature Extraction Model

2.2.1. Building Feature Extraction by YOLOv8

Currently, deep learning models applied to image recognition are primarily divided into two major categories: one-stage algorithm models represented by YOLO [56] and SSD [57], and two-stage algorithm models represented by Mask R-CNN [58] and Faster R-CNN [59]. Compared to two-stage algorithm models, one-stage algorithm models can directly classify samples without the need for generating region proposal boxes, obtaining recognition results in a single phase, thus demonstrating superior advantages in terms of speed and efficiency.
Among the one-stage algorithm models, the YOLO series is a typical example. With the algorithm optimization of the YOLOv8 model [60], particularly in processing speed and data capacity, it has shown excellent performance in building image analysis tasks. In the preliminary model selection phase of this study, for the recognition of rural building features, YOLOv8 demonstrated good reliability and accuracy; therefore, this study adopts the instance segmentation model of YOLOv8 as the basic model for the study.
YOLOv8 is a version within the object detection and image segmentation series models developed by Ultralytics. YOLOv8 enhances the network’s gradient flow by introducing the C2f module in place of the traditional C3 structure [61]. It incorporates the Mosaic technique during the data augmentation phase to enrich the diversity of image backgrounds and simultaneously turns off this enhancement during the later stages of training to stabilize model convergence [62], thereby improving the model’s learning capabilities. In terms of loss functions, YOLOv8′s detection head employs CIoU and DFL, while the classification head uses the binary cross-entropy (BCE) loss function, further enhancing the model’s performance [63]. To accommodate varying data capacity requirements, YOLOv8 fine-tunes the model’s channel configuration, offering five different versions: n, s, m, l, and x [64]. Considering the data set capacity of this study, YOLOv8n was selected as the specific model for research.
This study utilized the YOLOv8n instance segmentation model, based on the high-resolution UAV orthoimagery obtained, and through the comparison of multi-period image data recognition results, combined with manual assistance, to determine the changes in the state of buildings, identify the impacts of these changes, and accordingly conduct investigations, evaluations, and dynamic monitoring of traditional village buildings.

2.2.2. Data Preprocess

To meet the requirements for sample size and image resolution during the training phase of the YOLOv8 model, the output accuracy of the collected orthoimages was set to 10 cm. Subsequently, the original orthoimagery was cropped into 640*640 pixel images and different types of buildings and courtyards in the orthoimages were semantically segmented and annotated using the labeling software Labelme 2014 (Figure 4).
To achieve image enhancement and expansion of the training dataset, avoid unnecessary loss of image resolution during processing, and enhance the model’s generalization ability, the following data augmentation measures were taken: First, horizontal flipping was performed on each input image (Figure 5a); Second, random rotation was applied to the images (Figure 5b), with the angle varying between −15° and +15° to reflect the possible orientations of buildings in actual scenes; Third, blur effects were introduced (Figure 5c), with a maximum blur radius of 2.5 pixels, to mimic image blur caused by camera shake or rapid movement. Through the enhancement of the sample dataset, an average of three variants was generated for each original image, expanding the scale of the training set and improving the quality of the model’s training dataset through augmentation measures.

2.2.3. Model Training and Evaluation

This study employed the YOLOv8n-seg.pt model for instance segmentation of buildings and courtyards in orthoimagery, utilizing the deep learning framework of PyTorch 1.1.2, coupled with CUDA 11.7 for GPU-accelerated optimization during model training. To ensure the quality of input data, the image size was uniformly set to 640*640 pixels during training, and the training epochs were set to 15, 30, 60, 100, 200, and 500 epochs to thoroughly train the model and achieve stable performance.
To assess the model’s stability, three performance metrics are introduced: Precision, Recall, and Mean Average Precision (Table 1). Precision is used to evaluate the proportion of correctly predicted positive samples by the model; Recall is used to assess the proportion of all true positive samples that the model can identify; Mean Average Precision represents the average accuracy of predictions across multiple classes, used to evaluate the quality of the model’s algorithm. It can be divided into two metrics, mAP50 and mAP50-95. The mAP50 metric indicates the average precision at a 50% Intersection over Union (IoU) threshold, which is the ratio of the intersection area to the union area between the predicted region and the true region. The mAP50-95 metric indicates the average precision across the 50–95% IoU threshold range.
Following the training process described previously, and in conjunction with the comparison of the loss function across different training epochs (Figure 6), it can be observed that after 30 epochs, the loss function metric becomes stable, indicating effective model fitting. After 100 training epochs, the model reaches a plateau, and continued training leads to a decrease in overall accuracy due to overfitting. A comprehensive evaluation indicates that the training effect at the 60th epoch is superior to other batches, with the precision of detection boxes and instance segmentation on the training set stabilizing at 94.6% (Table 2), representing the optimal performance. Therefore, the training results from 60 epochs are selected as the predictive weights for the model. Concurrently, to enhance understanding of the model’s recognition process and for further analysis, the prediction results are visually processed. Figure 7 illustrates the visualization starting from the original.

2.3. Model Transferability

To assess the transferability of the trained model, this study selected 24 typical villages from five traditional village spatial style areas in Shandong Province as test samples, referring to the style types and regional divisions of traditional villages in Shandong [65]. The selection of test sample villages took into account factors such as style features, village area, settlement morphology, preservation status, altitude, topography, and landforms, ensuring that the test samples cover all types of architectural characteristics of traditional villages in Shandong Province. A total of 57,738 validation data points were obtained (Figure 8). The test results were compared with actual data, as shown in Table 3. The model achieved high recognition accuracy for damaged buildings, flat-roofed buildings, red-tiled roof buildings, and gray-tiled roof buildings, all above 95%, with the recognition accuracy for red-tiled roof buildings reaching 97.1%. The recognition accuracy for abandoned buildings and buildings with colored steel tile roofs was relatively lower; the accuracy for abandoned buildings was 91.3%, mainly because the primary identification feature for abandoned buildings is the vegetation coverage in the courtyards, which can be easily misidentified due to the large area and high density of vegetation in public spaces and green areas within the villages. The recognition accuracy for buildings with colored steel tile roofs was 86.7%. Combining field research, the relatively low accuracy is attributed to some colored steel tiles exhibiting features similar to red tiles due to rust. On the other hand, some buildings have adopted colored steel tiles that imitate the gray roof style to protect the traditional village appearance, which also increases the difficulty of identification. Considering the limitations of the actual collected data and the diversity of architectural characteristics in traditional villages, the trained model demonstrates high accuracy in identifying key features such as roofs and courtyard vegetation coverage, meeting the requirements for practical work such as the investigation, evaluation, and dynamic monitoring of changes in traditional village buildings within the Shandong Province.

3. Results

This study selects Dongfanliu Village in Zhangqiu District, Jinan City, Shandong Province as the sample village for model application (Figure 9). Dongfanliu Village is a traditional Chinese village and a famous historical and cultural village in Shandong Province, with complete preservation of the spatial pattern and traditional architectural forms, making it a typical example of traditional villages in the central Shandong region [66]. In 2022, Dongfanliu Village was included in the national demonstration of concentrated protection and utilization of traditional villages. The construction of the demonstration zone has led to overall protection and enhancement of the traditional village. Concurrently, the construction of the Jiwei Expressway, which started in the same period, passes through the northern part of the village. These protective and constructive actions have had certain impacts on the village’s architectural style and spatial texture. The significant spatial changes in Dongfanliu Village provide an excellent sample for architectural investigation evaluation and dynamic monitoring. The research team conducted field investigation in Dongfanliu Village in June 2020 and June 2024, and obtained orthoimages of the village using unmanned aerial vehicles (Figure 10).

3.1. Building Condition Assessment

3.1.1. UAV Image Recognition

Inputting the UAV images from 2020 and 2024 into the deep learning model, this study extracts the building condition features from the two periods of village images (Figure 11 and Figure 12) and compares the changes in the data between the two periods. The results (Table 4) indicate that compared to 2020, in 2024, the number of damaged buildings in the village decreased from 31 to 15; the number of abandoned buildings decreased from 30 to 26; the number of red tile roofs decreased from 1343 to 1297; the number of gray tile roofs increased from 17 to 36; the number of colored steel tile roofs decreased from 44 to 24; and the number of flat-roofed buildings decreased from 1222 to 1200.

3.1.2. Changes in Building Condition

Based on the extracted data, a differential analysis was conducted on the building condition patches from the two periods (Figure 13). By combining manual discrimination to eliminate recognition errors, a transition matrix of building condition changes was constructed (Table 5), which analyzes the interconversion relationships between different types of building conditions. Specifically, four abandoned buildings were transformed into gray-tiled buildings; seven damaged buildings were changed into red-tiled buildings, and nine into gray-tiled buildings; five red-tiled buildings were converted into gray-tiled buildings, and 27 were demolished; 12 colored steel tile-roofed buildings were demolished, and four were changed into red-tiled buildings; 30 flat-roofed buildings were demolished; and three new gray-tiled buildings were constructed.

3.2. Building Condition Dynamic Monitoring

In accordance with the definitions of traditional village building conditions and their forms of change, as described earlier, the data from the building condition change transition matrix were further transformed and categorized into types such as abandonment, damage, restoration, rehabilitation, new construction, demolition, and remaining unchanged, to obtain dynamic monitoring indicator data for building condition changes (Table 6). Specifically, there were 18 restored buildings, including the restoration of nine originally damaged buildings, four originally abandoned buildings, and the traditional-style restoration of five originally red-tiled buildings; 11 rehabilitated buildings, comprising the rehabilitation of seven damaged buildings and four colored steel tile-roofed buildings; three newly built traditional-style buildings; 69 demolished buildings, all of which were of modern style; and 2527 unchanged buildings. The dynamic monitoring results of building condition changes from 2020 to 2024 are shown in Figure 14.
In conjunction with the “Conservation and Development Plan for Dongfanliu Traditional Village,” restoration, rehabilitation, and newly built traditional-style buildings are all located within the core protection area of the traditional village. The restored buildings are mainly in the southern part of the village, representing the restoration and protection of the Taihe Hall traditional residential building complex, which has formed a more complete historical building cluster in the area through restoration. The rehabilitated buildings are primarily concentrated in the central part of the village, including the rehabilitation of the Gong Family Ancestral Hall and the former residence of the martial scholar, which has provided better protection for key historical buildings in the village and further enriched the traditional village’s architectural style. The restoration and rehabilitation of buildings reflect the positive effects of the concentrated protection and utilization of traditional villages carried out in 2022.
Newly built buildings are mainly located in the northwest of the village, where some original modern buildings have been demolished to make way for new traditional-style constructions, demonstrating the strict control of construction activities in the core protection area as stipulated by the conservation and development plan, which has been well implemented. Demolished buildings are concentrated in the eastern part of the village, all within the construction control zone designated by the traditional village conservation and development plan, mainly due to the occupation of the village by a newly built expressway. The construction of the expressway has significantly impacted the integrity of the village’s spatial pattern and the completeness of its style. In the construction of major infrastructure projects, it is necessary to enhance the awareness of protecting traditional villages and strengthen the scientific demonstration of site (route) selection to reduce the disturbance and impact on cultural heritage resources such as traditional villages.

4. Discussion

4.1. Enhancing the Work Efficiency and Precision of Traditional Village Architecture Investigation and Evaluation

Architectural investigation and evaluation are essential for the protection and inheritance of traditional village architectural styles. Currently, traditional methods of architectural investigation and evaluation rely heavily on field investigations, which involve conducting individual research on village buildings to obtain relevant information. These conventional approaches are time-consuming, costly, labor-intensive, and subject to human error, which affects the accuracy and efficiency of the research. In contrast, this study proposes a cost-effective solution that combines UAV images with deep learning technology for the investigation and evaluation of traditional village architecture. According to traditional research methods, a team of six people spends a total of 20 h conducting field investigations and feature labeling for 2556 buildings in a single case study village, and an additional five hours on data analysis and visualization. The same village’s UAV image collection and processing can be completed by one person in 90 min (including 30 min for UAV outdoor flight preparation and aerial photography, and 60 min for UAV orthoimage output). With a well-trained deep learning model, pre-processing of UAV aerial data, architectural feature recognition, data statistics, and visualization image output can be completed within 90 min (Figure 15). In terms of the overall time consumption, the former is 50 times longer than the latter, and with the expansion of the research area and the increase in the number of case study villages, the new method proposed in this study can significantly improve work efficiency. Moreover, the architectural feature recognition method and judgment dimensions proposed in this study, which mainly focus on the status of building roofs, roof forms, and courtyard vegetation coverage, have few indicators and high recognition accuracy. In the verification of traditional village architectural features in Shandong Province, the overall model recognition accuracy reached 95.3%, meeting the requirements for the actual work of traditional village architectural investigation and evaluation and change monitoring. Its advantages of being fast, accurate, and low-cost make it suitable for large-scale traditional village research.

4.2. The Potential for Diversified Applications in the Conservation and Development of Traditional Villages

This study achieves the investigation, evaluation, and dynamic monitoring of changes in traditional village architecture through the transformation and classification of architectural feature recognition and feature change transition matrices, with the results visualized for spatial analysis. Taking Dongfanliu Village as an example, abandoned and damaged buildings are mainly distributed within the core protection area, showing a generally scattered and locally concentrated pattern, which is extensive and requires enhanced protective measures. Between 2020 and 2024, buildings that were repaired and renovated were relatively concentrated, all located at significant spatial nodes of the village, positively contributing to the protection of the village’s spatial structure and texture. During the same period, demolished buildings were concentrated in the eastern part of the village, primarily due to the construction of a highway affecting the village. By analyzing the spatial distribution and change characteristics, long-term, dynamic, and precise monitoring of traditional villages can be realized.
Additionally, the technical methods proposed in this study have broad application prospects in other aspects of the protection and development of traditional villages. For instance, by analyzing the extracted architectural feature data, it is possible to accurately identify whether buildings are inhabited or uninhabited and assess the state of building abandonment without the need for field research. Furthermore, rapid quantitative assessment of damage and abandonment in the core protection areas of traditional villages can be conducted, which not only helps to accurately identify the protection status of traditional villages but also determines key protective spaces and directions. By analyzing the spatial distribution of historical and characteristic buildings in the village, detailed data support can also be provided for the optimization of traditional village spatial patterns and the direction of architectural renewal.

4.3. Limitations and Further Research Directions

This study innovatively applies deep learning technology for the investigation, evaluation, and dynamic monitoring of changes in traditional village architecture, but it also has certain limitations. Although UAVs can conveniently obtain image data, the collected samples are limited. In the future, it may be possible to supplement the data by acquiring high-precision remote-sensing satellite images or commercial UAV images. Moreover, this study uses traditional villages within Shandong Province as samples, but the spatial characteristics of traditional villages in China vary greatly by region. To enhance the applicability of the model, it is necessary to collect spatial feature information from traditional villages in different regions of China, expand the sample training set data, and broaden the scope of the model application. Due to the consideration of obtaining courtyard vegetation coverage features, the data collected are mostly from summer images. Further refinement of recognition features and enhancement of the model’s accuracy and reliability are still needed for identifying the characteristics of traditional village architecture in other seasons, especially in northern regions.
The study primarily employs the YOLOv8 instance segmentation model to recognize the architectural features of traditional villages, with research results focusing on the statistics of the number of buildings with different features and their changes. This technical method still has significant potential for expansion in the automated extraction of other spatial feature data. It can be combined with related analysis software such as ArcGIS Pro 3 to obtain specific spatial data, including building area, building spacing, building orientation, building density, settlement morphology and boundary shape, and distribution of public spaces. By extracting various types of spatial parameters, it can provide data support for the research and practice of traditional village conservation.

5. Conclusions

This study utilizes UAV image and deep learning technology to explore a new technical approach and workflow for the investigation, evaluation, and dynamic monitoring of traditional village architecture. By employing the YOLOv8 instance segmentation model and incorporating three key features reflecting the condition of traditional village architecture—namely, the state of building roofs, roof forms, and the coverage of courtyard vegetation—the study extracts feature data of traditional village buildings, constructs a matrix of building condition changes, and combines manual judgment assistance to classify, count, and visually output the conditions and changes of buildings, achieving investigation, evaluation, and dynamic monitoring of traditional village architecture. The results indicate that deep learning technology can significantly enhance the efficiency and accuracy of the investigation and evaluation of traditional village architecture and has a good application effect in the dynamic monitoring of the condition of traditional village buildings. The “UAV image + deep learning” technical system, with its simplicity, accuracy, efficiency, and low cost, can provide timely and reliable data and technical support for the planning, protection supervision, and development strategy formulation of traditional villages. As the model dataset expands and the algorithms continue to be optimized, deep learning technology is also expected to play a greater role in the protection of urban and rural cultural heritage.

Author Contributions

Conceptualization, X.L. and Y.F.; methodology, X.L. and Y.F.; software, X.L. and C.S.; validation, X.L. and Y.F.; formal analysis, Y.F.; investigation, Y.F. and X.L.; resources, X.L. and Y.F.; data curation, Y.F., X.L. and C.S.; writing—original draft preparation, X.L. and Y.F.; writing—review and editing, X.L. and Y.Y.; visualization, Y.F. and X.L.; supervision, Y.Y.; project administration, Y.F. and X.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Education of the People’s Republic of China: 21YJCZH024, the People’s Government of Shandong Province: ZR2021ME226 and the Shandong Federation of Social Science: 24BFS146.

Data Availability Statement

The processed data used to support the findings of this study have not been made available because the data is part of ongoing research.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
UAVUnmanned Aerial Vehicle

References

  1. Wu, X.Y.; Zhang, X.M. Mechanism and Path of Modern Transformation of Traditional Villages in the Perspective of Rural Revitalization. Agric. Econ. 2024, 7, 67–69. [Google Scholar]
  2. Liu, X.H.; Wang, Z.Y.; Hu, C.; Liu, H. A review of the application of deep learning technology in the study of traditional Chinese villages. Inter. Archit. China 2024, 4, 141–143. [Google Scholar] [CrossRef]
  3. Monna, F.; Rolland, T.; Denaire, A.; Navarro, N.; Granjon, L.; Barbé, R.; Chateau-Smith, C. Deep learning to detect built cultural heritage from satellite imagery. Spatial distribution and size of vernacular houses in Sumba, Indonesia. Agric. Econ. 2021, 52, 171–183. [Google Scholar] [CrossRef]
  4. Wu, M.P. Preventive Conservation of Architectural Heritage: An Overview of the Theoretical Development and Practices in European. China Cult. Herit. 2020, 2, 59–78. [Google Scholar]
  5. Zhang, B.G. Research and monitoring of urban construction environment should be incorporated into the urban planning and design information system. Planners 2000, 3, 57. [Google Scholar]
  6. Huang, J.X.; Dang, A.R.; Li, X.Y.; Weng, Y. History and trends of urban planning and management modernization in China. Sci. Technol. Rev. 2024, 42, 51–60. [Google Scholar]
  7. Zhang, J.T.; Tan, L.B. Knowledge Maps Analysis in Domestic Traditional Village Digitalization Field Based on CiteSpace. Dev. Small Cities Towns 2024, 42, 108–115. [Google Scholar]
  8. Wang, J.H.; Cheng, B.Y.; Lv, Z.C. Automatie Monitoring of Linear Cultural Heritage Based on Multi-temporal DOM Images. Geomat. Inf. Sci. Wuhan Univ. 2019, 44, 77–83. [Google Scholar]
  9. Tapete, D.; Cigna, F. InSAR data for geohazard assessment in UNESCO World Heritage sites: Stateof-the-art and perspectives in the Copernicus era. Int. J. Appl. Earth Obs. Geoinf. 2017, 63, 24–32. [Google Scholar] [CrossRef]
  10. Ulvi, A. Documentation, Three-Dimensional (3D) Modelling and visualization of cultural heritage by using Unmanned Aerial Vehicle (UAV) photogrammetry and terrestrial laser scanners. Int. J. Remote Sens. 2021, 42, 1944–2021. [Google Scholar] [CrossRef]
  11. Bakrman, T.; Bayram, B.; Akpinar, B.; Karabulut, M.; Bayrak, O.; Yigitoglu, A.; Seker, D. Implementation of ultra-light UAV systems for cultural heritage documentation. J. Cult. Herit. 2024, 44, 174–184. [Google Scholar] [CrossRef]
  12. Jorayev, G.; Wehr, K.; Benito-Calvo, A.; Njau, J.; Torre, I. Imaging and photogrammetry models of Olduvai Gorge (Tanzania) by Unmanned Aerial Vehicles: A high-resolution digital database for research and conservation of Early Stone Age sites. J. Archaeol. Sci. 2016, 75, 40–56. [Google Scholar] [CrossRef]
  13. He, H.Y.; Chen, C.F.; Chen, F.L.; Tang, P.P. Deformation monitoring along the landscape corridor of Zhangjiakou Ming Great Wall using Sentinel-1 SBAS-InSAR approach. Remote Sens. Nat. Resour. 2021, 33, 205–213. [Google Scholar]
  14. Xu, Z.Q.; Zhang, N. Strengthening the protection and monitoring of historical and cultural heritage in urban and rural areas and improving the level of digital governance. Urban Rural Dev. 2024, 10, 98–99. [Google Scholar]
  15. Lu, B.B.; Tian, X.X.; Qin, S.S.; Shi, W.L.; Li, J.S. Urban Land Resources Carrying Capacity Evaluation of Wuhan with Geographically Weighted Techniques. Geomat. Inf. Sci. Wuhan Univ. 2024. [Google Scholar] [CrossRef]
  16. Yang, C.; Chen, Z.Y. Digital Conservation of World Cultural and Natural Heritage and New International Trends Thereof. Landsc. Archit. 2023, 30, 96–102. [Google Scholar]
  17. Sun, H.; Gao, J.H.; Yan, T.Y. Remote sensing of vegetation drought: Research progress. Natl. Remote Sens. Bull. 2024, 28, 1395–1411. [Google Scholar]
  18. Hu, Y. Investigation on Naxi Traditional Villages from the Perspective of Cultural Landscape. Master’s Thesis, Beijing Jiaotong University, Beijing, China, December 2022. [Google Scholar]
  19. Tan, H.; Liu, Y. Value Evaluation and Classification Protection of Traditional Villages in Yunnan Based on the Objective of Cultural Heritage. J. Southwest For. Univ. 2022, 6, 92–100. [Google Scholar]
  20. Xia, Y.; Guo, J.; Zuo, Y. Preservation Strategy and Soundscape Survey of Tourism-Oriented Traditional Villages: Taking Hongcun as an Example. Huazhong Archit. 2022, 40, 173–177. [Google Scholar]
  21. Su, J.M.; Liu, X.Q. Investigation and Analysis of the Color Features in Traditional Huizhou Villages from the Perspective of Color Geography. Huazhong Archit. 2023, 41, 155–160. [Google Scholar]
  22. Liu, P.L.; Li, X.T.; Yan, G.L.; Liu, Y.C. Digital Monitoring and Warning Model of Traditional Village Landscape from the Perspective of Culture and Tourism Integration. Econ. Geogr. 2022, 42, 193–200. [Google Scholar]
  23. Tao, J.; Chen, Z.X.; Wei, C.; Xiao, D.W.; Hang, J.P. A Dynamic Monitoring and Early Warning System for Traditional Villages in China. S. Archit. 2023, 4, 29–39. [Google Scholar]
  24. Li, Q.; Wang, W.; Xu, X.D.; Xu, N. Construction of A Classified Monitoring System for Traditional Villages from the Perspective of “Production, Life, and Ecology”. S. Archit. 2022, 5, 45–53. [Google Scholar]
  25. Tan, Q.H.; Xiao, D.W.; Luo, M.N.; Tao, J. A Study on the Classification of Traditional Village Images Based on Convolutional Neural Network. City Plan. Rev. 2020, 44, 52–58. [Google Scholar]
  26. Wang, T.; Chen, J.; Liu, L.; Guo, L.L. A Review: How Deep Learning Technology Impacts the Evaluation of Traditional Village Landscapes. Buildings 2023, 13, 525. [Google Scholar] [CrossRef]
  27. Huo, X.W.; Zhang, J.; Luo, W.; Wang, P.; Yin, W.Y. The Technical Exploration and Practice of Deep Learning Recognition in Traditional Textures of Historical Settlements. Dev. Small Cities Towns 2024, 42, 87–94. [Google Scholar]
  28. Zhang, C.M.; Tan, R.S.; Song, C.M.; Zhao, D.S. Research on Quantitative Measurement of Automatic Classification of Residential Buildings Under Deep Learning. Artif. Intell. Sci. Eng. 2023, 48, 1–11. [Google Scholar]
  29. Xue, H.; Liu, K.; Wang, Y.; Chen, Y.X.; Huang, C.Y.; Wang, P.F.; Li, L. MAD-UNet: A Multi-Region UAV Remote Sensing Network for Rural Building Extraction. Sensors 2024, 24, 2393. [Google Scholar] [CrossRef]
  30. Feng, F.; Wang, S.T.; Zhang, J.; Wang, C.Y.; Liu, B. Building Extraction from Remote Sensing lmagery Based on Scale Adaptive Fully Convolutional Network. Laser Optoelectron. Prog. 2021, 58, 459–470. [Google Scholar]
  31. Chen, X.J.; Tian, Q.L.; Yi, P.Y. Building Extraction from High-Resolution Remote Sensing Images with Deep Learning. World Nucl. Geosci. 2023, 40, 81–88. [Google Scholar]
  32. Cote, M.; Saeedi, P. Automatic Rooftop Extraction in Nadir Aerial Imagery of Suburban Regions Using Corners and Variational Level Set Evolution. IEEE Trans. Geosci. Remote Sens. 2013, 51, 313–328. [Google Scholar] [CrossRef]
  33. Zhong, S.; Huang, J.; Xie, W. A new method of building detection from a single aerial photograph. In Proceedings of the 9th International Conference on Signal Processing (ICSP 2008), Beijing, China, 26–29 October 2008. [Google Scholar]
  34. Awrangjeb, M.; Zhang, C.; Fraser, C.S. Improved Building Detection Using Texture Information. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2013, 38, 143–148. [Google Scholar] [CrossRef]
  35. Dunaeva, A.V.; Kornilov, F.A. Specific shape building detection from aerial imagery in infrared range. Ural Fed. Univ. Named First Pres. Russ. B.N. Yeltsin 2017, 6, 84–100. [Google Scholar]
  36. Li, E.; Femiani, J.; Xu, S.; Zhang, X.P.; Wonka, P. Robust Rooftop Extraction from Visible Band Images Using Higher Order CRF. IEEE Trans. Geosci. Remote Sens. 2015, 53, 4483–4495. [Google Scholar] [CrossRef]
  37. Song, S.R. Research of Building Extraction and Chronological Classification in Southern Jiangsu Province Based on Deep Learning. Master’s Thesis, China University of Mining and Technology, Xuzhou, China, May 2019. [Google Scholar]
  38. Zuo, T.; Feng, J.; Chen, X. HF-FCN: Hierarchically Fused Fully Convolutional Network for Robust Building Extraction. In Proceedings of the 13th Asian Conference on Computer Vision (ACCV 2016), Taipei, China, 20–24 November 2016. [Google Scholar]
  39. Song, S.R. Research on Object Recognition Method of Urban Buildings in High Spatial Resolution Remote Sensing Imagery. Master’s Thesis, Beijing University of Civil Engineering and Architecture, Beijing, China, May 2020. [Google Scholar]
  40. Hu, Y.; Guo, F. Building Extraction Using Mask Scoring R-CNN Network. In Proceedings of the 3rd International Conference on Computer Science and Application Engineering (CSAE 2019), Sanya, China, 22–24 October 2019. [Google Scholar]
  41. Yan, Z.R. Research on Urban Building Detection Method in UAV Images Based on Convolutional Attention. Geomat. Spat. Inf. Technol. 2022, 45, 265–268. [Google Scholar]
  42. Zeng, Y.; Guo, Y.; Li, J. Recognition and extraction of high-resolution satellite remote sensing image buildings based on deep learning. Neural Comput. Appl. 2022, 34, 2691–2706. [Google Scholar] [CrossRef]
  43. Ma, J.B.; Zhou, Z.X. Automatic Recognition Method of Illegal Buildings in UAV Image based on U-net Network. Jiangxi Surv. Mapp. 2022, 3, 13–66. [Google Scholar]
  44. Zhang, Y.; Guo, H.M.; Yin, W.G.; Zhao, Z.; Lu, C.J.; Yu, Y.Y. Recognition Method for Earthquake-induced Building Damage from Unmanned-aerial-vehicle-based Images Using Bag of Words and Histogram Intersection Kernel Support Vector Machine. Sens. Mater. 2022, 34, 4383–4404. [Google Scholar] [CrossRef]
  45. Wang, C.; Qiu, X.; Liu, H.; Li, D.; Zhao, K.G.; Wang, L.L. Damaged buildings recognition of post-earthquake high-resolution remote sensing images based on feature space and decision tree optimization. Comput. Sci. Inf. Syst. 2020, 17, 619–646. [Google Scholar] [CrossRef]
  46. Du, H.; Cao, Y.; Zhang, F.; Lv, J.L.; Deng, S.R.; Lu, Y.K.; He, S.F.; Zhang, Y.S.; Yu, Q.K. A classification method of building structures based on multi-feature fusion of UAV remote sensing images. Earthq. Res. Adv. 2021, 1, 100069. [Google Scholar] [CrossRef]
  47. Wang, S.; Park, S.; Park, S.; Kim, J. Building façade datasets for analyzing building characteristics using deep learning. Data Brief 2024, 57, 110885. [Google Scholar] [CrossRef] [PubMed]
  48. Zheng, Y.; Lin, Y.M.; Zhao, L.; Wu, T.H.; Jin, D.P.; Li, Y. Spatial planning of urban communities via deep reinforcement learning. Nat. Comput. Sci. 2023, 11, 748–762. [Google Scholar] [CrossRef] [PubMed]
  49. Chen, B.; Peng, X.Y.; Zhou, S.H.; Chen, J.L.; Kong, X.J.; Bian, M.Y.; Lin, G.Y. Intelligent Recognition and 3D Modeling of Rural Buildings Based on Multi-Source Data Fusion. Trop. Geogr. 2023, 43, 190–201. [Google Scholar]
  50. Meng, C.; Song, Y.; Ji, J.; Jia, Z.Y.; Zhou, Z.X.; Gao, P.; Liu, S.X.Y. Automatic classification of rural building characteristics using deep learning methods on oblique photography. Build. Simul. 2022, 15, 1161–1174. [Google Scholar] [CrossRef]
  51. Wang, M.; Xu, W.; Cao, G.; Liu, T. Identification of rural courtyards’ utilization status using deep learning and machine learning methods on unmanned aerial vehicle images in north China. Build. Simul. 2024, 17, 799–818. [Google Scholar] [CrossRef]
  52. Wang, D.G.; Lv, Q.Y.; Wu, Y.F.; Fan, Z.Q. The characteristic of regional differentiation and impact mechanism of architecture style of traditional residence. J. Nat. Resour. 2019, 34, 1864–1885. [Google Scholar]
  53. Huang, Y.J. Spatial Shape and Structure of Traditional Residence in Shandong Historic Village. Hundred Sch. Arts 2013, 29, 82–84. [Google Scholar]
  54. GB55035-2023; Specification for Urban and Rural Historical and Cultural Protection and Utilization Projects. Ministry of Housing and Urban-Rural Development of the People’s Republic of China: Beijing, China, 2023.
  55. Guidelines for the Construction of Protection and Enhancement Projects for Historical and Cultural Cities and Blocks (Trial); Ministry of Housing and Urban-Rural Development of the People’s Republic of China: Beijing, China, 2024.
  56. Redmon, J. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016), Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar]
  57. Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, C.; Fu, C.Y.; Bergetc, A. SSD: Single Shot MultiBox Detector; Springer: Berlin/Heidelberg, Germany, 2016; Volume 33, pp. 205–213. [Google Scholar]
  58. He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (ICCV 2017), Venice, Italy, 22–29 October 2017. [Google Scholar]
  59. Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef]
  60. Wang, G.; Chen, Y.; An, P.; Hong, H.Y. UAV-YOLOv8: A Small-Object-Detection Model Based on Improved YOLOv8 for UAV Aerial Photography Scenarios. Sens. Multidiscip. Digit. Publ. Inst. 2023, 23, 7190. [Google Scholar] [CrossRef] [PubMed]
  61. Wang, W.; Cheng, Y.; Zhou, Y.K.; Zhang, W.J.; Wang, j. Research on Lightweight Network for Rapid Detection of Remote Sensing Image Targets based on YOLO. Remote Sens. Technol. Appl. 2024, 39, 547–556. [Google Scholar]
  62. Yang, Z.Y.; Luo, L.; Wu, T.Y.; Yu, B.X. Improved Lightweight Ship Target Detection Algorithm for Optical Remote Sensing Images with YOLOv8. Comput. Eng. Appl. 2024, 60, 248–257. [Google Scholar]
  63. Liu, Y.; Lv, F.S.; Li, X.K.; Zuo, S.Y. Remote sensing image target detection algorithm based on YOLOv8-LGA. J. Optoelectron. Laser 2024, 33, 205–213. [Google Scholar]
  64. Jocher, G.; Chaurasia, A.; Qiu, J. Ultralytics YOLO (Version 8.0.0). 2023. Available online: https://github.com/ultralytics/ultralytics (accessed on 15 April 2024).
  65. Fan, Y.; Li, X.; Xiao, W.J. Study on the Spatial Style Types and Zoning of Traditional Villages in Shandong Province Based on K-modes Clustering Algorithm. Dev. Small Cities Towns 2024, 42, 100–107. [Google Scholar]
  66. Fan, Y.; Li, J.; Yang, X.N. Spatial Analysis and Style Inheritance of Traditional Settlements in the Context of Space: Time Deconstruction as Exemplified in Dongfanliu Village, Jinan. J. Shandong Agric. Eng. Univ. 2023, 40, 83–93. [Google Scholar]
Figure 1. Examples of architectural roof features and courtyard vegetation coverage features.
Figure 1. Examples of architectural roof features and courtyard vegetation coverage features.
Sustainability 16 08954 g001
Figure 2. Technical flowchart.
Figure 2. Technical flowchart.
Sustainability 16 08954 g002
Figure 3. Description of the UAV photography.
Figure 3. Description of the UAV photography.
Sustainability 16 08954 g003
Figure 4. Example of original orthoimage cropping.
Figure 4. Example of original orthoimage cropping.
Sustainability 16 08954 g004
Figure 5. Data augmentation measures example (a). Image flipping, (b). Image rotation, (c). Image blurring).
Figure 5. Data augmentation measures example (a). Image flipping, (b). Image rotation, (c). Image blurring).
Sustainability 16 08954 g005
Figure 6. Loss function across different epochs.
Figure 6. Loss function across different epochs.
Sustainability 16 08954 g006
Figure 7. Visualization process in deep learning.
Figure 7. Visualization process in deep learning.
Sustainability 16 08954 g007
Figure 8. Test sample villages building feature recognition example.
Figure 8. Test sample villages building feature recognition example.
Sustainability 16 08954 g008
Figure 9. Location map of Dongfanliu village.
Figure 9. Location map of Dongfanliu village.
Sustainability 16 08954 g009
Figure 10. UAV orthoimagery of Dongfanliu village (a) captured in June 2020, (b) captured in June 2024.
Figure 10. UAV orthoimagery of Dongfanliu village (a) captured in June 2020, (b) captured in June 2024.
Sustainability 16 08954 g010
Figure 11. Building condition recognition results of Dongfanliu village in 2020.
Figure 11. Building condition recognition results of Dongfanliu village in 2020.
Sustainability 16 08954 g011
Figure 12. Building condition recognition results of Dongfanliu village in 2024.
Figure 12. Building condition recognition results of Dongfanliu village in 2024.
Sustainability 16 08954 g012
Figure 13. Building patch change analysis from 2020 to 2024.
Figure 13. Building patch change analysis from 2020 to 2024.
Sustainability 16 08954 g013
Figure 14. Dynamic monitoring results of building condition changes from 2020 to 2024.
Figure 14. Dynamic monitoring results of building condition changes from 2020 to 2024.
Sustainability 16 08954 g014
Figure 15. Comparison of deep learning model recognition results (a) with manual annotation results (b).
Figure 15. Comparison of deep learning model recognition results (a) with manual annotation results (b).
Sustainability 16 08954 g015
Table 1. Model stability evaluation metrics.
Table 1. Model stability evaluation metrics.
MetricsFormula
Precision T P   * T P + F P   *
Recall T P T P + F N   *
Mean Average PrecisionmAP50the mAP value at the 50% IoU * threshold
mAP50-90the mAP value within the 50–95% IoU threshold range
* TP: Number of buildings correctly classified; FP: Number of buildings incorrectly classified as this category (i.e., buildings of other categories incorrectly identified as this category); FN: Number of buildings that should have been identified as this category but were not; IoU threshold represents the ratio of the intersection area to the union area between the predicted region and the true region.
Table 2. Training results across different training epochs.
Table 2. Training results across different training epochs.
EpochsBoxMask
PrecisionRecallmAP50mAP50-95PrecisionRecallmAP50mAP50-95
150.8730.7560.8340.6860.8730.7560.8290.656
300.9080.7400.8400.6900.9080.7400.8420.609
600.9460.7350.8410.6940.9460.8250.8300.658
1000.9310.7400.8330.6970.9310.7900.8340.678
2000.8700.7680.8010.6720.9160.7300.7980.629
5000.8710.8050.8360.6950.8950.7900.8410.675
Table 3. Test sample villages building feature recognition results.
Table 3. Test sample villages building feature recognition results.
ClassificationOriginal QuantityIdentify QuantityCorrect ClassificationUnrecognizedIdentify ErrorsPrecisionRecall
Abandonment552579528245191.3%95.7%
Damage408394377311795.6%92.4%
Red-tiled25340259402518715375397.1%99.4%
Gray-tiled42304278411111916796.1%97.2
Colored steel-tiled36083524305655246886.7%84.7%
Flat roof236002434323199401114495.3%98.3%
Table 4. Analysis of changes in building condition recognition results between 2020 and 2024 in Dongfanliu village.
Table 4. Analysis of changes in building condition recognition results between 2020 and 2024 in Dongfanliu village.
Classification20202024Change Quantity
Abandonment30264
Damage311516
Red-tiled1343129746
Gray-tiled173819
Colored steel-tiled442420
Flat roof1222120022
Table 5. Transition matrix of building condition changes from 2020 to 2024.
Table 5. Transition matrix of building condition changes from 2020 to 2024.
20202024
AbandonmentDamageRed-TiledGray-TiledColored Steel-TiledFlat RoofDemolitionRedundancyTotal Quantity
Abandonment26004000030
Damage01579000031
Red-tiled00128650027151343
Gray-tiled00017000017
Colored steel-tiled004024012444
Flat roof00000120030−81222
New construction——00300————3
Total quantity26151297382412006911——
Table 6. Dynamic monitoring indicator data of building condition changes from 2020 to 2024.
Table 6. Dynamic monitoring indicator data of building condition changes from 2020 to 2024.
AbandonmentDamageRestorationRehabilitationNew ConstructionDemolitionUnchanged
20203031——————————
2024261518113692527
Quantity changes−4−1618113692527
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, X.; Yang, Y.; Sun, C.; Fan, Y. Investigation, Evaluation, and Dynamic Monitoring of Traditional Chinese Village Buildings Based on Unmanned Aerial Vehicle Images and Deep Learning Methods. Sustainability 2024, 16, 8954. https://doi.org/10.3390/su16208954

AMA Style

Li X, Yang Y, Sun C, Fan Y. Investigation, Evaluation, and Dynamic Monitoring of Traditional Chinese Village Buildings Based on Unmanned Aerial Vehicle Images and Deep Learning Methods. Sustainability. 2024; 16(20):8954. https://doi.org/10.3390/su16208954

Chicago/Turabian Style

Li, Xuan, Yuanze Yang, Chuanwei Sun, and Yong Fan. 2024. "Investigation, Evaluation, and Dynamic Monitoring of Traditional Chinese Village Buildings Based on Unmanned Aerial Vehicle Images and Deep Learning Methods" Sustainability 16, no. 20: 8954. https://doi.org/10.3390/su16208954

APA Style

Li, X., Yang, Y., Sun, C., & Fan, Y. (2024). Investigation, Evaluation, and Dynamic Monitoring of Traditional Chinese Village Buildings Based on Unmanned Aerial Vehicle Images and Deep Learning Methods. Sustainability, 16(20), 8954. https://doi.org/10.3390/su16208954

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop