1. Introduction
Geostationary orbit remote sensing satellites have the advantages of high temporal resolution and large imaging width, which enables them to continuously cover large areas. Consequently, images from geostationary orbit satellites are frequently utilized in meteorology, environmental protection, fire monitoring, and other remote sensing applications [
1,
2]. The application of such images is becoming increasingly popular due to continuous worldwide efforts in developing a new generation of geostationary satellite sensors. Therefore, geostationary orbit remote sensing satellites are now producing an unprecedented amount of earth observation data. For example, recently, the new generations of the GOES-R satellite series in North America [
3,
4], the Himawari-8/9 satellite in Japan [
4,
5], and the Fengyun-4A (FY-4A) satellite [
6,
7] and the GF-4 satellite in China [
8,
9] have been successfully launched, and the MTG-I satellite in Europe is scheduled for launch in 2021 [
10]. Images from geostationary orbit remote sensing satellites offer an opportunity to use super-resolution reconstruction (SRR) technology to further improve image spatial resolution and mapping.
It is extremely difficult to obtain high spatial resolution images from geostationary orbit remote sensing satellites due to the limitations of optical imaging payload technology, the physical constraints of the imaging instruments themselves, the high altitude of the satellites, and the large distance to the imaging object (which is dozens of times that of low-earth-orbit satellite images). Spatial resolution is of key importance for the utility of satellite-based earth observing systems [
11]. Multispectral satellite images with a high spatial resolution can provide detailed and accurate information for land-use and land-cover mapping [
12]. The factors that affect the spatial resolution of multispectral sensors include the orbit altitude, the speed of the platform, the instantaneous field of view, and the revisit cycle. Studies have investigated ways to improve the spatial resolution of images from geostationary orbit remote sensing satellites via post-processing, since it is very difficult to upgrade multiple imaging devices once a satellite has been launched.
Methods for the mapping of remote sensing images include hard classification and soft classification techniques [
13]. Traditional hard classification techniques generally cannot effectively classify mixed pixels in land cover. Techniques such as maximum likelihood classification (MLC) [
14], support vector machine (SVM) [
15], and random forest classifiers (RFC) [
16] can all effectively classify mixed pixels by estimating the fractional abundance of land cover classes in each mixed pixel. However, these methods are limited by the distribution in the spatial information of land cover. Research on super-resolution mapping (SRM) has developed rapidly in recent years. SRM is a super-resolution soft classification based on the Markov random field theorem, in which the fractional images are first allocated randomly under computational constraints [
17], and the initialized results are then optimized by changing the spatial arrangement of subpixels to gradually approach a certain objective, or via a pixel-swapping optimization algorithm, the neighboring pixel value [
18], minimizing the perimeter of the images [
19], and geostatistical indicators based on SRM [
20] in order to generate a realistic spatial structure on the refined thematic map. Due to the limited performance of the soft classification of low-resolution image datasets, the SRM method has high algorithm complexity and slow operational efficiency. Therefore, artificial intelligence algorithms, such as genetic algorithms [
21], particle swarm optimization [
22], and sparse algorithms [
23,
24] have not been fully employed in the processing of images from geostationary orbit multispectral remote sensing satellites.
In this study, the accuracy of mapping based on images from geostationary orbit remote sensing satellites was improved using super-resolution reconstruction (SRR). The employed method is different to the SRM approach in that it can increase the number of informative pixels using the mismatch in ground texture information from image to image. These reconstructed higher spatial resolution images can then be classified without any limitation on the classification techniques adopted. The SRR method has been widely used for the processing of remote sensing satellite images. It includes (1) a frequency-domain method, (2) a combined spatial-domain and frequency-domain method, and (3) a deep-learning-based method. The SRR method is simple to implement and can be implemented in parallel; however, the classification result is poor. The airspace reconstruction method is comprised of the non-uniform sampling interpolation iterative back projection (IBP) method [
25], projection onto convex sets (POCS) methods [
26], the maximum a posteriori (MAP) algorithm [
27], the total variation (TV) algorithm, a convolutional neural network algorithm [
28,
29], etc. However, although these methods are relatively simple and easy to implement, they are not suitable for the practical classification of satellite images. In recent years, a large number of SRR methods based on sparse representation have been developed. However, in the frequency domain, specific transformations can only provide sparse representation for specific types of input signals, and thus it is difficult to develop a general image sparse representation method. Some researchers are currently working on finding a universal solution to the image ill-posed inverse problem. For example, an overlapping group sparsity total variation (OGSTV) model [
30] was used to restore damaged images and was demonstrated to be highly effective in reducing the stairstep effect. Additionally, other researchers proposed a model based on the alternating direction method of multipliers (ADMM) to regularize TV and showed this model to be very effective for removing salt and pepper noise, although not random noise [
31]. Meanwhile, the authors proposed an SRR method based on dictionary and sparse representation [
32]; however, this method required a relatively large number of training samples.
In this paper, we adopt a novel SRR method called the mixed sparse representation non-convex high-order total variation (MSR-NCHOTV) method to achieve the fine classification of images from geostationary orbit remote sensing satellites. To the best of our knowledge, there has been no previous attempt to formulate a solution for a mixed sparse representation model for such classification. The sparsity of the image spatial domain model and OGSTV and NCHOTV regularizers were employed for the removal of staircase artifacts from geostationary orbit remote sensing satellite images. To effectively handle the subproblems and constraints, we adopted the ADMM algorithm to improve the quality of the reconstruction of sparse signals as well as the computing speed of the SRR algorithm. We used low-resolution GF-4 images of sequential frames obtained from the same scene over a short time to ensure the robustness of the algorithm. We improved the spatial resolution of the images by integrating several multispectral GF-4 images acquired on different dates. Overall, it is shown that compared with a bilinear interpolation (BI) SRR method, the MSR-NCHOTV method obtained a better classification and clustering outcome, both visually and numerically.
The remainder of this paper is organized as follows. In
Section 2, the methodology of the MSR-NCHOTV SRR method is briefly reviewed. In
Section 3, the experimental data and the pretreatment of GF-4 satellite images are described. In
Section 4, the classification results are presented and compared to demonstrate the effectiveness of the proposed method. Finally, conclusions are presented in
Section 5.
3. Experimental Data and Pretreatment
3.1. Experimental Data
This study used image datasets from the GF-4 geostationary orbit remote sensing satellite. This is China’s first high-resolution geostationary orbit optical imaging satellite and the world’s most sophisticated high-resolution geostationary orbit remote sensing satellite. The GF-4 satellite was launched on 29 December 2015 from the Xichang Satellite Launch Center in Sichuan Province, Southwestern China, and is equipped with a CMOS plane array optical remote sensing camera. The GF-4 satellite has a high temporal resolution, high spatial resolution, and large imaging width. It has a ground spatial resolution of 50 m, an image width of 500 × 500 km, an orbital height of 36,000 km, and allows high-frequency, all-weather, continuous, long-term observation of large areas [
8,
44].
3.2. Research Area
In this study, two sets of 10 consecutive GF-4 satellite image frames with different shooting times and covering different regions were selected. The spatial resolution of these experimental data is 50 m, and the shooting interval is one frame per minute.
Dataset 1 covered the Binhai New area of Tianjin City, covering the intersection between the Shandong Peninsula and the Liaodong Peninsula from 38°40′~39°00′N and 117°20′~118°00′E. The data were acquired at 10:40 a.m. on 24 August 2018. The Binhai New area has 153 km of coastline, a land area of 2270 km2, and a sea area of 3000 km2. The climate characteristics of the area have components of a continental warm temperate zone monsoon climate and a maritime climate.
Dataset 2 was acquired at 09:00 a.m. on 25 June 2016 and covered the Wendeng City district, Weihai City, Shandong Province. The Wendeng City district, located in the east of the Shandong Peninsula from 36°52′~37°23′N and 121°43′~122°19′E, covers a total area of 1645 km2 and has 155.88 km of coastline. The district has a continental monsoon climate with four distinct seasons.
According to the imaging mode of the GF-4 geostationary orbit staring satellite, a pixel window of 1024 × 1024 was selected from each group of images for the study area in order to improve the classification efficiency of the satellite images, as shown in
Figure 2.
In this study, an SVM classification method was adopted to verify the classification accuracy of the proposed method. In the classification process, high-resolution Google Earth images of the two study areas were used to verify the correctness of the selection of the classification training samples.
3.3. Experimental Process
GF-4 satellite images have a wide coverage, and the region of interest (ROI) varies from frame to frame. In this paper, methods for cutting GF-4 satellite images are divided into manual and automatic methods. In the process of selecting the same ROI area in each GF-4 image frame for cutting, the size of each frame was 10,240 × 10,240 pixels. Although the position of each frame is offset relative to the position of the other frames, the cutting of the ROI area is not affected since each frame is the same size. Firstly, we selected an image frame, annotated the X and Y coordinates of the point in the top left corner of the cutting area, and then entered the length and width of the image to be cut using a cutting algorithm in the Matlab 2019a software (MathWorks, Natick, MA, USA) and the C++ programming language, finally obtaining the required cutting area of the ROI.
Due to the influence of various factors in the satellite remote sensing imaging process, the image acquisition time is inconsistent for each spectral segment. By conducting band registration between Band 1 and Bands 2, 3, 4, and 5, respectively, of the same frame of the GF-4 satellite image, it was found that the difference in pixel values between different bands of each frame of the GF-4 satellite image is one or more pixels, and the difference in spatial resolution is tens to hundreds of meters. The results of the band registration accuracy evaluation of a single frame of a GF-4 satellite image are shown in
Table 1.
Pixel offset errors between the bands of the GF-4 satellite greatly reduce the data quality of the GF-4 satellite images, and thus, greatly affect the subsequent processing and application of GF-4 satellite data. Therefore, we adopted a joint registration method based on ORB feature extraction and intensity to improve the accuracy of the SRR. As shown in
Figure 3, the red and green double images of the registered image are completely superposed, thus greatly improving the quality of the GF-4 satellite image.
5. Discussion
In this paper, it is shown that the proposed MSR-NCHOTV SRR method can significantly improve the spatial resolution and sharpness of images and provide abundant texture information and detail. Compared with the SRR results obtained using the BI method [
24], the sharpness and SNR of the five bands of the GF-4 satellite images obtained using the MSR-NCHOTV SRR method were higher by an average of 39.54% and 51.52%, respectively. Compared with the SRR results obtained using the POCS method [
25], the sharpness and SNR of the five bands of the GF-4 satellite images obtained using the MSR-NCHOTV SRR method were higher by an average of 11.86% and 43.63%, respectively. Compared with the SRR results obtained using the IBP method [
26], the sharpness and SNR of the five bands of the GF-4 satellite images obtained using the MSR-NCHOTV SRR method were higher by an average of 18.00% and 40.27%, respectively (
Table 10).
Table 11 shows a comparison of the OA and Kappa coefficient values of the classification results obtained using the two groups of experimental data, in which the values are expressed as percentage differences relative to the values obtained for the classification based on the image obtained using the BI method [
24]. As shown in
Table 11, for both experiments, the average OA and Kappa coefficient values obtained using the MSR-NCHOTV method are higher than those obtained using the POCS method [
25] and IBP method [
26]; compared to the values obtained using the BI method [
24], the average values of OA and the Kappa coefficient obtained using the MSR-NCHOTV method are higher by 32.20% and 46.14%, respectively.
From the above comparative analysis, it can be seen that the sharpness and SNR of the SRR satellite image, and the OA and Kappa coefficient of the classification results, obtained using the MSR-NCHOTV method are all higher than those obtained using the BI, IBP, or POCS methods. The MSR-NCHOTV SRR method has obvious advantages in eliminating step artifacts and maintaining image texture details. Moreover, the method is not restricted by the image category, the image classification method, or the hardware environment. Therefore, the MSR-NCHOTV SRR method has potential as a preprocessing stage for multispectral image classification and can not only improve the spatial resolution of the images but can also generate more abundant information and higher-quality super-resolution images. This method can also be used for the deblurring, feature extraction, fusion, and denoising of remote sensing images. However, the MSR-NCHOTV SRR method is more complicated in parameter selection in the calculation process, and requires multiple iterations in the calculation process, which reduces the operational efficiency of the algorithm. In the future, we will optimize and improve the MSR-NCHOTV SRR method in order to improve the robustness of the algorithm, reduce the computational load, and shorten the operational time, and we will further improve the operational efficiency of the algorithm by using a faster GPU. Additionally, we will also apply this method to the k nearest neighbors (KNN) [
48], multi-layer extreme learning machine-based autoencoder (MLELM-AE) [
49], and fuzziness and spectral angle mapper-based active learning (FSAM-AL) [
50] classification methods, and we will furthermore use the method to achieve the SRR of remote sensing videos.