1. Introduction
As one of the most biologically productive ecosystems on earth, wetlands are of significant importance for hydrological and ecological processes [
1]. They play a vital role in flood storage, water-quality improvement, shoreline protection, carbon sequestration, and, more importantly, a desirable habitat for both animals and plants [
2,
3,
4]. Unfortunately, worldwide wetlands, especially in coastal areas, are vulnerable to threats from nature and human influence, including shoreline erosion because of tides and storms, reservoir construction, and increasing urbanization [
3,
5,
6]. Therefore, accurate mapping of the coastal wetlands is essential for their scientific management and sustainable development.
Remote-sensing techniques benefit from their wide coverage and their regular and rapid monitoring capacity; they are recognized as labor-saving and low-cost techniques for wetland mapping and assessment [
7,
8,
9,
10]. However, due to the location of coastal wetlands at the joint zone between continent and sea, there are many mixed objects of anthropogenic wetlands and semi-natural regions which are usually fragmented, complicated, and heterogeneous [
11,
12]. On one hand, some land covers, such as sea, breeding aquatics, reservoirs, etc., show subtle differences in their spectral features. Distinguishing those similar wetland land covers is challenging. On the other hand, the land covers from the same wetland type present strong spectral heterogeneity because of the variances in water volume, salt content, vegetation density, and illumination conditions [
6,
13,
14,
15]. Consequently, characteristics such as high within-class variability and low between-class disparity make the classification of coastal wetlands a challenging task.
In order to improve classification accuracy of coastal wetlands, various feature extraction methods were introduced to increase the separability of different land covers. In early studies, the classification of coastal wetland mainly relied on the medium–low-resolution images, and the utilized feature extraction methods mainly focused on spectral information at pixel-level, such as the normalized difference vegetation index (NDVI), normalized difference water index (NDWI), land surface water index (LSWI), and so on [
4,
11,
16]. Although these methods can be used to separate land covers with different spectral characteristics, they are hardly applied to realize the refined classification of land cover using high-resolution remote-sensing images with several spectral bands and abundant spatial information [
17]. To model the spatial characteristics of different land covers, many spatial feature extraction methods based on a moving window, including gray-level co-occurrence matrix (GLCM), Gabor filtering, and Markov random field (MRF) were tested [
18,
19]. In this way, geometry and texture information can be employed to discriminate objects better [
20]. However, both pixel-based and moving window-based approaches need to predetermine object structures, while land covers in the real world usually are irregularly shaped [
21,
22]. In general, when the input data are highly correlated with nearby pixels, a smaller window cannot provide sufficient samples for characterizing the object of interest [
23,
24], whilst a larger one may cause intractable computational problems [
23,
25]. In recent years, morphological attribute profiles (APs) [
20,
24,
25,
26] were widely employed to model spatial features of land covers in remote-sensing datasets. In particular, extended multi-attribute profiles (EMAPs) [
27,
28,
29] could present multilevel analysis for imagery via sequentially applying morphological attribute filters that are able to characterize the information in different object structures. As described, the construction of EMAPs avoids the requirement for predefined image structures. Moreover, it can keep the geometrical traits for relevant areas and effectively attenuate unimportant details [
20,
27,
30], which is useful for preserving the boundaries of wetland regions and decreasing their within-class spectral and spatial variabilities.
Currently, machine learning is recognized as the most promising technique for quantitative information retrieval from remotely sensed images [
31]. A series of machine learning approaches were developed, such as maximum likelihood (ML) [
32], support vector machines (SVMs) [
10,
33], random forest (RF) [
34,
35], neural networks (NNs) [
36], and so on. Among the various machine learning methods, NN-based classifiers gain superiority in terms of robustness, high data error tolerance, and better classification performance [
36,
37,
38]. When handling a complex dataset, multilayer perceptron (MLP) neural networks [
39] are required, which feature more layers with a full connection between all neurons. As a typical nonparametric classifier, MLP is designed to learn the nonlinear features irrespective of their statistical properties, which is widely used in remotely sensed image processing [
40,
41,
42] including coastal wetland classification [
10,
37,
43]. Nevertheless, the estimation of parameters, such as weights and biases, in MLP is always a difficult task [
44,
45]. These parameters dramatically affect the trained model’s generalization capacity, as getting their optimal settings inevitably turns into a vital problem. In general, the error backpropagation algorithm (BP) [
2,
46] is utilized to train parameters weights and biases. This approach, on the other hand, easily falls into local optima and gets premature convergence [
38,
47]. In recent years, because of their promising self-organization and global optimization abilities, swarm intelligence algorithms, such as genetic algorithms (GAs) [
40,
48], particle swarm optimization (PSO) algorithms [
46,
49,
50], and differential evolution (DE) [
51] were successfully used to optimize the parameters of MLP.
Gravitational search algorithm (GSA) [
52,
53] is a recently developed swarm intelligence algorithm inspired by Newton’s law of gravity [
54]. In GSA, agents are recognized as celestial bodies and search for the optimal solution via interactive movements under gravitational force. These years, GSA is increasingly popular because of its simple structure, well-understood theory, easily implemented strategy, and so on [
55,
56]. Nevertheless, GSA still faces a premature convergence problem when processing complicated problems [
56,
57,
58,
59]. Thus, many GSA variants were proposed [
56,
60], including the stability-constrained adaptive alpha for the gravitational search algorithm (SCAA) [
61], in which the searching performance of GSA was improved by adaptively adjusting the important parameter alpha.
As the Sentinel-2 multispectral instrument (MSI) sensor simultaneously possesses rich spatial and spectral information, it shows enormous potential in characterizing wetland extents [
9,
62,
63,
64]. Researchers proposed many advanced techniques for mapping wetlands from Sentinel-2 images. Stratoulias et al. [
9] mapped lakeshore areas using the selected high-spatial-resolution (i.e., 10 m and 20 m) bands of Sentinel-2 imagery on the basis of hyperspectral data and the satellite’s spectral response function. Chatziantoniou et al. [
8] evaluated the performance of a synergistic utilization of Sentinel-1 and Sentinel-2 dataset for representing land use and land cover (LULC) of wetlands under SVMs. Therein, spectral features (gained from minimum noise fraction (MNF), principal component analysis (PCA)) and spatial features (GLCM texture, shape, and crop features) were extracted and employed on a complex wetland region. In Reference [
65], an approach combining pixel-based, index-based, and object-based classification technique was introduced, in which NDVI and NDWI were employed for the discrimination of the contents within the wetlands, and an object-based method was used to extract the boundaries of wetland types. However, few of the methods tried utilizing the spatial information of Sentinel-2 imagery in multilevel to model the scale-varying objects in wetlands.
The overarching aim of this study was to propose a novel classification framework for Sentinel-2 MSI imagery of complex coastal wetlands by developing an optimized multilayer perceptron classifier and applying it to the spectral–spatial features constructed by the multispectral features and the EMAPs. The contributions of this research can be summarized as follows:
(1) Wetland EMAPs feature extraction: Four attribute filters are introduced based on the characteristics of wetlands in each band from Sentinel-2 imagery to extract the EMAPs that model the wetland objects in multiscale and multilevel.
(2) A novel MLP classifier: The SCAA optimized MLP, which is denoted as SCAA_MLP, is proposed to select the most appropriate parameters of the MLP classifier. The superiority of SCAA in balancing exploration and exploitation can effectively promote the capability of MLP classifiers.
(3) Coastal wetland mapping: Through the application of SCAA_MLP classifier to the spectral and spatial features, accurate coastal wetland maps can be obtained.
The remainder of the paper is organized as follows:
Section 2 gives a detailed introduction of the proposed method.
Section 3 presents the studied areas and the experimental results. In
Section 4, a careful discussion of the proposed method with some future scope is provided. Finally,
Section 5 concludes the whole paper.
4. Discussion
Coastal wetland types in Setinel-2 remotely sensed imagery are highly complex with low within-class homogeneity and between-class heterogeneity. Merely relying on the spectral characteristics of wetlands results in massive misclassification and noise. As depicted in
Figure 9b and
Figure 10b, confusion appeared among spectral similar objects like river, reservoir pond, breeding aquatics, and sea, while severe noise existed in the heterogeneous wetland types like river and breeding aquatics. The boundary information between different wetland objects was also weakened with the blurred classification. These issues demonstrate the need for using effective spatial features of coastal wetlands. The proposed method in this paper incorporates EMAPs as a spatial characterization tool. The utilization of different attributes and dense filter threshold settings provides multilevel spatial description modeling various structural information in the irregularly shaped wetland types. Moreover, the attribute filter operation can preserve the geometrical characteristics of objects and attenuate unimportant details, for they do not need to predefine image structures. In this context, the utilization of EMAPs is helpful to decrease intra-class variability, improve inter-class variability, and delineate boundaries. This superiority was validated by the quantitative analysis (
Table 5 and
Table 6) and visual inspection (
Figure 9c and
Figure 10c), whereby spectrally similar wetland types (e.g., river, reservoir pond, sea, etc.) and greatly heterogeneous objects (e.g., breeding aquatics, sparse vegetation, and dry river channel areas) were correctly classified with less noise and accurate boundaries.
In MLP neural networks, the input of each neuron is firstly weighted and bias is added, and then it is handled by a nonlinear activation function. During this process, the performance of MLP significantly relies on the parameter weight and bias settings. However, traditional training approaches tend to fall into local optima and fail to obtain the optimal values for weights and biases [
38,
47]. Such limitations usually cause undesirable classification results; for instance, misclassification and confusion between the river, reservoir pond, breeding aquatics, and sea that are spectrally similar appeared in the coastal wetland classification maps of BP_MLP and GSA_MLP despite the employment of EMAP spatial features. Even for those correctly identified objects, some noise still existed, as presented in
Figure 7d,e and
Figure 8d,e. In this work, the intelligent optimization algorithm SCAA, which achieves superior balance between exploration and exploitation search to escape from local optima, was used to effectively set proper values of weights and biases in MLP. The classification results in
Table 3 and
Table 4 suggest that SCAA_MLP achieved desirable results in terms of OA and per-class classification accuracy. In addition, from
Figure 7f and
Figure 8f, SCAA_MLP achieved the smoothest classification maps on Jiaozhou Bay wetland and YRD wetland with precise boundary information and high geometric fidelity.
From the classification maps in
Section 4, it is clear that most coastal wetland types in Jiaozhou Bay and Yellow River Delta can be roughly divided into natural wetlands and artificial wetlands. In Jiaozhou Bay, the artificial wetlands, like some reservoir ponds, breeding aquatics, and salterns, are mainly found in the northern and northwestern regions with regular shapes. In the Yellow River Delta, artificial wetland types are mainly distributed on the northern, northeastern, and eastern coasts, in which paddy fields are identified in northern and central parts. Moreover, the Yellow River Delta coastal wetlands are characterized by extensive salinization, especially in the tidal flat, which leads to the misclassification with saltern. These factors indicate the strong relationship between regional industrial and agricultural development and land cover distribution in the coastal zone. As a special ecosystem, coastal wetland is vulnerable to alteration and shrinkage due to natural fluctuations and external disturbance. In a future study, a multitemporal remotely sensed dataset on Jiaozhou Bay wetland and YRD wetland will be utilized to analyze the dynamic change of different wetland types. Furthermore, the relative significance of natural and anthropogenic variables in the observed wetland changes will be assessed and evaluated to reveal the impact of environmental conditions on wetland change.