6.2. Models Based on Single Date/Incidence Angle SAR Data
No models based on single SAR images achieved acceptable accuracies (≥∼80%) for all classes (
Figure 3), though in many cases, user’s and producer’s accuracies were higher for models constructed with QP compared to CP data. This has been observed in other studies [
19,
32], and is attributable to the fact that the QP data contain more target information, and less noise. In most cases, accuracies were also higher for models constructed with medium compared to high resolution CP data, which is to be expected as the latter contain more noise. However, often the highest accuracy of the model based on high resolution CP data was close to or exceeded the lowest accuracy of the model based on medium resolution CP data (i.e., when considering all six simulations). Thus, it has been demonstrated that multiple simulations may be required to provide confidence in the reliability of the results.
Accuracies were often higher with high resolution CP data because of the combined effects of the quality of training/validation data (i.e., few samples in some cases), application of the NESZ pattern (random values, non-repeating between simulations), and low class separability. Specifically, for a given class, the addition of noise shifted a varying proportion of pixels either outside or within a multivariate feature space that was separable from others. This effect was also more obvious for shallow water, which had few training/validation samples, and low separability with classes like agriculture/non-forest. At least some of this variability is also attributable to the way RF randomly selects training data, and predictor variables used for node splitting though this effect was minimized by taking the mode prediction of 10 runs.
For most classes, there was less than 10% difference in the user’s and producer’s accuracies between the three sets of simulated high or three sets of medium resolution CP data. However, for forest and shallow water accuracies differed by up to 13% and 16%, respectively. However, all three simulations of high or three simulations of medium resolution data tended to misclassify the same classes (albeit at different rates), and per class accuracies were generally either relatively high or low (> or <∼80%). Therefore, only results for the first simulations are shown in
Figure 3, while the following is applicable to and references results for all simulations.
Low accuracies were observed for shallow water (
Figure 3), mostly because of confusion with agriculture/non-forest. This is unsurprising since in many cases both classes were observed in the field to exhibit similar surface roughness conditions. To a lesser extent, shallow water was also confused with water and marsh. This is likely because vegetation density was low in some areas (thus surface water was dominant), and because some places contained a high proportion of cattails (dominant in marshes). Models constructed with QP and CP data misclassified shallow water and agriculture/non-forest about the same number of times (i.e., 18–24 misclassifications), although the latter confused shallow water, water, and marsh more often.
For marsh high accuracies (
Figure 3) were only achieved for models constructed with the QP FQ17W summer image. In contrast, accuracies were low for all models (QP and CP) based on the FQ17W spring image due to confusion between marsh and swamp. This is sensible since both classes exhibited similar backscatter characteristics in the spring (i.e., high total power attributable to double bounce and volume scattering). In the summer however, double bounce decreased more in swamps due to the leafing out of the canopy (i.e., mean values decreased 12.0 and 4.4 dB for swamp and marsh, respectively), thus making them more separable. On the other hand, accuracies for marsh were low with all FQ5W images, mostly due to confusion with agriculture/non-forest, forest, and swamp (82 and 97–102 misclassifications with QP and CP spring data, respectively; and 44 and 58–69 misclassifications with QP and CP summer data, respectively).
For swamp accuracies were high (
Figure 3) for all FQ5W spring images, regardless of polarization and NESZ value. With the FQ17W spring image, lower but still acceptable accuracies were also achieved, though just with QP image. This decrease in accuracy with the FQ17W images is attributable to the fact that at shallower angles, the path length between the sensor and saturated soil/water surface is greater, increasing the number of features with which the signal interacts, and resulting in increasingly similar backscattering characteristics between flooded and non-flooded areas due to greater signal attenuation, volume and/or multiple scattering [
40,
41]. Notably, the lower information content and higher NESZ of the CP data appears to have compounded the effects of incidence angle, since no models based on CP FQ17W spring image achieved acceptable accuracies. Accuracies for swamp were low for all models based on summer images, since the leafing out of the canopy similarly resulted in dominant signal attenuation and volume/multiple scattering that also increased confusion with forest, especially at shallower angles (72–91 versus 47–56 misclassifications for all QP and CP models based on the FQ17W and FQ5W summer images, respectively). Thus, as was observed for marsh, the separability of swamp from other classes was similarly affected by the acquisition timing, incidence angle, polarization, and NESZ value of the SAR image.
Accuracies were high for water in all cases, indicating that this class may be less sensitive to the differences between QP and CP data. For forest, only the model constructed with the QP FQ17W spring image achieved acceptable accuracies, as this class was similarly confused with swamp in the summer. Agricultural/non-forest was only accurately classified by models based on FQ17W spring data (both QP and CP data).
6.3. Models Based on Multi-Angle/Multi-Temporal SAR Data
Multiple models based on two or four QP images achieved high accuracies (≥∼80%) for all classes, demonstrating the benefit of multi-temporal fusion for inventorying and monitoring wetlands (
Figure 4). This increase in accuracy (relative to single SAR images) is due to multiple class pairs having similar backscatter characteristics in one, but not both seasons. For example,
Figure 5 shows that for marsh and swamp the proportion of surface, double bounce, and volume scattering (Freeman–Durden or
m-chi decomposition) was similar in the spring, but differed in the summer. However, values were also similar between swamp and forest in the summer, while differing in the spring. Thus, to separate all classes required information from both seasons. These results are consistent with others that have similarly demonstrated the benefit of multi-temporal SAR data for classifying wetlands [
21,
29]. On the other hand, multi-angle data alone appears to have been less effective for separating these classes, as accuracies were lower for models based on both spring, and especially both summer images (
Figure 4). Accuracies were also lower with both FQ5W images, demonstrating preference for having at least one image acquired at a relatively shallow angle.
With CP data, only models based on all four images achieved acceptable accuracies for all classes and all simulations, demonstrating the need for more diverse sources of information to achieve high accuracies (multi-angle and multi-temporal) (
Table A1). As was observed with single SAR images, accuracies were generally higher with medium compared to high resolution CP data, although, again, the highest accuracy among the three simulations of the high resolution data often equalled or exceeded the lowest accuracy among the three simulations of the medium resolution data. For all classes except shallow water though, differences in accuracy between the high and medium resolution CP data never exceeded 9%, and in a majority of cases the difference was less than 5%. Therefore, for these classes, there may be little benefit to acquiring medium resolution CP data at 16 m, compared to the high resolution data at 5 m, especially since both will have the same swath width (30 km).
For shallow water though more models achieved acceptable accuracies (≥∼80%), and accuracies were up to 21% higher with medium versus high resolution CP data (
Table A1). However, the effects of spatial resolution similarly need to be evaluated, since this class can occupy relatively small areas, as the presence and density of vegetation can be spatially variable. Notably, because accuracies varied between simulations, so too did observed differences between models based on high or medium resolution data. For example, with the FQ5W spring and FQ17W summer models, user’s and producer’s accuracies for shallow water differed by as much as 9% and 21% (third simulation of high versus first simulation of medium), to as little as 6% and 11% (second simulation of high versus third simulation of medium). Again, this demonstrates that multiple simulations are required to provide confidence in the reliability of results.
Shallow water was also the only class for which the difference in accuracies between the three simulations of high or three simulations of medium resolution CP data exceeded 10% (maximum observed difference was 15%, although for some models differences were as low as 4%). For other classes, user’s and producer’s accuracies differed by 5% or less in a majority of cases, and, again, all simulations of high or medium resolution data generally misclassified the same classes, and accuracies tended to be relatively high or low (> or <∼80%). Therefore,
Figure 4 and
Figure 5 only show results for the first simulations, although all are referenced in the following section (
Table A1).
With the exception of models based on both summer images, all QP and medium resolution CP datasets accurately classified shallow water (user’s and producer’s accuracies ≥∼80%). With high resolution CP data, however, only models based on all images achieved acceptable accuracies for all simulations (
Table A1). It is worth mentioning that the low accuracies for models constructed with just summer data were again due mostly to confusion with agriculture/non-forest, and that, despite the combination of two angles, the number of misclassifications remained the same as models based on single SAR images (16–23 with both QP and CP data). This demonstrates the value of multi-temporal data for separating these classes.
Marsh was accurately classified (user’s and producer’s accuracies ≥∼80%) by all models based on QP data, except those constructed with just spring images, or the FQ5W spring and summer images. For the model based on spring images, accuracies were low primarily due to confusion with swamp and forest (13 and 20 misclassifications, respectively), while with the FQ5W spring and summer images confusion was mostly with forest and agriculture (10 and 16 misclassifications, respectively). Conversely, only models based on all four high or medium resolution CP images achieved acceptable accuracies for all simulations, although for multiple configurations accuracies were close to acceptable, or were acceptable for some but not all simulations.
Swamp was accurately classified by all combinations of QP and CP data (user’s and producer’s accuracies ≥84%), except when models were constructed with just summer images. For the latter, this was mostly due to confusion between swamp and forest (i.e., 57 and 50–62 misclassifications with QP and CP data, respectively), and swamp and agriculture/non-forest (i.e., 15 and 29–40 misclassifications with QP and CP data, respectively). Thus, it has been demonstrated that the availability of at least one spring image is critical for the accurate classification of swamp.
All models constructed with multi-angle/multi-temporal SAR data accurately classified water (user’s and producer’s accuracies ≥94%). On the other hand, accuracies were only high for forest with models based on QP data (user’s and producer’s accuracies ≥79%), except when constructed with the FQ5W spring and FQ5W summer imagery or the FQ5W summer and FQ17W summer images (user’s and producer’s accuracies equalled 64–78% and 48–78%, respectively). With the CP data, accuracies were only consistently high for forest when all four images were combined. Agriculture/non-forest was accurately classified by all models, except those based on CP data acquired just in the summer.
6.4. Models Based on Single SAR Images and High Spatial Resolution DEM/DSM Data
For all models constructed with single SAR images, overall accuracies increased significantly (McNemar’s test statistic; 95% confidence interval [
57]) following the addition of the high spatial resolution DEM and DSM data (results for first simulation shown in
Figure 6; all results provided in
Table A2). It is particularly notable that, for wetlands, independent overall accuracies increased by 19–39%. Despite this, accuracies were still relatively low for some classes, and only those constructed with the QP FQ5W spring, QP FQ17W spring, or CP FQ17W spring images (combined with the DEM and DSM) achieved acceptable accuracies (≥∼80%) for all classes. Accuracies were also relatively high for models constructed with the CP FQ5W spring images, DEM and DSM data, although producer’s accuracies were low in most cases for forest (ranged from 73–80%). Interestingly, the degree to which accuracies increased varied between classes and SAR images, with little to no change being observed in some cases (e.g., with QP FQ5W spring data user’s and producer’s accuracies for swamp only increased by 6% and 2% with the addition of the DEM and DSM data).
Compared to models constructed with just the single SAR images, user’s and producer’s accuracies differed less between QP and CP datasets when the DEM and DSM data was included (i.e., up to 14% and in a majority of cases <3% difference between models based on either QP, DEM and DSM data or CP, DEM and DSM data, compared to up to 55% and in a majority of cases > 10% difference between models based on just QP or CP data). There was also less of a difference between models constructed with high or medium resolution CP data (i.e., maximum of 7% and 16% when classified with, and without the DEM and DSM, respectively). This demonstrates that the DEM and DSM compensated both for the loss in information content and higher NESZ values of the CP compared to the QP data, as well as the higher NESZ value of the high compared to medium resolution CP data. With RCM then, DEM and DSM data are expected to become increasingly important as complementary data sources for wetland mapping and monitoring.
For shallow water, the addition of the DEM and DSM proved critical in improving the separability of shallow water and agriculture/non-forest, decreasing the number of misclassifications from 18–24 to 0–4 (for all QP and CP models). This is because, while both features exhibited similar backscatter characteristics, they are also located at different topographic positions (mean elevation of shallow water and agriculture non-forest is 74.4 and 93.3 m, respectively). For models based on CP data, the DEM and DSM also reduced confusion with water, and marsh (i.e., from 8–25 to 4–6 misclassifications for all models), while with QP data these classes were misclassified about the same number of times (3–8) regardless of whether DEM and DSM data was included.
Marsh was accurately classified by all models constructed with single SAR images, DEM and DSM data (user’s and producer’s accuracies ≥89%), and the range in accuracies between models was relatively low (89–95%) compared to those based on SAR data only (30–86%). This shows that these data again compensated for some of the observed differences in accuracy as a result of the timing, incidence angle, polarization and NESZ value of the SAR image. As an example, for models based on the QP FQ17W spring or QP FQ17W summer images, user’s and producer’s accuracies differed by less than 4% when the DEM and DSM was included, compared to 16% and 22% when models were constructed with just SAR data.
For swamp, user’s and producer’s accuracies were already relatively high for models constructed with images acquired in the spring, however the addition of DEM and DSM data resulted in more comparable accuracies between the FQ5W and FQ17W images compared to models based on either SAR image alone (i.e., user’s and producer’s accuracies differed by 1% and 5%, compared to 11% and 12%). This is because, for the FQ17W spring image, the DEM and DSM data reduced confusion between swamp and marsh (29 and 18–23 fewer misclassifications for models based on QP and CP data, respectively). For models based on SAR images acquired in the summer, accuracies for swamp remained low regardless of the addition of DEM and DSM data, as this did not reduce confusion with forest. This is because the DEM and DSM values for 96% of the 258 training and validation points for forest were distributed throughout the same range as values for swamp, with many for forest also being at low elevations. In light of this, it is expected that true bare Earth models and/or products with higher vertical accuracy could improve the separability between these classes.
Water was accurately classified by models constructed both with and without DEM and DSM information. Conversely, accuracies for forest remained low for models based on images acquired in the summer, because of confusion with swamp. Agriculture/non-forest was accurately classified by all models following the addition of DEM and DSM data.
6.5. Models Based on Multi-Angle/Multi-Temporal SAR Data and High Spatial Resolution DEM/DSM Data
Compared to single SAR images, the addition of the DEM and DSM to models based on multi-angle/temporal SAR data had less of an effect on the overall accuracies of wetlands in some cases, which increased by 1–21% (results for first simulation shown in
Figure 7; all results provided in
Table A3). With QP data, accuracies only increased significantly (McNemar’s test statistic; 95% confidence interval [
57]) for models based on both spring images, both summer images, and the FQ5W summer and FQ17W spring images. With CP data, however, the DEM and DSM provided additional, relevant information in a majority of cases. As a result, accuracies increased significantly for all models, except those constructed with the FQ5W spring and FQ17W summer images (one of six simulations only), both summer images (four of six simulations), or all four SAR images (all simulations).
All QP and CP models also achieved acceptable accuracies (≥80%) for all classes, with the exception of those based on the FQ5W spring and summer images, or both summer images, due to low accuracies for forest, and low accuracies for swamp, and forest, respectively. Notably, multiple configurations of just two QP or CP images, DEM and DSM data, achieved approximately the same accuracies as models based on all four QP or CP images. In contrast, when only SAR data were included as inputs, fewer models based on QP data achieved acceptable accuracies for all classes, and, with CP data, only the combination of all four images achieved acceptable accuracies for all classes and simulations. Again, this demonstrates the importance of the DEM and DSM data in achieving acceptable accuracies with CP data, but that with QP data high accuracies are possible with just multi-angle/temporal SAR data.
As was observed with single SAR images, user’s and producer’s accuracies again differed less between QP and CP models when the DEM and DSM data were included (i.e., up to 15% and in a majority of cases < 4% difference between models based multi-angle/temporal SAR, DEM and DSM data, compared to up to 36% and in a majority of cases > 7% difference between models based on just SAR data). This difference is demonstrated in
Figure 8, which shows the indepednent overall accuracies of wetlands for models based on QP or the first simulation of high resolution CP imagery, classified both with and without the DEM and DSM data.
Differences in the user’s and producer’s accuracies between models based on high or medium resolution CP data were also lower (maximum of 11% compared to maximum of 21% difference when classified with and without the DEM and DSM, respectively). This again shows that these data compensated for some of the difference in information content between the QP and CP data, and between the NESZ value of the high compared to medium resolution CP data. It is worth mentioning that, between simulations of high and medium resolution data, user’s and producer’s accuracies differed by a maximum of 10%, and, in most cases, differences were less than 3%, thus only results for the first are provided in
Figure 7 and
Figure 8, while all are referenced in the subsequent section (
Table A3).
The addition of the DEM and DSM was critical for improving accuracies for shallow water for models based on both summer images (i.e., user’s and producer’s accuracies increased from 23–96% to 88–92%). Further, while lower accuracies were often observed for shallow water with multi-angle/temporal high resolution CP data, all models accurately classified shallow water following the addition of the DEM and DSM data. Thus, this information may prove critical in cases where only high resolution data are available.
Addition of the DEM and DSM data also improved accuracies for marsh. With QP data, the DEM and DSM data were necessary for achieving acceptable accuracies for models based on images acquired just in the spring, and the FQ5W spring and summer images. With CP data, accuracies for marsh were acceptable (≥88%) for all models and all simulations with the DEM and DSM data, whereas with just SAR data, only models based on all four high or all four medium resolution images achieved acceptable accuracies for all simulations.
For swamp accuracies were high for all models based on multi-angle/temporal SAR, DEM and DSM data, except those based on just the two summer images. Notably, user’s and producer’s accuracies did increase (by 8% and 10%, respectively) following the inclusion of the DEM and DSM information, but remained relatively low (74% and 60%, respectively) due to confusion with forest.
Water, again, was accurately classified by all models, with user’s and producer’s accuracies ≥96%. Accuracies for forest, on the other hand, improved for a number of models, though remained low for those based on both FQ5W images, and both SAR images acquired in the summer. Agriculture/non-forest was accurately classified by all models, including those based on both summer images, for which accuracies were low when based just on SAR data.