1. Introduction
Crop type separation is a crucial requirement for the planning [
1], short-term monitoring [
2], management [
3], high-throughput phenotyping [
4,
5,
6], and climate change modeling [
7] of agricultural areas. Many of these tasks need up-to-date information, in particular before the end of the growing season. These tasks require spatially explicit land cover maps. Nevertheless, this kind of information is usually reported by farmers only after the season and mostly at administrative units [
8]. Even when the reporting for subsidies takes place in a geographic information system, as is the case in Switzerland, the data are only entered after the season or even in the following year [
9]. Therefore, a more up-to-date assessment of crop status needs a different data collection source.
Remote sensing has proven its potential to deliver such information even before the end of season [
10]. Land cover crop maps are often based on satellite data because these platforms provide data sets multiple times per month and are suitable for both single-date or multitemporal classification tasks [
11,
12]. Moreover, several studies have demonstrated the ability to carry out early stage crop mapping [
1,
2,
13,
14]. However, due to fixed orbits and thus fixed observation dates, clouds can limit the amount of usable data. Furthermore, freely available data from Landsat or Sentinel satellites have coarse spatial resolution and therefore capture a high amount of mixed pixel information, especially in small structured farmlands. Hence, data sets of higher spatial resolution are necessary to acquire spectrally pure pixels that are needed in algorithm training [
10,
15,
16]. To date, data sets gained with spaceborne sensors at an appropriate spatial resolution are still expensive and rare.
Unmanned aerial vehicles (UAVs) deliver spatially very high resolution (VHR) data sets and a 3D point-cloud with a spatial resolution of only a few centimeters [
17]. UAV data creates new opportunities in agriculture [
18,
19,
20], e.g., assessment of plant health status [
21], water stress [
22], management techniques [
23], erosion of soils [
24], and detection of individual plants [
25], among many others. The advantages of VHR data sets compared to common satellite data sets have also been analyzed, e.g., for vegetation indices (VI) [
26,
27]. Further, UAVs have the ability to acquire data in a very flexible manner, e.g., by considering changing weather conditions, and they are cheap to operate compared to other carrier systems [
17]. However, less sophisticated sensors are usually mounted on UAVs, containing broader spectral bands that are difficult to spectrally and radiometrically calibrate [
28]. Consumer-grade cameras provide red, green and blue (RGB) wavelength bands, but they can be modified in a way to be able to acquire a near-infrared (NIR) band as well [
29]. Therefore, a NIR-RGB mosaic can be built using combined data from these two camera types.
In Switzerland, farmland is often structured in small plots [
30], and therefore VHR data sets are required to accurately discriminate different crops. To acquire VHR data sets for multiple dates, consumer-grade cameras with RGB and NIR-GB bands were mounted on a UAV. Uncalibrated data sets were used in this study to investigate the potential of a straightforward, user-friendly, and low-cost data acquisition and processing methodology for crop separation over a growing season. Due to the sensors’ limited spectral properties, the incorporation of additional information content through textural features is a promising option to improve the accuracy of crop separation [
31,
32].
Using approaches, in which all available remotely sensed data sets are jointly processed for crop classification purposes, the accuracy of crop separation has been found to increase [
1,
11,
12,
14]. In very fragmented agricultural systems, with very small fields, inhomogeneous cropping cycles and stringent crop rotation schemes, such as on the Swiss Plateau, sawing and harvesting dates of crops are subject to large temporal variations and arbitrarily changing patterns of bare vs. vegetated land cover. These properties are resulting in a decrease of the overall accuracy of crop separation by more than 20% (data not shown) on the Swiss Plateau when using a stacked approach.
Since crops show a changing spectral response due to the evolution of their phenological stage over the growing season, we hypothesize that crops can be separated with varying accuracy at different observation dates over the season. Hence, the aims of this study are to (i) determine an optimal temporal window during the growing season to separate all present crop types from each other, including textural features of an uncalibrated NIR-RGB data set, (ii) evaluate the potential of an additional NIR band compared to a standard RGB configuration, and (iii) assess the potential to discriminate a single crop type from all others on different observation dates.
3. Results
The accuracy results (i.e., AA and UA) can be found in
Table 3 for the two band settings (i.e., NIR-RGB and RGB, including spatial features of the two SE sizes). The OA, Kappa, and AR metrics are reported in
Table A1 in the
Appendix A, as well as the p-values of the significance tests (Wilcoxon signed rank test) between the observation dates for each of the two band settings (
Table A2 and
Table A3), and between the NIR-RGB and RGB settings (
Table A4).
3.1. Temporal Windows during the Growing Season
Based on the AA metric, three temporal windows during the growing season were identified, i.e., an early temporal window from 515 to 1232 AGDD, a mid-season temporal window from 1362 to 2016 AGDD, and a late temporal window from 2626 to 3238 AGDD (
Figure 2). The mid-season window shows the highest crop separation potential with an average AA value of 79.9% over the four observation dates for the NIR-RGB setting and an average AA value of 76.9% for the RGB setting. At 1556 AGDD, the maximum AA values of 82.0% and 78.3% are reached for the NIR-RGB setting and for the RGB setting, respectively (
Table 3).
The five observations in the early temporal window until 1232 AGDD contain slightly lower AA values. The average AA values in this time frame are 7.7% lower for the NIR-RGB setting compared to the mid-season temporal window and, correspondingly, 14.2% lower for the RGB setting. The lowest AA values are found in the late temporal window at 62.7% and 59.8% for the NIR-RGB and RGB settings, respectively.
For the NIR-RGB setting, the AA values within a given temporal window are fairly stable but vary significantly between the three temporal windows (
Table A2). In contrast, the AA values for the RGB setting are significantly less stable within both the early and late temporal windows. The AA value for the observation at 515 AGDD is not significantly different from the AA values of the mid-season, and the AA value for 1232 AGDD is not significantly different from the AA value at 3238 AGDD (
Table A3).
In the early temporal window, the accuracy of the crop discrimination slightly decreases over time by 4.4% for the NIR-RGB setting (
Table 3). The highest performance is achieved at 515 AGDD with an AA of 74.0%. The lowest performance is achieved at 989 AGDD with an AA of 69.6%. In case of the RGB setting, the AA values decrease by over 10% between the highest AA of 76.9% at 515 AGDD and the lowest AA of 66.3% at 1232 AGDD.
Regarding the NIR-RGB setting within the early temporal window, only the AA value at 848 AGDD is significantly different from the one at 1232 AGDD (
Table A2). For the RGB setting, the AA value for the observation at 515 AGDD is significantly higher than all other AA values in the early temporal window, whereas the AA value of the observation at 1232 AGDD is significantly lower than the AA values at 515, 635, and 848 AGDD (
Table A3).
In the mid-season temporal window, the highest AA values are achieved at 1556 AGDD with 82.0% in case of the NIR-RGB setting and 78.3% in case of the RGB setting (
Table 3). The lowest AA values are at 1723 AGDD with 78.4% in case of the NIR-RGB setting and at 2016 AGDD with 74.8% in case of the RGB setting. There are no significant differences between the AA values, except for the highest AA at 1556 AGDD that is significantly higher than the AA values at 1723 and 2016 AGDD for the NIR-RGB setting (
Table A2). For the RGB setting, the AA value at 2016 AGDD is significantly lower than the AA values at 1556 and 1723 AGDD (
Table A3). All other AA values in this temporal window are not significantly different from each other.
In the late temporal window, the AA values for both band settings are lower than the AA values in the other temporal windows (
Table 3), and there is no significant difference between the AA value at 2626 and 3238 AGDD in the case of the NIR-RGB setting (
Table A2). As for the RGB setting, the AA value at 3238 AGDD is significantly higher (by as much as 4.4%) than the AA value at 2626 AGDD (
Table A3).
3.2. Band Settings
Considering the two investigated band settings, only minor differences in the temporal evolution of the respective AA values are found over the growing season. In the early temporal window, a minimum AA value of 69.9% is reached at 989 AGDD for the NIR-RGB setting, and a minimum AA value of 66.3% at 1232 AGDD in case of the RGB setting. For both settings, the AA values increase significantly between 1232 and 1362 AGDD when they reach the mid-season temporal window. They then fall to low values of 2626 AGDD. Overall, the additional NIR band leads to significantly better results for crop separation in terms of AA values for all observation dates at a significance level of
p < 0.05, except for the observations at 515 and 989 AGDD (
Table A4).
3.3. Discrimination of Individual Crop Types
The temporal assessment of the ability to separate a single crop from the others is very similar for both band settings (
Figure 3). Rapeseed can be distinguished best in early season with a UA of over 90% until 635 AGDD. Afterwards, the UA remains above 70% until 1556 AGDD, when it decreases to a minimum of less than 20% at 2626 AGDD. On the contrary, maize can easily be discriminated after 1556 AGDD, with the UA increasing from under 75% to over 90%.
The discrimination of sugar beet is most effective at 515 AGDD with a UA of more than 85%. At 635 AGDD, its discrimination is most difficult with a UA of less than 55%. From 848 AGDD onwards, the UA lies between 60% and 76% except for in the case of the RGB setting, when the UA is 50.5% at 1232 AGDD.
Cereals can be distinguished with a UA between 78% and 90% from the beginning of the analyzed AGDD until 1723 AGDD. At 2016 AGDD, the UA is highest with 94.9% for the NIR-RGB setting and 91% for the RGB setting. At 2626 AGDD, the UA is around 51%, and for the last observation date it drops to 7%.
The UA of grassland is under 60% at 515 AGDD, but increases to a range of 80% to 94% from 635 AGDD onwards. On the final observation date, the UA is highest at 98%.
Bare soil can best be discriminated at 1362 and 2016 AGDD with UA values between 65% and 70%. Earlier, the UA is around 55%; it is only higher at 848 AGDD, reaching 65% for the NIR-RGB setting and 61% for the RGB setting. The UA is under 50% for the last two observation dates.
4. Discussion
4.1. Temporal Windows During the Growing Season
The phenological stage of crops is essential to discriminate them from each other in remote sensing data analysis. When considering all of the crops in the study area, the accuracy of crop separation is highest in the mid-season temporal window between 1362 and 2016 AGDD but accuracy reaches a maximum at 1556 AGDD. This is in line with other studies that report best accuracies for crop separation with data sets acquired in July [
2,
40,
41]. At this time of the growing season, the winter crops are in a senescence stage and the summer crops in their most productive stage.
In the early temporal window, at the beginning of the growing season, maize and sugar beet are gradually sown in. Since their seedlings are difficult to distinguish [
40] and can thus be mixed up with pixels from bare soil fields, the AA values are lower than in the subsequent mid-season temporal window. The slight decrease in AA values in the early temporal window is not of statistical significance.
Apart from a few exceptions, crop separation accuracies within a temporal window are not significantly different from the other observations in the same temporal window. This holds true for the two tested band settings and their textural features of the UAV data sets. Consequently, any observation of crops within a given temporal window will yield a similar, not significantly different AA.
4.2. Band Settings
An additional NIR band leads to significantly higher crop separation accuracies for most of the observation dates, as reported in several other studies [
42,
43,
44]. Only at 515 AGDD does the AA value of the RGB setting become higher than in the case of the NIR-RGB setting. Since this difference is not significant, crop separation accuracy with an additional NIR band is generally higher, or at least equal to an RGB setting.
In land use/land cover classifications that are based on textural features, the applied SE sizes play a crucial role [
32]. Differing spectral signatures resulting from the interplay of plant material and soil background largely define textural information in agricultural fields. Consequently, when the crop canopy is closed, the spectral, rather than the textural, properties gain in importance [
45] and therefore, an additional NIR band can lead to an improvement in crop separation [
43].
4.3. Discrimination of Individual Crop Types
With regard to the separation of individual crop species, their own respective phenological stages are crucial. Rapeseed, for example, produces yellow flowers up to 635 AGDD and can therefore be distinguished from all other crops with high precision, as it is the only investigated crop with this specific color. The discrimination of cereals in general, being the other winter crops in the study area, is best straight after harvesting at 2016 AGDD when the signal is dominated by the crop residuals and bare soil, and summer crops (i.e., sugar beet and maize) are in their most productive stage.
The differences between individual summer crops get more pronounced with phenological development; therefore, the UA values of maize and sugar beet increase with generally increasing AGDDs. As reported by [
46], the UA of maize increases by over 50% between the first discrimination, when maize fields are freshly sown in at 515 AGDD, and before harvesting at 2626 AGDD, when all other fields are either bare soil or covered by green grassland or sugar beet.
Sugar beet is the first sown summer crop in the study area and can be accurately separated at 515 AGDD, as the spectral signal is dominated by bare soil and therefore is distinguishable from the other present crops (i.e., winter crops or grassland). At the following observation date (i.e., 635 AGDD), differentiation between maize, sugar beet and bare soil fields becomes more difficult, since the spectral signal of all three classes at this phenological stage is dominated by the signal of bare soil and indistinguishable seedlings [
40].
Overall, the UA value of grassland is highest after 3238 AGDD, since late in the season it is the only crop well distinguishable from bare soil, maize (appearing brown in its final phenological stages), and sugar beet. In addition, grassland covers most of the area in late season so, consequently, the OA is also high (
Table A1). Nevertheless, the AA metric, which gives equal weight to the UA values of all present crop types, is low because the other crops can no longer be distinguished accurately from each other.
Finally, bare soil can most accurately be separated from crops when their plant materials cover a large part of the area and therefore dominate the measured signal. It is particularly difficult to distinguish fields under preparation (e.g., plowed) from fields containing small seedlings. The discrimination of cereals shortly after harvesting is successful since the textural appearance of crop residuals and bare soil is different. Therefore, discrimination of bare soil is most successful at 1362 and 2016 AGGD.
4.4. Limitations and Outlook
Crop types present in an agricultural area influence the resulting separation accuracy. The more diverse they are in terms of morphology and physiology, the more accurately they can be discriminated. Consequently, it is more challenging to separate between different winter crops or between different summer crops than to differentiate winter crops from summer crops. In particular, further separation of crops that have been combined into one class in this study (e.g., cereals like winter wheat, spelt and winter barley) remains to be investigated. Moreover, the optimal temporal windows analyzed here may vary depending on the specific crop types present in an agricultural area, and the behavior between the windows must be further examined.
As shown in a range of other studies, multitemporal data sets improve the accuracies of crop classification tasks [
41,
47]. Therefore, it would be interesting to investigate the potential of combining data from different acquisition dates to find the most promising temporal combinations for crop separation. Due to the fact that observations with optical sensors are subject to limitations (e.g., cloudy conditions), combined analysis of data acquired in the early and mid-season temporal windows could be promising.
5. Conclusions
This paper presents a methodology to separate agricultural crops based on data sets acquired with uncalibrated consumer-grade cameras mounted on a UAV over the course of 11 observation dates between 5 May 2015 (515 AGDD) and 29 September 2015 (3238 AGDD). The obtained four bands data sets, consisting of a NIR, red, green, and blue band, were extended with textural features to differentiate cereals, grassland, maize, rapeseed and sugar beet, and, if present, bare soil, based on an RF approach.
Three temporal windows across the growing season were identified where the accuracies of crop separation between the observation dates were not significantly different. The first temporal window ranged from 515 AGDD (May 5) to 1232 AGDD (June 17), with an AA for separation between 70% and 75%. A mid-season temporal window between 1362 AGDD (June 25) and 2016 AGDD (July 22) proved to be the most optimal for crop separation with AA values around 80%. Observations after 2626 AGDD (August 21) fell into the third group with an AA of under 65%.
An additional NIR band leads to significantly higher separation results compared to a pure RGB setting, thanks to an extended capability to spectrally discriminate the plant materials of the various crops. The accuracy of the separation of single crops varies over the course of the observed period. Winter crops can be discriminated from summer crops more accurately in the early season, whereas the accuracy of single summer crop separation from other summer crops is highest during their most productive phenological stage in the mid-season temporal window.
Overall, this paper concludes that crop separation based on uncalibrated NIR-RGB data sets in a highly fragmented and small structured agricultural area like the Swiss Plateau is most accurate between 1362 AGDD and 2016 AGDD over the investigated period.