4.1. Within-Method Variability in the Measurements
Previous studies have mainly focused on evaluating the accuracy in terms of systematic errors or differences between measurements methods [
15,
16,
17]. Variability in repeated measurements is equally important, because it directly relates to the required number of measurements: The larger the variability in the measurements, the larger the number of measurements is needed to determine mean spectra reliably. Based on our results on standard deviation (
Figure 2), it is possible to deduce the standard error of the mean for each method when the number of measurements is known. It can be seen that with any of the methods ten measurements is enough to obtain a standard error of mean less than 0.01 units. There were some differences between methods, discussed below, but generally it appears that all methods are suitable for precise spectral measurements, and using fast methods does not necessarily increase the variability in the measurements.
It is interesting that we observed no obvious differences in the standard deviations of measurements between SIS and DIS. DIS is notably faster in operation, yet the number of measurements can be maintained at about the same level. It should be noted that in order to operate DIS fast, the integration time had to be shortened from the optimum. This increased random noise at the low and high ends of the spectrum, but in most of the wavelengths the standard deviation did not increase (
Figure 3).
LC, representing contact probe types of measurements, showed the highest standard deviation among the methods. Due to the design of LC, the white reference had to be manually held in correct position during the measurement, which could have contributed to the results; the view and illumination angles in the white reference measurement easily varied between measurements. However, we observed similar differences between methods even when analyzing the raw sample measurement readings (data not shown). Assuming that the output power of the stabilized light source did not fluctuate, this indicates that the errors in white reference measurement are not the main reason for larger variability in measurements of LC compared to the other methods.
Intuitively, the standard deviation should depend on the size of the illuminated spot on the leaf, or the size of the sample port, because if larger area is sampled per measurement, there should be less variability between measurements due to spatial averaging. Contrary to our expectations, standard deviation of the measurements did not seem to depend clearly on the size of the illuminated spot on the leaf. LC illuminated the smallest area and had also the highest standard deviation. However, there was no clear difference between SIS and DIS, although SIS illuminates approximately a four times larger area. More research is needed on the variability of leaf optical properties at different scales: within leaves, between leaves of the same individual, and between plant individuals grown in similar or different conditions to help in planning optimal sampling designs for collection of spectral libraries of plants. To this end, the fast methods tested here are very useful, because they allow a large number of measurements to be performed in a relatively short time.
It should be noted that the standard deviation of measurements in DIS increased when measuring the maple leaves. Due to the shape and large size of the maple leaf and the design of DIS, the measurements with DIS had to be performed closer to the leaf tip than with the other methods. The leaf tip has more veins and possibly other irregularities in the leaf properties that may introduce extra variation in the measurements.
4.2. Systematic Differences between Methods
Based on our results, it is clear that there are systematic differences between the measurement methods. Direct implication of this is that particular care must be taken when comparing or combining measurements acquired with different methods. Systematic errors in leaf-level spectra can lead to respective over- or underestimation of canopy level reflectance when the leaf spectra are used as input in radiative transfer models. This problem could be somewhat mitigated by applying empirical or physically-based corrections to the measurements, or alternatively, adjusting the models to handle different types of input data. However, the differences between measurement methods depended on target (e.g., plant species). This indicates that no correction can fully remove the differences, unless information on target is included in the correction procedure. This could be very difficult in practice, since it would require measurement of target-specific empirical correction factors. The effects of target and wavelength are probably related to differences in spectral bi-directional reflectance distribution functions (BRDFs) of different targets. In the case of LC, the influence of BRDF is obvious since the measurement is performed in hotspot geometry, but also the integrating spheres may not integrate over the entire hemisphere, because there are walls around the sample port. To further complicate the matter, the optical properties of green leaves depend on incident irradiation and the duration of exposure to measurement light (e.g., [
9,
10]) which vary between methods. These changes are difficult to quantify and depend on the seasonally varying biochemical composition of the leaf as well as leaf’s growing environment. Nevertheless, general differences between methods are evident and easily summarized: Small difference in reflectance but large difference in transmittance between SIS and DIS, and overestimation of reflectance by LC.
Although the differences in reflectance between SIS and DIS were statistically significant for most of the wavelengths, they can be considered relatively small. The maximum absolute difference did not exceed 0.027. In relative terms, large differences (up to 22%) were observed for maple leaves in the visible wavelengths. However, the differences were not statistically significant for leaves in the visible region (
Appendix B,
Table B2). The differences of our measurements against the calibrated values of the reflectance standards were larger than differences between SIS and DIS, and the differences between SIS and DIS were also comparable to those observed by Lukeš et al. [
16] between different integrating spheres. Thus, the differences in reflectance between SIS and DIS approached general level of accuracy observed in laboratory integrating spheres. However, it should be noted that the differences tended to be largest for leaves and smallest for gray-level standards (
Table 4), indicating that there are BRDF effects.
The difference in transmittance measured by SIS and DIS was considerable. The average relative difference was 14%, with DIS producing smaller values of transmittance. The most likely explanation for the difference is the attenuation of the light as it passes through the sample in the sample port of DIS. The sample port is surrounded by black walls (approximately 1 mm thick) that may absorb some of the diffusely scattered radiation from the sample or diffuse radiation entering the sample from either of the spheres. There may also be lateral losses that depend on the leaf thickness [
15]. However, it is also possible that SIS overestimated transmittance. We calculated the ratio of the white reference signal of transmittance (empty port) measurement to the white reference signal of reflectance measurement, and noticed that white reference for transmittance was 92–96% of the white reference for reflectance. In theory (in an ideal sphere with identical sample illumination), they should be equal. The detected attenuation of white reference can indicate an overestimation of transmittance. Several explanations can be given for the differences in the white reference signals: (1) The light source was not perfectly collimated with some inevitable stray light. (2) The RTS-3ZC is designed to have a different distance between the sample and light source for reflectance and transmittance measurements causing differences in illumination irradiance. (3) There is potentially multiple scattering of radiation between the sample and the sphere lamp in RTS-3ZC [
19], which may also result in transmittance overestimation.
We did not evaluate the measured transmittance values in absolute terms, because we did not have a calibrated transmittance standard available. An indication of the correctness of absorption values can be obtained by comparing to earlier measurements of absorption. Minimum NIR absorption was 0.07 for both birch and maple in DIS, and 0.01 for birch and 0.02 for maple in SIS. These values correspond to those measured earlier [
4,
15]. As discussed by Mõttus et al. [
15], the minimum NIR absorption in DIS is somewhat larger than values reported for the same species or genera in previous studies and it is thus evident that DIS overestimates absorption. However, the values with SIS are close to zero, and some studies have even reported negative absorption [
16], which indicates that slight overestimation of transmittance (and thus absorption) by SIS is also possible. We observed slightly negative minimum NIR absorption (−0.002) for SIS in case of white paper.
LC systematically overestimated leaf reflectance. It is known that leaf spectral BRDFs differ strongly from Lambertian [
30] and that the measured reflectance factors therefore depend on view and illumination geometries. The reflectance factor of Spectralon is also dependent on view and illumination geometries [
31,
32]. Both Spectralon and leaves have increased reflectance in the hotspot direction [
33,
34,
35], in which LC measures. It may be that compared to leaves, Spectralon reflects more diffusely and has a smaller peak in reflectance close to hotspot. This would explain the observed overestimation of leaf reflectance. Reflectance contributions originating from areas outside the leaf and multiple scattering from background (i.e., transmitted through the sample, reflected by background and transmitted again) explain only a small portion of the difference (see
Section 2.2.3). Previous comparison between ASD contact probe and integrating spheres observed slightly smaller mean absolute differences than we did, indicating that factors such as the measurement geometry and design of the probe may influence the comparison [
18]. Because leaf spectral BRDFs may vary between species [
30], a contact probe is probably not the best choice for comparing spectral differences between species. However, it is well suitable for monitoring relative changes in reflectance and thus physiological status of plants in field conditions, which is where it is usually applied [
8,
9,
10].
To illustrate the practical relevance of the differences, we computed spectral indices from the measured spectra (
Figure 6). Indices particularly sensitive to between-method differences were those that are designed to track small changes in spectra (such as NDWI). Indices relying on high contrast between two wavelengths were not as sensitive (such as NDVI). In addition to showing between-method differences, we demonstrated that the spectral indices were highly dependent on whether they were calculated from leaf reflectance or transmittance spectra, or from adaxial or abaxial side of the leaf. All four quantities contribute to canopy spectral reflectance, and ignoring any of them may exert a large influence when the leaf optical properties are upscaled to canopy level. In fact, differences between methods, although significant, tended to be not as large as the differences between reflectance and transmittance, or the differences between leaf sides.
4.3. Practical Applicability of the Methods
Based on our results, considerable savings in time can be achieved by applying the fast methods (DIS and LC), compared to using SIS in which the attachment of the leaf sample is slow and the lamp has to be detached and reattached between reflectance and transmittance measurements. The design of the sphere and integration time (dependent on power of the light source and transmissivity of the optical fibers) obviously affect the measurement time. Thus, our results apply only to the specific instruments tested here. However, they give a good overview about the speed of measurements obtainable with different types of methods. It should also be noted that the measurement time depends somewhat on the characteristics of the target. For example, measuring birch leaves was slower than measuring maple leaves with SIS, because attaching a birch leaf to SIS was more difficult than attaching a maple leaf due to the small size of the leaf in comparison to the sample holder.
There are also numerous other criteria that influence which of the methods is best suitable for a specific application. Firstly, ease of operation affects not only time but also how error-prone the measurements are. Based on our experience, attaching a leaf to the sample port of SIS requires particular care. It was difficult, particularly with small leaves, to avoid undulations in the leaf surface. The surface should be as smooth as possible to avoid errors in the measurement. The same problem was also recognized by Lukeš et al. [
16] who noted that particularly the transmittance measurement is prone to errors. This may, however, be a specific property of RTS-3ZC, not all single integrating spheres. When using DIS the leaf is easy to attach and the measurements less error-prone. The only difficulty was that the instrument operator cannot see the illuminated spot on the leaf, and therefore it is sometimes difficult to avoid measuring leaf veins or other irregularities. This may cause additional variation in the measurements. LC is the least error-prone what comes to attachment of the leaf. It is easy to see which area is measured, and the leaf is always firmly attached in the clip. An obvious source of error, however, comes from the difficulty of measuring the white reference, as described in
Section 4.1. This problem could probably be overcome by using a custom-made holder for the white reference.
With DIS and LC the measurements can only be performed from within a few centimeters from the edge of the leaf. Central areas of large leaves cannot be measured unless the leaf is cut, which in turn may induce physiological changes. SIS was more flexible to accommodate small and large leaves.
Ruggedness and portability are important if the methods are applied in field conditions. Both DIS and LC are mechanically durable with very few loose parts. In this sense they are clearly better than SIS when performing measurements in the field or in remote locations. In addition, they are easy to keep clean. In SIS, dust, litter, and other particles easily end up inside the sphere. What comes to portability, LC and DIS are small enough for one person to carry in the field, at least if they are connected to a spectrometer with an integrated light source as provided by the manufacturers. If operated together with an external light source as we did, the amount of equipment to carry is somewhat larger. SIS needs to be set up on a tripod or other stable surface, and it is also heavy to carry for long distances in e.g., forests.
An important thing to consider if measuring in field conditions is the movement of the fiber optic cables. In our measurements we did not excessively move the cables, but small movements were allowed with DIS and LC, when the leaf was attached to the measurement system. By taking repeated spectrometer readings of stable targets (empty DIS or white paper attached to LC) we verified that the additional variability introduced by not fixing the position of the cables was in the order of 1% or less (coefficient of variation) throughout all wavelengths. Thus, it did not considerably influence the results. However, moving the two meter long cable used with DIS for the whole of its length introduced even up to 10% variation in the measurement. However, the short bifurcated cable in LC was not as prone to errors.
4.4. Limitations and Future Directions
Our aim was to compare commercial off-the-shelf methods in a typical experimental setup in which the user needs to determine mean spectra of a given target accurately (e.g., leaves in the upper part of tree canopy in a certain site). The choice of our experimental setup had two consequences. First, the reported within-method variabilities (standard deviations) contain compound effects of the precision of the measurement system itself and sampling variance that occurs when measuring a spatially varying target. Further research should therefore be aimed at analyzing reasons for within-method variability in the measurements even in more detail, e.g., relations between sampling variance and illuminated spot size, variation in optical properties within and between leaves, and measurement precision of the methods when taking repeated measurements of the same spot on a same sample. Second, the illumination conditions (irradiance, duration of exposure) were not normalized between methods, which may have influenced the optical properties of plant leaves, particularly in certain wavelengths in the visible spectral region. However, we avoided analyzing spectral indices known to be particularly sensitive to illumination conditions [
36].
More extensive studies, containing e.g., larger variation of species, could elucidate species dependencies further, i.e., to what extent it is possible to remove between-method differences in spectra by applying general (species-independent) correction factors. Finally, as the accuracy requirements are highly application-dependent, targeted studies which focus on particular applications are needed to further demonstrate practical significance of the differences. These studies could also utilize radiative transfer models to demonstrate the effects of errors in leaf reflectance and transmittance measurements on modeled canopy reflectance.