3.1. Error Characteristics of IMERG V6 and V7
3.1.1. Characterization of Precipitation Amount Errors
Figure 2 presents the spatial distribution of daily precipitation amounts for meteorological stations, IMERG_V06, and IMERG_V07 across different seasons. From the figure, it can be observed that the spatial-temporal distribution of precipitations in IMERG_V06 and IMERG_V07 closely resemble the ground-based observations. On a seasonal scale, both IMERG_V06 and IMERG_V07 products effectively capture the characteristics of abundant summer precipitation, reduced winter precipitation, and moderate precipitation in spring and autumn. Additionally, they accurately reflect the seasonal pattern of higher spring precipitation compared to autumn precipitation in humid regions. In terms of spatial distribution, both IMERG_V06 and IMERG_V07 successfully capture the gradient feature of higher precipitation in the southeastern part and lower precipitation in the northwestern part of mainland China. However, it should be noted that there is a noticeable overestimation of precipitation in coastal areas, particularly in the southeastern region (southeastern part of mainland China), in the IMERG products (
Figure 2). This overestimation issue is observed in all four seasons, with the most significant impact observed during the summer and winter.
Figure 3 illustrates the spatial distribution of different accuracy evaluation metrics for IMERG_V06 and IMERG_V07 at an hourly scale. The Bias metric represents the difference between IMERG SPEs and the precipitation measured by meteorological stations. From
Figure 3a–e, it is evident that both SPEs exhibit significant Bias in the humid region, particularly in the southeastern coastal areas. In the humid region, IMERG_V07 shows a higher level of overestimation (Bias = 0.24 mm) compared to IMERG_V06 (Bias = 0.18 mm). In the semi-humid, semi-arid, and arid regions, both SPEs demonstrate similar performance with relatively smaller overestimation of precipitation. Despite some degree of overestimation errors, the overall consistency of IMERG_V07 with the station observations is noticeably better than that of IMERG_V06, as indicated by the higher CC of IMERG_V07 compared to IMERG_V06. Particularly in the northwestern region (northwestern part of mainland China), the CC of IMERG_V07 has improved from around 0.1 in IMERG_V06 to 0.3–0.4. In the humid region, the overall CC value of IMERG_V07 (CC = 0.41) is higher than that of the arid region (CC = 0.31).
In terms of RB, both satellite precipitation products tend to overestimate precipitation, which is consistent with the distribution of Bias errors. In humid, semi-humid, and semi-arid regions, the RB of both SPEs falls within the range of 0–30%. However, in arid regions, the RB for both versions of IMERG exceeds 80%, with IMERG_V07 (83.5%) having a lower RB value than IMERG_V06 (98.6%). This change may be attributed to the updated algorithm in IMERG_V07, which improved the retrieval accuracy for light precipitation [
7]. As for RMSE, both SPEs exhibit a decreasing trend from the southeast to the northwest, consistent with the distribution of precipitation. This indicates that as the precipitation amount decreases, the RMSE values gradually decrease. RMSE represents the average deviation between meteorological stations and SPEs. Comparing the two precipitation products, it is observed that IMERG_V07 has smaller RMSE values than IMERG_V06 in all four climate regions, particularly in the humid region.
Overall, IMERG_V07 demonstrates better consistency with ground station data, further validating the positive impact of algorithm improvements in IMERG_V07 by the algorithm developers.
To visually understand the precipitation retrieval accuracy and errors of the two versions of IMERG SPEs in different seasons, a Taylor diagram (
Figure 4) is used to display the overall performance at an hourly scale based on three indicators, including correlation coefficient, standardized standard deviation, and root mean square error. As shown in
Figure 4, IMERG_V07 consistently exhibits higher CC values than IMERG_V06 in all seasons, especially in winter. The CC value of IMERG_V06 is only around 0.1 in winter, while the CC value of IMERG_V07 is approximately 0.3–0.4. This is consistent with the spatial distribution of IMERG_V07 in
Figure 4, which shows higher correlation coefficients. It indicates that IMERG_V07 has significantly improved data consistency. Although IMERG_V07 has slightly lower RMSE values than IMERG_V06 in summer and autumn, its performance is inferior to IMERG_V06 in winter and spring. IMERG_V06 has a smaller standard deviation (STD) than IMERG_V07 in spring and winter, while both precipitation products perform similarly in summer. In autumn, IMERG_V07 shows better performance. Overall, IMERG_V06 and IMERG_V07 exhibit their respective advantages in different seasons. IMERG_V07 has shown significant improvement in data consistency compared to IMERG_V06. It performs better than IMERG_V06 in summer and autumn but falls short in winter and spring.
3.1.2. Comparative Analysis of Precipitation Frequency Error Characteristics
The ability to capture precipitation events is an important aspect of evaluating the precipitation retrieval accuracy of SPEs. In this section, the detection capability of IMERG_V06 and IMERG_V07 for precipitation events at the hourly scale is systematically evaluated using three metrics: POD, FAR, and MIS.
Figure 5 shows the spatial distribution of the classification statistics indicators for IMERG_V06 and IMERG_V07. There are significant regional differences in the ability to capture precipitation events between IMERG_V06 and IMERG_V07, similar to the characteristics of precipitation error. Both versions exhibit a decreasing accuracy in capturing precipitation events from the southeastern coastal areas to the northwestern inland regions. Benefiting from changes in the data sources of the sensors and algorithm updates, IMERG_V07 shows significant improvement in the detection capability of precipitation events compared to IMERG_V06.
In terms of POD, both IMERG_V06 and IMERG_V07 demonstrate strong precipitation event detection capabilities in the humid region and the semi-humid region, with POD values exceeding 50%. Moreover, the POD in the semi-humid region is higher than that in the humid region. However, in the semi-arid and arid regions, the POD of IMERG_V06 remains around 34–38%, with a few stations below 10%. On the other hand, IMERG_V07 achieves a POD of 43.6–44.5% in these regions.
In terms of MIS,
Figure 5b–e indicates that IMERG_V07 shows significant improvement in missed precipitation events, particularly in the northwestern region. For several stations in the northwestern region, IMERG_V06 had a high MIS of up to 90%, while IMERG_V07 reduced the MIS to around 60%. This improvement may be attributed to “IMERG_V07 increased PMW estimates over frozen surfaces [
7]”. For the entire mainland of China, the regional average MIS of IMERG_V06 is 49.8%, while that of IMERG_V07 is 45.8%, representing an overall decrease of 4%. The MIS values of both SPEs are lowest in the semi-humid region, followed by the humid region, and highest in the arid region. IMERG_V06’s performance in FAR is less satisfactory, with values increasing from the humid region to the arid region. Although the FAR in the humid region is relatively low, individual stations still reach up to 80%. In the northern regions of China, the FAR of IMERG_V06 reaches 80–100%. In contrast, IMERG_V07 maintains an hourly scale FAR between 50% and 60% in regions other than the arid region, with a uniform distribution and stable performance. In the arid region, although IMERG_V07 still has a relatively high FAR (70.9%), it is greatly reduced compared to IMERG_V06.
In summary, IMERG_V07 demonstrates an overall improvement in the ability to capture precipitation events compared to IMERG_V06, with a 4% increase in regional average accuracy. The improvement is particularly significant in the northwestern region, which may be attributed to the sensor updates and algorithm improvements in IMERG_V07.
3.1.3. Comparative Analysis of Error Components
Due to interdependencies among various metrics, relying solely on conventional indicators to evaluate the performance of SPEs has limitations. For instance, bias can be influenced by the offsetting of positive and negative errors, which can result in the presence of significant absolute errors even when the bias values are small. Ushio et al. [
28] emphasized the crucial importance of assessing error sources for improving SPE algorithms. To enable a more objective error characterization and identify error sources, Tian Yudong [
29] proposed an effective error decomposition method called the Error Component Procedure. It decomposes the Total bias into three error components: Hit bias, Missed bias, and False bias, facilitating further analysis of precipitation errors resulting from the detection of different precipitation events.
Figure 6 illustrates the spatial distribution of Total bias and the three error components for IMERG_V06 and IMERG_V07.
Firstly, from
Figure 6a–e, it can be observed that the values of Total bias are relatively small (ranging from 0 to 0.3 mm) and significantly smaller than the sum of the three error components. This indicates the presence of offsetting positive and negative errors among the error components. Therefore, relying solely on the conventional total precipitation bias to evaluate the performance of SPEs is not sufficiently accurate, and the analysis of precipitation amount errors should consider the characteristics of precipitation events [
30,
31]. Both IMERG_V06 and IMERG_V07 exhibit an overestimation of precipitation in mainland China, particularly in the humid regions.
The Hit bias can have positive or negative values. Based on
Figure 6b–f, it can be observed that the Hit bias of IMERG_V07 is positive in 50% of mainland China, ranging from 0 to 0.2 mm. On the other hand, for IMERG_V06, the portion with positive Hit bias accounts for only 31% and is mainly concentrated in humid regions. From the spatial distribution of Hit bias, it seems that IMERG_V06 performs better in terms of Hit bias.
In relation to the Missed bias, IMERG_V07 exhibits the benefits of algorithm enhancement. The Missed bias values in IMERG_V07, namely 0.69 mm, are slightly lower in comparison to IMERG_V06, which recorded 0.71 mm. This indicates that IMERG_V07 offers more precise estimations regarding missed precipitation, showcasing improved performance resulting from algorithm enhancements. This improvement can be attributed to the incorporation of frozen precipitation retrieval in the IMERG_V07 algorithm.
Among the three error components, False bias is the primary source of error, with values ranging from 0.32 to 1.95 mm. It gradually decreases from humid to arid regions. Overestimating precipitation has been consistently associated with IMERG SPEs since their release [
5,
32,
33,
34,
35]. The IMERG_V07 algorithms have taken several measures to correct overestimation by intercalibrating PMW datasets, which mainly include [
7], firstly, the mean GMI/TMI precipitation rate is not constant along the scan, so the GMI/TMI-constellation calibration was modified to use the full-swath GMI/TMI matched to the narrower swath of CORRA-G/T. Secondly, the GPROF-GMI, GPROF-TMI, and CORRA-G/T footprint sizes differ, so CORRA-G/T is averaged on 0.2° × 0.2° and 0.3° × 0.3° grid box templates to approximately match the GPROF-GMI and GPROF-TMI footprint sizes, respectively. Finally, the GPCP adjustment to CORRA in V06 was unrealistically raising winter precipitation over land and not capturing its longitudinal variability, so the GPCP adjustment is not applied over land in V07 (
https://gpm.nasa.gov/resources/documents/imerg-v07-release-notes, accessed on 21 February 2024). However, the effects of these measures are still not satisfactory, especially for correcting the False bias in the humid (1.04 mm), semi-humid (0.93 mm), and semi-arid regions (0.83 mm), compared to IMERG_V06 (0.99 mm, 0.82 mm, and 0.74 mm, respectively).
Overall, IMERG_V06 and IMERG_V07 exhibit similar spatial distributions of error components, with only slight differences. IMERG_V07 shows a noticeable improvement in Missed bias, but there is still room for improvement in Hit bias and False bias.
3.2. Dependency of Error on Precipitation Intensity
In the previous section, we observed differences in error metrics of IMERG SPEs across different climate regimes, which we speculate may be influenced by precipitation intensity. This indicates that there is variability in satellite identification of precipitation events with different precipitation intensities. Based on this, in this section, we assess the impact of different precipitation intensities on the identification of precipitation events by IMERG SPEs in four seasons, as shown in
Figure 7. We classify precipitation into four thresholds corresponding to light rain, moderate rain, heavy rain, and rainstorm [
36,
37]. It is important to note that the determination of precipitation intensity thresholds relies on ground-based precipitation observations. As a result, the calculation of FAR (false alarms of IMERG SPEs when a ground-based site observed no precipitation event) for different precipitation intensities is not feasible. Therefore, this section focuses solely on the analysis of POD and MIS.
In general, according to the results shown in
Figure 7, there is a strong correlation between the precipitation-capturing capability of IMERG SPEs and the precipitation intensity. The POD of IMERG SPEs is generally below 40%. Among them, IMERG_V06 and IMERG_V07 exhibit the highest POD in summer, followed by autumn, and the performance is least satisfactory in winter (
Figure 7a–d). As the precipitation intensity increases, the recognition ability of IMERG SPEs shows an increasing trend followed by a decreasing trend. The POD of IMERG SPEs is highest for moderate rain, with all values above 30%, while it is lowest for rainstorm events. Although IMERG SPEs can effectively capture precipitation events, there is still a significant misjudgment in capturing precipitation intensity. However, this may also be related to the limited number of samples of heavy precipitation events. It is worth noting that the POD of IMERG SPEs is highest for moderate rain, indicating its strong recognition capability for this category. Compared to IMERG_V06, IMERG_V07 shows a significant improvement in POD, especially for light rain and moderate rain events. By observing
Figure 7e–h, it can be seen that with the improvement in IMERG_V07’s ability to capture light precipitation events, its MIS also increases accordingly. The MIS of IMERG SPEs is lowest in summer, with all values below 35%. The MIS is highest in winter, with IMERG_V07 reaching a maximum of 66.2% and IMERG_V06 reaching a maximum of 63.9% (
Figure 7h), which is associated with snow cover on the ground during winter. As the precipitation intensity increases, the MIS of IMERG SPEs decreases, indicating a lower false-negative rate for strong precipitation events. Except for winter, IMERG_V07 performs best in terms of missed events in rainstorm events, slightly outperforming IMERG_V06. However, these minor improvements do not fully compensate for the missed precipitation events in light rain, moderate rain, and heavy rain categories of IMERG_V07.
Overall, compared to IMERG_V06, IMERG_V07 has made some improvements in capturing precipitation events. However, there is still room for improvement in accurately capturing rainstorm events, reducing false negatives for light precipitation, and improving the accuracy of capturing precipitation events in winter.
3.3. Topographical Dependency of Error
The spatial distribution analysis of accuracy metrics discussed earlier indicates that topographical factors significantly influence the accuracy of precipitation retrieval by IMERG SPEs. To further investigate the impact of terrain on precipitation retrieval accuracy, we have presented line graphs in
Figure 8. These graphs illustrate the influence of topographical factors on the capture accuracy of the precipitation amount and precipitation events in each season. Both IMERG_V06 and IMERG_V07 demonstrate distinct dependencies on elevation, as reflected in the RMSE, POD, MIS, and FAR metrics.
From
Figure 8b,d,f,h, it can be observed that as the elevation increases, both IMERG_V06 and IMERG_V07 exhibit a decreasing trend in RMSE values. The lowest RMSE values are observed in winter (0.05–0.40 mm/h), followed by autumn (0.32–0.80 mm/h), while the highest RMSE values are found in summer (0.73–1.80 mm/h). This pattern is consistent with the spatial distribution shown in
Figure 2 and is associated with the seasonal variation in precipitation. In spring, within the elevation range of 0–600 m, IMERG_V06 has lower RMSE values compared to IMERG_V07. However, beyond an elevation of 600 m, IMERG_V07 demonstrates higher accuracy in precipitation retrieval. IMERG_V07 performs less satisfactorily in summer, with consistently higher RMSE values compared to IMERG_V06. In autumn and winter, IMERG_V06 has lower RMSE values at lower elevations, while IMERG_V07 exhibits lower RMSE values at higher elevations.
Based on the results from the line graphs (
Figure 8a,c,e,g), it is also evident that the capture capability of precipitation events by IMERG SPEs decreases with increasing elevation. This can be attributed to the influence of complex terrain on the distribution and intensity of precipitation, thereby affecting the retrieval results of IMERG SPEs. Within the elevation range of 0–2400 m, both IMERG_V06 and IMERG_V07 show similar performance in capturing precipitation events. However, when the elevation exceeds 2400 m, the retrieval accuracy of IMERG_V07 is lower than that of IMERG_V06.
This indicates that the measures taken to improve the accuracy of precipitation retrieval in high-elevation areas for IMERG_V07 have improved the accuracy of precipitation amount retrieval. However, there are still limitations in capturing precipitation events in areas with elevations exceeding 2400 m.
Based on the observed elevation-dependent characteristics of the error components discussed earlier, we further evaluated the terrain dependency of IMERG_V06 and IMERG_V07 SPEs. In this section, we computed the corresponding error components (Total bias, Missed bias, Hit bias, and False bias) based on seasonal average precipitation and analyzed the dependency of IMERG SPEs’ error components on different elevations (
Figure 9).
According to the results in
Figure 9, we can observe that both types of SPEs exhibit clear elevation dependency in Total bias and error components, with the magnitude of errors varying with altitude. In comparison, the absolute value of Total bias is relatively small and significantly lower than the sum of the absolute values of the three error components. This is due to the phenomenon of positive and negative error components offsetting each other. This conclusion aligns with the spatial distribution of error components presented in
Figure 6. It demonstrates that relying solely on Total bias to evaluate the error characteristics of SPEs may result in an overestimation of their performance.
Based on the dependency of error components on elevation shown in
Figure 9a–h, we observe that the error components of IMERG_V06 and IMERG_V07 exhibit distinct seasonal characteristics at different elevations. In spring, both IMERG_V06 and IMERG_V07 have a Total bias of around 0–0.2 mm. Most of IMERG_V06’s Hit bias is negative, while IMERG_V07’s Hit bias changes from positive to negative at an elevation of 1800 m.
In summer, the absolute values of all error components are significantly higher compared to spring, which is attributed to higher precipitation levels, consistent with the precipitation distribution shown in
Figure 2b,f,j. Both precipitation products exhibit an overestimation of precipitation in summer. Specifically, the False bias and Missed bias of IMERG_V06 and IMERG_V07 exhibit roughly symmetric distributions (
Figure 9c,d). However, False bias remains the dominant error source for both IMERG_V06 and IMERG_V07 in summer, as after positive and negative offset, their Total bias is positive, and Hit bias is smaller than Total bias, indicating that False bias has the largest contribution to the error components.
In autumn, the error components decrease compared to summer, ranging from −0.98 to 0.85 mm. Both IMERG_V06 and IMERG_V07 have positive Total bias, indicating a prevalent overestimation of precipitation. The False bias decreases with increasing altitude, while Missed bias shows the opposite trend.
In winter, the absolute values of False bias and Missed bias of both IMERG_V06 and IMERG_V07 decrease with increasing elevation. Hit bias decreases with elevation in the range of 0–2700 m and then changes from positive to negative, with its absolute value increasing with elevation. Within the elevation range of 0–1200 m, both precipitation products have positive Total bias, which then changes to negative. Therefore, it can be inferred that when the elevation exceeds 1200 m, Missed bias becomes the main error source for both IMERG_V06 and IMERG_V07 in winter.
In conclusion, both types of SPEs exhibit elevation dependency in Total bias and error components. In spring, summer, and autumn, False bias is the main error source for IMERG_V06 and IMERG_V07, while in winter, when the elevation exceeds 1200 m, Missed bias becomes the primary error source for both IMERG_V06 and IMERG_V07.
3.4. Dependency of Errors on Climate Type
Next, we divided the entire mainland of China into four climate regions: humid, semi-humid, semi-arid, and arid regions (
Figure 1a), and studied the sources of errors in these regions. Overall, the error components of IMERG_V06 and IMERG_V07 decrease with increasing aridity, which is consistent with the objective reality.
By observing
Figure 10a,e,i,m, it can be found that in the semi-humid region, IMERG_V07 shows significant improvement in Total bias compared to IMERG_V06, decreasing from 0.175–0.19 mm to 0.025 mm. IMERG_V06 and IMERG_V07 perform almost the same in terms of Total bias in the arid region. However, in the humid and semi-arid regions, IMERG_V06 performs better than IMERG_V07, and IMERG_V07 exhibits higher data dispersion in the arid region. In all four climate regions, both IMERG_V06 and IMERG_V07 have a Total bias greater than 0, indicating an overestimation of precipitation, especially in the humid region, which is consistent with the analysis results in the previous figure (
Figure 6).
In terms of Hit bias (
Figure 10b,f,g,n), except for the arid region, the performance of IMERG_V06 and IMERG_V07 in Hit bias values is nearly the same. The main difference is that the Hit bias of IMERG_V07 is mostly positive, while the Hit bias of IMERG_V06 tends to be negative, especially in the semi-arid region. In the arid region, IMERG_V07 shows improvement, with enhanced numerical values and dispersion compared to IMERG_V06. The improvement in the accuracy of precipitation inversion in the arid region by IMERG_V07 may be attributed to the adoption of more advanced precipitation estimation algorithms and techniques. The skill of the GPROF V07 retrieval algorithm varies with surface types. The GPROF V07 algorithm used to retrieve precipitation from all PMW inputs for IMERG has made progress in handling difficult surface types that tend to yield lesser quality results, including frozen surfaces, orographic areas, and coastal zones. By introducing new precipitation estimation methods, IMERG_V07 can more accurately capture precipitation in arid regions, thereby improving consistency with ground station data.
Figure 10c,g,k,o shows the situation of Missed bias for IMERG_V06 and IMERG_V07. By comparison, we found that the overall performance of Missed bias in the four climate regions is better for IMERG_V07 than for IMERG_V06. This may be associated with algorithm improvements in IMERG_V07, including the inclusion of PMW estimates over frozen surfaces [
7]. Specifically, the Missed bias of IMERG_V06 ranges from 0 to −2.20 mm with high data dispersion, while the absolute values of Missed bias in IMERG_V07 are relatively small, mainly concentrated in the range of 0.5–1.0 mm.
Based on
Figure 10d,h,l,p, it can be seen that the main source of error for both IMERG_V06 and IMERG_V07 in the four climate regions comes from False bias. In the humid region, the False bias of both SPEs is mainly concentrated in the range of 0.8–1.2 mm, but the data for IMERG_V06 is more dispersed, and the data dispersion of IMERG_V06 is higher in the semi-humid region (0.54–1.74 mm). In the semi-arid region, the False bias of IMERG_V06 is better than that of IMERG_V07, mostly distributed in the range of 0.6–0.8 mm, while the False bias of IMERG_V07 is mainly concentrated in the range of 0.8–1.0 mm.
Only by using Total bias, can it be inferred that the performance of IMERG_V06 is better than IMERG_V07 in the humid, semi-arid, and arid regions. However, by comparing the performance of each error component in the four climate regions, it is found that the actual situation is not so simple. This fully demonstrates the necessity of decomposing error components.
In summary, the error components of IMERG_V07 in the four climate regions are relatively more concentrated, with smaller data dispersion, especially in terms of Missed bias. The algorithm improvements in IMERG_V07 are most evident in the arid region. False bias is the main source of errors for both SPEs in the four climate regions, and the secondary contribution comes from Hit bias.