1. Introduction
Atmospheric aerosol is a colloidal suspension of liquid or solid particles [
1]. It can affect the quality of our lives through direct and indirect process. Particulate matter (PM) containing sulfate, organic carbon and nitrate can affect light scattering and reduce the visibility [
2,
3]. Besides, aerosol particles also have a significant impact on earth’s hydrological cycle [
4]; terrestrial and marine eco-system [
5]; agriculture production [
6]; and climate change [
7]. Particulate matter (PM) from both natural and anthropogenic emission sources can cause adverse effects on public health [
8]. Long-term exposure to particulate matter with aerodynamic diameters less than 2.5 μm (PM
2.5) can cause lung and respiratory diseases and even premature death [
9]. PM
2.5 not only threatens people’s health but also causes the decrease of atmospheric visibility and the degradation of the city scenery [
10]. In recent years, with the rapid development of industrialization and urbanization, PM
2.5 has become one of the primary air pollutants in China, especially in most major cities, such as Beijing, Shanghai and Guangzhou, where the fastest economic growth has occurred [
11]. To understand the effects of PM
2.5 on the Earth’s environmental system and human health, routine monitoring of PM
2.5 is necessary.
Given the considerable advantages of remote sensing, especially the large coverage provided at the spatial scale and the stable continuity at the time scale, aerosol optical depth (AOD) retrieved from remotely sensed data has been widely considered to be a good method of atmospheric monitoring [
12]. In recent years, many researchers have revealed the potential of remote sensing to measure AOD and thereby estimate the ground PM
2.5 in China or even at the global scale [
13,
14]. AOD is the column integration of light extinction in the atmosphere, and it is an optical property of aerosol. The relationship between AOD and PM
2.5 corresponds to the vertical distribution of aerosol, extinction efficiency, mass concentration, aerosol types, microphysical and chemical properties, and the hygroscopicity of aerosol [
15,
16,
17,
18].
The total AOD is widely used in the estimation of PM
2.5 [
19,
20]. AOD represents the extinction of all particles, while PM
2.5 is a fine mode particle. To obtain an accurate estimation of PM
2.5, fine-mode AOD is adopted to simulate the extinction caused by PM
2.5. Zhang and Li [
21] used AErosol Robotic NETwork (AERONET) and MODIS FMF products to obtain aerosol extinction relative to fine-mode particles. The retrieval accuracy of AERONET particle size distribution and complex refractive index is acceptable for most remote sensing applications, even in the presence of rather strong systematic or random uncertainties in the measurements [
22]. However, the definition of AERONET FMF and the uncertainty of MODIS FMF need to be discussed. Fine- and coarse-mode separation for AERONET FMF is the minimum point of the particle size distribution line within the size interval from 0.439 to 0.992 µm. It is not stable, but is a dynamic cut point; therefore, the fine-mode particle defined by AERONET cannot fully represent PM
2.5. Furthermore, MODIS FMF over land had large uncertainties and little skill in deriving a meaningful expected error (EE) envelope [
23]. Therefore, it is not recommended to be used for establishing a model. Yan et al. [
24] proposed a LookUP table-SDA (LUT-SDA) method for FMF retrieval, which was validated on the city scales of Beijing, Hong Kong, and Osaka using MODIS data, and tested in northern China using Himawari-8 satellite data [
25]. The LUT-SDA follows the spectral deconvolution algorithm (SDA) first presented by O’Neill et al. [
26] and improved by building a four-dimensional LUT for calculating satellite FMFs. The most significant advantage of the LUT-SDA method is that it has no 0 values comparing with the MODIS FMF products and a better accuracy achievement. MODIS LUT-SDA FMF was used to improve the application of satellite FMF on PM
2.5 retrieval and obtained acceptable results under heavily polluted conditions [
27].
AOD is the column extinction of aerosol, while PM
2.5 is measured on the surface level. The surface level extinction can be obtained from the vertical correction of AOD. The planetary boundary layer height (PBLH) was reported as an important parameter by which the surface extinction coefficient could be extracted from satellite AOD [
28,
29].
The most difficult challenge in obtaining dry mass concentration of fine particulate matters from remote-sensing measurements is the conversion from aerosol extinction to PM
2.5 mass concentrations. A number of studies focus on revealing or simulating the relationship between aerosol extinction and PM
2.5 concentrations [
18,
28,
30]. For example, Koelemeijer et al. [
28] use a linear correlation coefficient to describe the relationship between PM
2.5 and the surface aerosol extinction in different types of stations. Lin et al. [
18] proposed two extinction-PM
2.5 conversion reference parameters, which are stable values for fixed stations. The reference parameters represent the mixing effect of hygroscopic growth, aerosol extinction, particle mass concentrations, and size distribution. Cheng et al. [
30] extracted the mass extinction efficiency (the slope of PM
2.5 and the extinction fitting line) for extinction-PM
2.5 conversion.
The mass concentration obtained by aerosol extinction is under an ambient condition. Humidity correction is needed to convert the ambient particle mass concentration into a dry particle mass concentration. The relative humidity and aerosol hygroscopic growth factor (e.g., f(RH)) are widely used predictors for humidity correction [
13,
18,
31].
In this study, a Specific Particle Swarm Extinction Mass Conversion Algorithm (SPSEMCA) has been introduced. Zhang and Li [
21], and Lin et al. [
18], established the AOD—PM extinction conversion models, which considered the particle size distribution in the physical model. However, their results corresponded to the concentration of fine-mode particles. They used the MODIS FMF (the contribution of the fine-dominated model to the total AOD) to assimilate the AOD related to fine-mode particles, not precisely to PM
2.5. At the same time, SPSEMCA focuses on estimating the concentration of target particles PM
2.5 by performing particle correction with η
2.5 (extinction fraction corresponding to particles with a diameter less than 2.5 μm). The ground-level observed PM
2.5, PBLH and RH reanalyzed by the European Centre for Medium-Range Weather Forecasts (ECMWF), AOD, FMF, particle size distribution, refractive indices from AERONET stations of Beijing area in 2015 were used to establish this model, and datasets of 2016 were used to test the performance of SPSEMCA. The AOD-PM
2.5 conversion algorithm based on AERONET data is explained in
Section 2. The application results and uncertainty sources of SPSEMCA using AERONET observation data and MODIS monitoring data are shown in
Section 3.
Section 4 gives the conclusion and possible further improvements to this study.
3. Establishing the Model
The methodology described above mainly relies on particle size distribution and refractive indices, which are hard to measure and obtain, and the related dataset is limited. Many other parameters that are easier to access are used to simulate the above parameters, e.g., η2.5, AVEC, and AMV.
3.1. Establishing the PM2.5—AOD Retrieval Model
The conversion from AOD to PM2.5, in this study, goes through several steps: Particle correction, vertical correction, extinction conversion to volume, and humidity correction, and every step includes its fitting process. Finally, when all the fitting steps are synthesized together, the mass concentration of PM2.5 will be obtained. The aerosol property and meteorological data in 2015 were used as the test dataset to establish the retrieval model, and the 2016 data were recognized as the prediction dataset to use for validation.
3.1.1. Particle Correction
Necessity of Particle Correction for the Beijing Area
The annual distribution of FMF shows significant seasonal characteristics (
Figure 2a). The FMF in spring is obviously lower than that of the other three seasons, while a large majority of samples in summer have high FMF (amasses approximately 0.8). The FMF of samples in autumn and winter was unstable, scattering from 0.1 to 0.95. For a lower FMF, from 0 to 0.5, the AOD decreased with the increase of FMF; then, AOD turned to increase with FMF in a higher range (FMF from 0.5 to 1) (
Figure 2b). A large majority of high AOD is contributed to by fine-mode particles, as shown in
Figure 2b(A). There also exists the area with lower FMF and higher AOD, as shown in
Figure 2b(B), which is mainly caused by dominant coarse-mode particles in aerosol. For instance, a dust storm frequently occurs in spring, which may affect the relationship between AOD and the concentration of fine-mode particles PM
2.5. Therefore, using the total AOD to retrieve concentrations of fine-mode particles with a diameter of less than 2.5 μm is somewhat unreasonable.
To clarify the stability of FMF, the seasonal particle size distribution of Beijing AERONET station in 2015 is shown in
Figure 3. In spring, aerosol is predominated by coarse-mode particles, and the fine-mode fraction is relatively lower, which can also be seen in
Figure 2b(B). Summer is a season with high humidity. The moisture in aerosol promotes the secondary transformation of particles, so that the concentration of fine-mode particles is obviously higher than that of other seasons. Meanwhile, with the impact of moisture, aerosol hygroscopic growth accelerates, and the fine-mode median radius of summer aerosol particles is bigger than that of the other seasons. The particle size distribution of winter and autumn is similar.
Method of Particle Correction
As in the above analysis, the concentration and fraction of fine-mode particles is not stable, which shows great differences with the change of season and AOD. AOD is the extinction of all particles, which cannot represent extinction contributed by fine-mode particles. It is a very important process to perform the particle correction and eliminate the impact of coarse-mode particles in the PM2.5-AOD retrieval model. Therefore, we adopted η2.5 to perform particle correction for AOD (Equations (1) and (2)).
However, the calculation of η2.5 relies on refractive indices and the particle size distribution function, which rely on the successful retrieval of the size distribution function based on the size distribution scattering plot. A great amount of computation is required before η2.5 can be obtained, which is time consuming. In this study, we try to use parameters that are contained in ground-based or satellite aerosol products to fit with η2.5, which will bring additional possibilities for the generalization of our retrieval model.
In this paper, we calculated η
2.5 based on AERONET particle size distribution and refractive indices data. To avoid introducing large extra uncertainties, the annually averaged refractive indices were used for the input of the Mie calculation for each station. After integration, η
2.5 was obtained. It should be noted that the diameter of 2.5 μm refers to the projected area equivalent diameter, which should be converted into the volume equivalent diameter for Mie calculation [
35]. Here, η
2.5 adopts a specific truncation radius at a volume equivalent to a diameter of approximately 1 μm, which is used as a separation point between fine- and coarse-mode particles. After preliminary experiments, good correlation between the effective radius (EffRad), fine mode fraction (FMF), and η
2.5 was found. We use EffRad and SDA FMF to simulate η
2.5 based on AERONET data of 2015. As is shown in
Figure 4a, η
2.5 displays an exponential decline with the increase of EffRad, the fitting line is η
2.5 = 1.143 × e
(−0.77 × EffRad), and R reaches 0.97. All of the grouped standard deviations are low. The η
2.5 increases with FMF (
Figure 4b), and the simulation function is η
2.5 = 0.339 × ln(FMF) + 0.931, the correlation coefficient is 0.80. The fitting performance for samples with high FMF is good enough. At the same time, the fitting uncertainties are relatively larger for an FMF less than 0.4. Beijing is usually dominated by fine-mode aerosol, and FMF is usually higher than 0.4, so the fitting result is still acceptable. Performing particle correction for ground-based AERONET AOD will adopt an EffRad-η
2.5 simulation relationship. Although MODIS does not release EffRad products, we used the FMF-η
2.5 fitting line to perform particle correction for the satellite data. Here, AERONET SDA FMF was used to establish the relationship between FMF and η
2.5.
3.1.2. Vertical Correction
We used the ECMWF reanalyzed PBLH to perform the vertical correction, converting AOD
2.5 to a ground-level extinction coefficient (Equation (4)). We compared the vertical correction performance of ECMWF PBLH with PBLH retrieved based on CALIPSO backscattering data (
Figure 5). The extraction of CALIPSO PBLH is based on an algorithm that was proposed by Jordan [
36]. This method uses a hybrid standard deviation algorithm, which is more sensitive than the traditional approaches that were used in some cases. In this method, the Haar wavelet and general threshold were employed to approximate the CALIPSO PBLH. As is shown in
Figure 5, the CALIPSO PBLH is lower than the ECMWF PBLH as a whole. In winter and spring, they match well, while a significantly higher ECMWF PBLH occurs from May to June. This generally higher ECMWF PBLH will be discussed and adjusted in the following model modification part.
To convert AOD to surface level aerosol extinction, vertical correction is needed.
Figure 6 shows the vertical correction results for AOD
2.5; both the PBLH from CALIPSO (PBLHc) and from ECMWF (PBLHe) have positive correction effects for the AOD
2.5−PM
2.5 relationship, and the correlation coefficients are 0.69 and 0.65, corresponding to PBLHc and PBLHe, respectively. As PBLHc is based on CALIPSO actual observation data, the vertical correction performance of PBLHc is better. However, the temporal and spatial coverage of CALIPSO data is limited; therefore, ECMWF reanalyzed PBLH was used to perform the vertical correction, and CALIPSO PBLH was used to perform some adjustment to the model, as is described in the model modification part.
3.1.3. Extinction Conversion to Volume
In this study, we introduce AVEC (averaged volume extinction coefficient, μm
2/μm
3, Equation (6)), to obtain
V2.5 (volume of particle with diameter less than 2.5 μm per unit volume, μm
3/μm
3, Equation (5)). AVEC is a specific property of aerosol that may be affected by the aerosol composition and surrounding environmental factors. To figure out the influential factors of AVEC, the relationship of AVEC with fine mode fraction at 500 nm, the RH, the temperature, the surface pressure, and the concentration of trace gases (SO
2, O
3, NO
2, CO), are shown in
Figure 7. Among the eight parameters, only fine mode fraction at 500 nm and RH has a good liner relationship with AVEC. With the increasing of fine mode fraction at 500 nm and the increasing RH, AVEC presents a linear and logarithm growth trend, respectively. The other six parameters show some relativity with AVEC, but their linear relationship is not very obvious.
Meanwhile, the Pearson correlation test result (
Table 2) shows that AVEC is significantly related to fine mode fraction at 500 nm, RH, temperature, SO
2, O
3, NO
2, and CO, at 0.01 level. The correlation coefficient of fine mode fraction at 500 nm, and RH is higher than 0.5.
Considering that the relationships of AVEC with FMF, RH, TM, SO
2, O
3, NO
2, and CO are significant, these seven parameters were used to perform stepwise regression. Using 2015 data to perform this regression, five successful models were obtained (
Table 3). Given that the determination coefficient of models 3, 4, and 5 increases little after introducing O
3, TM, and CO, we adopt model 2 in
Table 3 with predictive variables FMF and RH as the simulation model for AVEC, and the function is AVEC = 3.496 + 2.74 × FMF + 1.9 × (RH/100). To eliminate the impact of the data dimension, we used a standard coefficient to estimate the influence from the predictive variable. The standard coefficients of FMF and RH are 0.629 and 0.29, respectively; in other words, this simulation model is 68% affected by FMF and 32% affected by RH.
AVEC is simulated in 2016 by introducing FMF and RH. Predicted AVEC fits well with measured AVEC, and the correlation coefficient of the predicted and measured AVEC is 0.69. Then, we used this predicted AVEC to convert the extinction to volume, the relationship of PM
2.5 and
V2.5 (AOD
2.5/PBLHe/AVECp) is distributed as shown in
Figure 8, and the correlation coefficient is 0.66.
3.1.4. Humidity Correction Using an Empirical Model
In this study, the AMV (averaged mass volume, cm
3/g, Equation (8)) is defined to convert
V2.5 under moisture conditions into a dry particle mass concentration; in other words, AMV is used to make the humidity correction for
V2.5. We adopted the empirical particle hygroscopic growth function [
37] as follows:
where a and b are parameters of the hygroscopic growth function; here, we used the 2015 data to assimilate these two empirical coefficients.
The simulation results of f
AMV(RH) are shown in
Figure 9, and have a similar trend as that of the atmospheric particulates hygroscopic growth simulation results of Liu [
38] and Wang [
31] for the Beijing area. The empirical coefficients a and b are calculated as 0.97 and 0.61, respectively. The large majority of AMV-RH points fit the simulation line, which indicates that the empirical function assimilated for Beijing area is appropriate. AMV increased slowly and near stably when RH was less than 60%, but for high RH situations, the AMV grew exponentially with the increase of RH. Although samples of RH above 60% accounted for a small portion, AMV was varied in a wide range. Appropriate humidity correction is necessary.
Uncertainties of scattered points with AMV > 4 cm
3/g and RH < 60% in
Figure 9 were analyzed. We found that these points were under a certain boundary layer structure with a high level inversion layer and high surface level wind speed. Surface pollution was lifted upwards and spread with wind. Particle size distribution observed by AERONET could not represent the surface level aerosol properties. This condition is inevitable but rare, which has little influence on the simulation of humidity correction curve. In general, the humidity correction function adopted in this study is efficient.
3.2. Model Modification
Finally, after humidity correction,
V2.5 was converted to PM
2.5. The distribution of predicted PM
2.5 and in situ measured PM
2.5 is shown in
Figure 10. For all of the 727 matching samples of the Beijing area in 2016, a correlation coefficient of predicted and measured PM
2.5 reaches 0.70, which is acceptable. Approximately 79.77% of the 727 matching samples was underestimated, especially for samples with in situ measurement of PM
2.5 larger than 200 μg/m
3. The slope of the linear regression function is only 0.46.
Particle correction, humidity correction and other conversion are calculated strictly according to AERONET and PM2.5 in situ observation data. Therefore, underestimation may arise during the vertical correction. Furthermore, PBLH obtained from CALIPSO backscattering data and ECMWF reanalysis data did not match very well, as was discussed above. ECMWF re-analyzed PBLH data was used for the PM2.5 prediction, but PBLHe is higher than PBLHc as a whole, which is the main reason for the underestimation of PM2.5.
Then, we made an in-depth comparison between PBLHe and PBLHc. The matched PBLHe and PBLHc for the Beijing area in 2015 is distributed in
Figure 11. As is shown in the figure, PBLHc ranges from 0 to 2.8 km, while PBLHe ranges from 0 to 4 km. PBLHe is substantially higher than PBLHc, especially for a PBLHe that is larger than 2 km, which is similar to the underestimation trend of PM
2.5.
Figure 12 shows the CALIPSO backscattering distribution of the Beijing area on February 13, 2015. Aerosol particles mainly spread under 2 km, while particles above 2 km decreased rapidly with the increase of height. Above 4 km, only trace particles could be detected; moreover, its distribution is very smooth. In this figure, the white line indicates PBLH obtained from ECMWF reanalysis datasets, and the black line refers to the PBLH derived from a hybrid standard deviation algorithm using the Haar wavelet and general threshold calculation proposed by Jordan [
36]. PBLHc delimits the boundary layer appropriately; under this height, particles are distributed uniformly. However, for south of the CALIPSO transit area, PBLHe is about one times higher than PBLHc, and PBLHe is too high, which should be revised so that it is in line with the actual situation.
We suppose that the interception of the PBLHc-PBLHe regression function is 0; then, the slope of the liner regression function is 0.58. For the sake of uniformity, the magnitude of PBLHe without changing its relative order, PBLHe × 0.58 is used to perform the modification for the above estimation model, and this revised model is the final PM2.5-AOD retrieval method proposed in this study, which is called the specific particle swarm extinction mass conversion algorithm (SPSEMCA).
3.3. Model Error Analysis
The performance of three simulation functions for η
2.5, AVEC, and AMV in this paper were tested (
Table 4). To obtain a better estimation of the simulation model’s performance and to avoid the influence caused by extreme scattering points, we adopted the median absolute error (AE) and the corresponding relative error (RE) to evaluate the uncertainty of three simulation functions. The median absolute error is approximately 0.030, 0.381 μm
3/μm
2, 0.465 cm
3/g for η
2.5, AVEC, and AMV, respectively, based on the modelling dataset of 2015. The averaged relative error of η
2.5 and AVEC are lower; both are less than 7%, while uncertainties coming from AMV are relatively high, at approximately 36.6%. Therefore, humidity correction in this study may bring large uncertainties to the final estimation.
According to the error propagation theory, the PM
2.5 estimation model suffers from uncertainties of the simulation errors of η
2.5, AVEC, and AMV; furthermore, it also comes from the measurement errors of AOD and PBLH, which follow the equation of:
The measurement errors coming from observation data δAOD and the reanalysis data δPBLH is difficult to estimate; therefore, we just take η
2.5, AVEC, and f
AMV(RH) into account. Based on
Table 4, η
2.5, AVEC, and f
AMV(RH) can cause errors at approximately 3.75%, 6.29%, and 36.6%, respectively. According to Equation (10), the total uncertainty caused by parameterization schemes in our specific particle swarm extinction mass conversion algorithm is 37.32%, regardless of the uncertainties caused by the measurement parameters. The uncertainties related to η
2.5 and AVEC were approximately 22% as the whole, which indicates that the simulation formulae and relationships found in this study are appropriate and accurate. At the same time, almost all of the other uncertainties were caused by f
AMV(RH), which may be affected by the aerosol vertical variation, the measurement error of PM
2.5, the reanalysis error of RH, the input uncertainties of
V2.5, and the style of the empirical formula that we selected. In the future, the vertical distribution of aerosol will take into consideration, and uniform data sources will be used.
5. Conclusions
In this paper, a specific particle swarm extinction mass conversion algorithm (SPSEMCA) has been introduced. This method uses particle correction, vertical correction, and humidity correction to successfully convert AOD into PM
2.5. Both the applications of SPSEMCA to AERONET observation data and the MODIS monitoring data obtained acceptable results, R = 0.70, RMSE = 58.75 μg/m
3 for AERONET data, and R = 0.75, RMSE = 43.38 μg/m
3 for MODIS data. These results perform better compared with the results of Zhang [
21], R = 0.5, RMSE = 64 μg/m
3 based on the MODIS data, with hourly in situ measurements over North China during October–December, 2013. Furthermore, the trend of temporal and spatial distribution of Beijing has been revealed. As for SPSEMCA, there are approximately 37.32% uncertainties that come from the parameterization schemes of η
2.5, AVEC, and f
AMV(RH). Meanwhile, the satellite application of SPSEMCA suffers large uncertainties from the data quality of FMF, which may lead to the systematic underestimation of PM
2.5. Furthermore, the PBLH that was obtained from the ECMWF reanalysis data, which is systematically overestimated compared to the PBLH that was retrieved by CALIPSO backscattering data. A slope modification was made in this paper to rectify this overestimation trend.
This method has five innovation points: (I) We use η2.5 rather than FMF to assimilate AOD2.5, which is contributed to by PM2.5; (II) the assimilation factors of AVEC were selected from eight likely influencing factors and two parameters FMF and RH were finally selected to assimilate AVEC; (III) the performance of PBLH retrieved by satellite Lidar CALIPSO data and a reanalysis by ECMWF were compared in the model establishment process, and CALIPSO PBLH was used to make a systematic correction of the ECMWF PBLH; (IV) we used PM2.5 measured by the ground-based air quality station as the dry mass when calculating the AMV, to avoid the uncertainties derived from the estimation of the particulate matter density ρ; (V) MAIAC AOD with the resolution of 1 km × 1 km AOD was used to retrieve high resolution PM2.5 distribution, and MODIS LUT-SAD FMF was used to avoid the large uncertainties caused by the MODIS FMF product.
In our further study, a consistent source of datasets of PM2.5 and other meteorological parameters should be detected and used. More appropriate linear or nonlinear simulation formulae should be tested to make an accurate humidity correction for PM2.5 retrieving. Vertical correction in SPSEMCA will be improved by detecting more accurate PBLH retrieval methods or high-quality products. Furthermore, an extinction profile monitored by ground-based Lidar is also expected to be used in our future study.