1 Introduction
Exposure to outdoor fine particulate matter (PM; less than 2.5 m in diameter) is a global public health issue, accounting for 8.9 million deaths in 2015 (Burnett et al., 2018). Beyond mortality, short-term exposure to elevated PM levels is associated with numerous adverse health outcomes, including increased hospital admissions for respiratory and cardiovascular issues (Dominici et al., 2006; Wei et al., 2019). Long-term exposure is associated with neurodegenerative diseases such as dementia, Alzheimer's disease, and Parkinson's disease (Kioumourtzoglou et al., 2016). High spatiotemporal monitoring of PM concentrations to inform population exposure is important for both air quality regulation and epidemiological studies. Ground monitors can provide highly accurate measurements but have limited spatial coverage. Here we show how geostationary satellite observations of aerosol optical depth (AOD) over East Asia from the Geostationary Ocean Color Imager (GOCI) can be used with a random forest (RF) machine learning (ML) algorithm to provide continuous long-term reliable mapping of 24 h PM at a 6 6 km spatial resolution.
The potential of satellites for high-resolution monitoring of PM has long been recognized in the public health community (Liu et al., 2004; van Donkelaar et al., 2006). Satellites retrieve AOD by the backscatter of solar radiation. The MODIS sensors launched in 1999 on the NASA Terra and Aqua satellites have been the main source of AOD data, with global coverage twice a day at up to 1 km resolution (Remer et al., 2005, 2013; Lyapustin et al., 2018). Early approaches to relate AOD observations to surface PM used chemical transport models (CTMs) to estimate local PM AOD ratios (Liu et al., 2004; van Donkelaar et al., 2006), with more recent studies adding ancillary satellite data on the vertical distribution of aerosol extinction (Geng et al., 2015; van Donkelaar et al., 2016, 2019). Other approaches have used PM network data to infer PM AOD ratios (Wang and Christopher, 2003), with statistical models based on meteorological and land use predictor variables to enable spatial extrapolation (Gupta and Christopher, 2009; Liu et al., 2009; Kloog et al., 2012, 2014).
More recently, non-parametric machine learning models have been developed to predict PM from satellite AOD observations, including neural networks (Li et al., 2017; Zang et al., 2019), and RFs, including approaches that fuse both (Di et al., 2019). RF has been applied to MODIS AOD to produce high-resolution daily PM products for the USA (Hu et al., 2017) and China (Guo et al., 2021). Others have used RF and satellite AODs to produce monthly PM data over the North China Plain (Huang et al., 2018), in addition to daily PM data in California (Geng et al., 2020) and Cincinnati, Ohio (Brokamp et al., 2018).
Geostationary satellites are now dramatically increasing the capability for the mapping of PM from space. The GOCI instrument launched in 2010 by the Korea Aerospace Research Institute (KARI) observes AOD eight times daily at 0.5 0.5 km pixel resolution over eastern China, the Korean peninsula, and Japan (Choi et al., 2018). The fine-pixel hourly information is intrinsically valuable and also facilitates cloud clearing (Remer et al., 2012). GOCI AOD data aggregated to 6 6 km resolution have been used to estimate PM in regional studies for the Yangtze River Delta (She et al., 2020) and eastern China (Xu et al., 2015). Park et al. (2019) find that PM can be inferred over the Korean peninsula with greater accuracy when using GOCI AOD rather than sparser MODIS data. AOD products from the Advanced Himawari Imager (AHI) on board the Himawari-8 and Himawari-9 geostationary meteorological satellites over East Asia have also been used to infer surface PM (Wang et al., 2017; Chen et al., 2019).
AOD cannot be observed under cloudy conditions, and AOD retrievals from satellites can also fail for other reasons, including snow surfaces. Different methods have been used to fill the data gaps and produce continuous data sets. Some studies use chemical transport model (CTM) AODs when satellite data are missing (Hu et al., 2017; Stafoggia et al., 2019). Kianian et al. (2021) used a statistical interpolation algorithm combining RF with the lattice kriging method to infer missing AOD over the USA, while Di et al. (2019) used an RF trained on gap-free covariates to fill in the gaps for MODIS AOD. Yet, others first estimate PM using available AOD observations and then infer missing PM estimates using a separate gap-filling model (Kloog et al., 2014; She et al., 2020). Brokamp et al. (2018) show that AOD retrieval failure is itself predictive of PM, an insight we leverage in this work.
Here we apply an RF algorithm to 2011–2019 GOCI AOD data to construct a continuous dataset of 24 h PM concentrations at a 6 6 km resolution for eastern China, South Korea, and Japan trained with surface network data. This is a larger spatial domain than has been attempted in previous studies. We ensure continuity by using gap-filled AOD, calculated by blending a CTM simulation with statistical interpolation, along with a parameter characterizing the length scale of the interpolation as inputs to the RF algorithm. This strategy maximizes the training set size and allows the RF to determine a strategy to handle information encoded by retrieval failure. The resulting gap-filled product predicts PM with comparable skill when AOD observations are absent as when they are available. We characterize the error in the RF-produced GOCI PM dataset for both 24 h and annual concentrations and demonstrate the ability of the dataset to capture spatial and day-to-day variability on urban scales. We exploit the continuity of the dataset to determine trends of PM air quality in East Asia over the past half decade.
2 Data and methods
2.1 Datasets
2.1.1 GOCI AODs
GOCI is on board the Korean Communication, Ocean, and Meteorological Satellite (COMS) that was launched by KARI in June 2010 (Choi et al., 2012, 2016). The first ocean color imager placed in geostationary orbit, GOCI covers a 2500 2500 km domain centered on the Korean peninsula at 36 N and 130 E, with 0.5 0.5 km pixels observed every hour from 00:30 to 07:30 UTC. AOD at 550 nm over land is retrieved using the GOCI Yonsei aerosol retrieval (YAER) version 2 (V2) algorithm at an aggregated 6 6 km spatial resolution and 1 h temporal resolution (Choi et al., 2018). Aggregation filters out pixels affected by sunglint or clouds and the darkest 20 % and brightest 40 % pixels within the 6 6 km scene (Choi et al., 2018). We further aggregate the hourly AOD measurements of AOD into a daily mean for use in the RF.
Validation of the GOCI YAER V2 AOD with surface measurements from the AERONET surface network shows a high correlation (), a root mean squared error (RMSE) of 0.16, and a mean bias (MB) of 0.01, with no significant spatial variation across East Asia (Choi et al., 2018). GOCI YAER V2 also reports a fine-mode fraction (FMF) and a multiple prognostic expected error (MPEE) for the AOD, but we find that they are not useful in our RF, as discussed later. For comparison, we also calculate an RF trained on the GOCI–AHI fusion AOD product of Lim et al. (2021). The Advanced Himawari Imager (AHI) instruments on board the Himawari-8 and Himawari-9 geostationary meteorological satellites were launched in October 2014 and November 2016, respectively. AHI has a larger field of view than GOCI but a shorter record.
2.1.2
PM network data
We use hourly PM data from operational air quality networks in eastern China, South Korea, and Japan and average them over 24 h and over the 6 6 km GOCI AOD grid to define targets for the RF algorithm. Data for eastern China are from the National Environmental Monitoring Center (
We use hourly meteorological data from the ERA5 global reanalysis, with a resolution of 30 30 km (Hersbach et al., 2020), as input predictor variables for the RF algorithm. For this purpose, we aggregate the data to 24 h averages and allocate them to 6 6 km GOCI grid cells by bilinear interpolation. We consider boundary layer height, 2 m air temperature and relative humidity (RH), 10 m meridional and zonal winds, and sea level pressure as potential meteorological predictor variables. We also include latitude, year, day of the year (1–366), and nation (eastern China, South Korea, or Japan) as geographical predictor variables. We considered the 2015 population density (CIESIN, 2018) as a potential predictor variable but found that it was not useful, as discussed in Sect. 3.2.
Figure 1
Mean aerosol optical depth (AOD) and surface network PM concentrations over the Geostationary Ocean Color Imager (GOCI) viewing domain, 2011–2019. Panel (a) shows the mean GOCI AOD data on the 6 6 km grid. Panel (b) shows the mean surface network PM data for eastern China (starting in May 2014), South Korea (starting in January 2015), and Japan, using large data symbols for visibility. A magnified inset for South Korea shows the surface network observations with symbols corresponding to the 6 6 km grid of the GOCI data. A log scale is used for the color bar.
[Figure omitted. See PDF]
Figure 1 shows the mean distributions of GOCI AOD and surface network PM for 2011–2019 or for the more limited durations of their records (2014–2019 for eastern China PM; 2015–2019 for South Korea PM). The PM networks are extensive but coverage is nevertheless sparse and often limited to large urban areas, as illustrated by the magnified inset for South Korea. We find that only 1.0 % of GOCI 6 6 km grid cells have PM observations in eastern China, 7.4 % in South Korea, and 7.9 % in Japan. This geographic limitation in the PM networks emphasizes the value of continuous coverage from the AOD data.
2.2 AOD gap-fillingFigure 2 shows the percentage of days with at least one successful hourly GOCI AOD retrieval on the 6 6 km retrieval grid. There are substantial gaps in the record, mostly reflecting clouds and also snow cover in winter (Choi et al., 2018). We seek to fill in these gaps to produce a continuous daily data set while accounting for the associated errors and leveraging information implicitly encoded in retrieval failure. We fuse the following two strategies according to the availability of nearby AOD retrievals: an inverse-distance-weighted (IDW) interpolation AOD of nearby retrievals (Shepard, 1968) and a bias-corrected monthly AOD from the GEOS-Chem CTM, as follows:
1 where is a weighting factor that depends on the distance from nearest retrievals. GEOS-Chem is a widely used CTM for inferring PM from satellite AOD data (Liu et al., 2004; van Donkelaar et al., 2006, 2016, 2019; Geng et al., 2015). Here we use scaled monthly mean GEOS-Chem AODs from a simulation by Zhai et al. (2021) for 2016 in East Asia with a 0.5 0.625 resolution, bias corrected to the annual mean GOCI AODs on the 6 6 km grid. In this way, we obtain a spatial distribution of monthly mean AOD values for 2011–2019 for use in Eq. (1).
Figure 2
Percentage of days in 2011–2019 with at least one successful hourly retrieval of AOD on the 6 6 km grid. Panel (a) shows year-round statistics, while panel (b) shows winter months (December–February; DJF) only.
[Figure omitted. See PDF]
We calculate the weighting factors used in Eq. (1) via the Gaspari–Cohn function, a fifth-order piecewise polynomial with a radial argument (Gaspari and Cohn, 1999). The Gaspari–Cohn function resembles a Gaussian distribution but with compact support that takes on a maximum value of 1 for and a minimum value of 0 for . We define for a given 6 6 km grid cell and day to be the distance from the midpoint of the grid cell to that of the nearest observed grid cell, normalized by a spatial correlation length scale, , determined from available AOD observations in and around that grid cell. We find that the value of ranges from 110 km to 170 km over our domain.
2.3 Random forest algorithmTable 1 lists the predictor variables included in the RF to infer 24 h PM as dependent variable. RF is an ensemble machine learning method, where many individual decision trees are fit to the training data and vote on an output value, with the average value taken as best estimate (Breiman, 2001).
Table 1
Random forest predictor variables for 24 h PM.
GOCI gap-filled AOD | 8 h average AOD at 550 nm |
---|---|
observations | wavelength |
from Eq. (1) | |
Meteorology | Boundary layer height (m) |
10 m meridional wind (m s) | |
10 m zonal wind (m s) | |
2 m temperature (K) | |
2 m relative humidity (%) | |
Sea level pressure (Pa) | |
Metadata | Country dummy variables |
Latitude | |
Day of year | |
Year |
The RF algorithm predicts continuous 24 h PM on a 6 6 km grid for eastern China, South Korea, and Japan after training with PM surface network data. The 8 h average 550 nm AODs on the 6 6 km grid retrieved with the YAER V2 algorithm (Choi et al., 2018). ECMWF ERA5 fields (Hersbach et al., 2020) at 30 30 km spatial resolution and hourly temporal resolution interpolated bilinearly to the GOCI grid and averaged over 24 h. Estimated from temperature and dew point using the August–Roche–Magnus approximation (Alduchov and Eskridge, 1996). The three variables that, for each of eastern China, South Korea, and Japan, have a value of 1 if a grid cell is within those national borders and of 0 otherwise.
Decision trees are fit recursively to the predictor variable. Suppose we have a collection of data elements , denoted as , each composed of predictor variables () and a corresponding list of labels that we would like to learn. In our case, denotes the observed PM concentrations from the surface networks averaged on the 6 6 km grid, and denotes the number of these observations. The algorithm works by splitting the data into left and right subsets, and , at an optimum split point determined from the predictor variables in (Pedregosa et al., 2011). The optimum split point is defined as the one that minimizes the impurity , as follows: 2 where represents the fraction of data in the subset , and represents the mean squared error of each of the subsets, as follows: 3 where is the mean of the target labels within a given subset , and is the number of elements in that subset. From there, the same algorithm is recursively applied to the left and right subsets and until the tree is grown. We follow the advice of Hastie et al. (2009) and grow trees until the data are fully classified (each leaf contains only one value).
Due to the recursive training structure, decision trees are sensitive to the data on which they are trained because a change in one split point changes the composition of all its child nodes. Individual decision trees thus have high error variance but no inherent bias. It follows that averaging many individual and uncorrelated trees should yield a low variance, low bias, prediction. We construct 200 trees in parallel and reduce the correlation between them through a bagging procedure; for each of the 200 decision trees in the RF, we sample the input data with replacement to form a new dataset of the same dimensions and then grow a decision tree from this bootstrapped data (Breiman, 2001). Because of the high input sensitivity, a wide variety of decorrelated trees are grown. The predictions of each individual tree are averaged to yield the prediction of the RF. We fit our RF using the RandomForestRegression class in the Python module Scikit-learn (Pedregosa et al., 2011). We attempted to further decorrelate the trees by following Breiman (2001) and calculating the split points of each individual tree using only a random subset of the predictor variables; however, a sensitivity test we performed showed only minor differences with the base case, and therefore, we follow Geurts et al. (2006) in considering all predictor variables in the training process.
We evaluate how the RF generalizes to predictions for the full 6 6 km domain via a 10-fold cross-validation. For each fold of the cross-validation, we leave out a randomly selected 10 % of PM network sites (averaged on the 6 6 km grid if needed) from each country. These 10 % represent the test set. Because we perform the validation 10 times, each grid cell is in the test set exactly once. We compare the predicted PM to withheld observed PM using four metrics, i.e., root mean square error (RMSE), the RMSE divided by mean observed PM (relative RMSE or RRMSE), the coefficient of variation (), and the mean bias computed by averaging the difference between predicted and observed PM (MB).
An outcome of interest is the ability of our predictions to capture exceedances of National Ambient Air Quality Standards (NAAQS). We categorize each prediction within the test sets into one of the following four classes: true positives (TPs), where both predicted and observed PM exceed the NAAQS threshold, true negatives (TNs), where neither exceed the threshold, false positives (FPs), where an exceedance is predicted but not observed, and false negatives (FNs), where an exceedance is observed but not predicted (Brasseur and Jacob, 2017; Cusworth et al., 2018). We use these classes to compute three overall prediction grades. The first, percent of detection (POD), gives the fraction of observed exceedances that were successfully predicted, as follows: 4 The second, false alarm ratio (FAR), gives the fraction of predicted exceedances that did not occur as follows: 5 The third, equitable threat score (ETS), compares how well the prediction does relative to random chance, as follows: 6 where is the number of true positives obtained by random chance, as follows: 7 ETS is 1 for perfect prediction skill and 0 for no better or worse than chance.
Predictor variable selection is an important task in implementing an RF, as the addition of non-informative variables can decrease performance. Unlike linear regression, which can naturally ignore unhelpful predictors, irrelevant data can, by chance, aid in minimizing impurity at some stage in the optimization process, making all subsequent splits suboptimal. The six meteorological variables given in Table 1 are standard in the AOD PM prediction (e.g., Kloog et al., 2014; Li et al., 2017), while the four spatiotemporal variables (location dummies, latitude, year, and day of the year) and the retrieval gap-filling parameter proved to be informative in sensitivity tests. In addition to the predictor variables in Table 1, we considered as additional variables the population density, the GOCI fine mode fraction (FMF), and the GOCI multiple prognostic expected error (MPEE), but we found that they worsened accuracy of the fit, and so we did not retain them. Because population density worsened the fit, we did not include other spatially varying but temporally fixed land use variables such as road data, elevation, or emissions. We also compared RFs trained on GOCI AOD and on GOCI–AHI-fused AOD and found no significant difference in the fitting of PM. We, therefore, use the GOCI AOD product because of its longer record.
3 Results and discussion3.1 Accuracy and precision of RF predictions
Figure 3 shows scatterplots, color-coded by count, comparing surface observations of 24 h and annual mean PM to the predicted GOCI PM values in grid cells whose records are entirely withheld from training in the cross-validation procedure. GOCI PM values for the annual mean are obtained by averaging the 24 h predictions. Table 2 gives comprehensive GOCI PM evaluation statistics for East Asia and for each country. The 24 h predictions for East Asia have a negligible mean bias of 0.23 g m (annual; 0.22 g m), though the RF underpredicts PM at the high tail of the distribution; we will return to that issue later in the context of NAAQS exceedances. The root mean square error (RMSE) between observed and predicted 24 h PM is 8.8 g m (annual; 3.3 g m), corresponding to a relative RMSE (RRMSE) of 37 % (annual; 14 %), as defined in Sect. 2.3. The prediction captures 89 % of the observed 24 h variance ( 0.89) and 96 % of annual variance ( 0.96). These results compare favorably to previous reconstructions of PM from satellite AOD data. For example, a 1 km 2000–2015 continental USA product and 3 km 2015–2016 eastern China product have cross-validations of 0.86 and 0.87, respectively, for daily PM (Di et al., 2019; Hu et al., 2019), while a global 0.01 1998–2018 product and a 0.1 2000–2016 product for China have cross-validated of 0.90–0.92 and 0.77, respectively, for annual PM (Hammer et al., 2020; Xue et al., 2019). for the annual mean PM in South Korea is relatively low (0.41), which can be explained by the weak dynamic range of observed annual PM in the country (Fig. 1), as will be discussed later in this section.
Figure 3
The ability of the random forest algorithm to predict 24 h (a) and annual mean PM (b) in East Asia. Scatterplots depict the relationship between GOCI and surface network PM at grid cells withheld from training in the cross-validation. The plots are two-dimensional histograms, where the pixel color corresponds to the count of observation/prediction correspondences within the corresponding bin on a logged scale. The identity line is plotted in black. For the annual mean PM, grid cells with fewer than 80 % of PM observation days in a given year are removed to avoid biasing the average values. For panel (a), 0.002 % of the data are not shown, as they exceed the plot range; all data are shown in panel (b).
[Figure omitted. See PDF]
Table 2Error statistics for fitting of PM data by the RF algorithm.
RMSE | RRMSE | MB | MBnr | ||
---|---|---|---|---|---|
(g m) | (g m) | (g m) | |||
24 h PM | |||||
Overall | 8.8 | 37 % | 0.89 | 0.23 | 0.23 |
Eastern China | 15 | 32 % | 0.85 | 0.49 | 0.53 |
South Korea | 6.4 | 26 % | 0.82 | 0.16 | 0.10 |
Japan | 3.6 | 27 % | 0.79 | 0.12 | 0.13 |
Annual PM | |||||
Overall | 3.3 | 14 % | 0.96 | 0.22 | |
Eastern China | 5.6 | 12 % | 0.86 | 0.53 | |
South Korea | 2.9 | 12 % | 0.41 | 0.24 | |
Japan | 1.6 | 12 % | 0.70 | 0.094 |
Comparison statistics between GOCI and surface network PM for the grid cells in each of eastern China, South Korea, and Japan are completely withheld from the RF training process in the cross-validation procedure. Statistics shown are for the root mean square error (RMSE), relative RMSE (RRMSE), coefficient of variation (), mean bias (MB), and mean bias on days where AOD retrieval fails (MBnr).
Our gap-filling strategy does not introduce bias for days without GOCI observations (and with AOD inferred, instead, from Eq. 1). Figure S1 in the Supplement shows that surface network PM has distinct distributions on days where AOD retrieval fails compared to when AOD retrieval succeeds, a pattern successfully reproduced by GOCI PM. Table 2 shows that the mean bias statistic on days where AOD retrieval fails is similar to the whole population. This suggests that the RF algorithm is able to successfully exploit the information encoded in AOD retrieval failure in making a PM prediction, a phenomenon also noted by Brokamp et al. (2018).
Table 3Ability of the RF algorithm to diagnose exceedances of air quality standards.
NAAQS | Exceedance | POD | FAR | ETS | ||
---|---|---|---|---|---|---|
(g m) | frequency | |||||
Observed | RF | |||||
24 h PM | ||||||
Eastern China | 75 | 16 % | 15 % | 78 % | 16 % | 0.63 |
South Korea (old NAAQS) | 50 | 5.9 % | 4.2 % | 57 % | 21 % | 0.47 |
South Korea (new NAAQS) | 35 | 19 % | 17 % | 73 % | 20 % | 0.55 |
Japan | 35 | 1.6 % | 0.91 % | 47 % | 17 % | 0.43 |
Annual PM | ||||||
Eastern China | 35 | 77 % | 83 % | 97 % | 9.2 % | 0.54 |
South Korea (old NAAQS) | 25 | 40 % | 44 % | 67 % | 39 % | 0.23 |
South Korea (new NAAQS) | 15 | 100 % | 100 % | 100 % | 0 % | NA |
Japan | 15 | 24 % | 20 % | 68 % | 20 % | 0.49 |
Calculated using sites withheld from training in the cross-validation procedure. National Ambient Air Quality Standards (NAAQS), which are specific to each country. We show results for the class 2 NAAQS in eastern China and for both pre-2018 (old) and post-2018 (new) NAAQS for South Korea because all observed grid cells exceed the new annual NAAQS of 15 g m. Percentage of site days (24 h standard) or site years (annual standard) exceeding the NAAQS. Percent of detection (POD) defined as the percentage of exceedances successfully detected. False alarm ratio (FAR) defined as the percentage of predicted exceedances that did not occur. Equitable threat score (ETS), which is defined as the ability of the RF to predict exceedances beyond random chance.
One potential application of PM monitoring from space would be to diagnose exceedances of national ambient air quality standards (NAAQS) at locations without network sites. Table 3 shows the NAAQS for 24 h and annual PM for the three countries and the ability of GOCI PM to diagnose NAAQS exceedances in grid cells excluded from the training process in the cross-validation procedure. The 24 h exceedances correspond to the high tails of the distributions but annual exceedances are much more widespread. The POD column shows the percent of true positives successfully detected, while the FAR shows the rate of false positives (defined in Sect. 2.3). The POD for 24 h exceedances ranges from 47 %–78 % by country (16 %–21 % – FAR). PODs are higher for annual exceedances, but that reflects the higher observed frequency of these exceedances. The ETS values ranging from 0.43–0.63 indicate that the model captures exceedances with much better skill than random guessing.
Figure 4
Cumulative probability density functions (pdf's) of 24 h and annual mean PM concentrations in eastern China, South Korea, and Japan. Surface network PM (black) is compared to GOCI PM (colored) taken from the cross-validation. The gray envelope represents the relative root mean square error (RRMSE) of the RF algorithm, as given in Table 2, measuring the predictive capability of the algorithm for individual events. The NAAQS for each country is shown as the horizontal line, with both the pre-2018 and post-2018 NAAQS shown for South Korea. The left panel scales are log–log, while the right panel scales are linear, and the -axis scales vary for the different countries.
[Figure omitted. See PDF]
The main difficulty for GOCI PM in predicting NAAQS exceedances is that many of those exceedances fall within the precision of individual predictions. This is illustrated in Fig. 4, with the cumulative probability density function (pdf) of the 24 h and annual mean PM concentrations in eastern China, South Korea, and Japan representing the same withheld data from the cross-validation as in Tables 2 and 3. The 24 h RRMSE of 26 %–32 %, depending on country (Table 2) is shown as the gray envelope and is relatively flat across the distribution. The prediction of NAAQS exceedances within that uncertainty envelope is limited by the precision of the algorithm. All of the 24 h exceedances in Japan are within that envelope, as are most of the exceedances in eastern China and Korea. China has the largest fraction of exceedances beyond the RRMSE of the GOCI PM and, therefore, the best prediction success. An additional, though smaller, cause of bias is that GOCI PM underestimates the high tail of the pdf, as is apparent in Fig. 4, which explains in particular why we achieve a better FAR than POD for 24 h PM in South Korea and Japan. Our worst NAAQS prediction performance is for annual PM in South Korea for the old 25 g m standard because most of the distribution is within the RRMSE envelope. Additionally, the already small dynamic range of the surface network annual PM (black dots) is underestimated by the GOCI PM (blue dots). These culminate in a GOCI PM estimate with good RMSE but low .
We experimented with several modifications to the RF algorithm to improve the prediction of NAAQS exceedances but with no success. These tests included training separate RFs for each of the three countries, training annual PM predictions on annual (rather than 24 h) PM data, directly predicting NAAQS exceedances by setting the learned label to be true if a day (year) is above the 24 h (annual) NAAQS for a given country, and applying different weights to the data so that the high tail is oversampled in the training process. None of these tests yielded significant improvements. Smoothing of the tails in RFs is a well-recognized problem (Zhang and Lu, 2012). Following Zhang and Lu (2012), we attempted to train RFs to predict and correct the residuals but found this to be ineffective. Part of this tail smoothing could also result from the underlying GOCI AOD land product, which has a negative bias (0.02) for high AODs and a positive bias (0.02) for low AODs (Choi et al., 2018).
3.2PM temporal trends and spatial distributions
Figure 5 shows long-term trends of annual PM for each country, as measured by the PM surface network and as inferred in the GOCI PM for both areal- and population-weighted means. We do not include GOCI PM for years before the networks became available (and, hence, when the RF could be trained) because of concern over extrapolation bias. The PM networks show decreasing trends in all three countries, and these trends are consistent with the GOCI PM for both areal- and population-weighted means, demonstrating that the trends reported by the PM networks are representative of the countries. However, the PM networks in eastern China and South Korea underestimate the population-weighted means. Moreover, trends in South Korea and eastern China become flat between 2018 and 2019 (with a slight population-weighted increase in South Korea). This could possibly reflect interannual meteorological variability (Zhai et al., 2019; Koo et al., 2020) but also an increase in oxidants producing secondary aerosol (Huang et al., 2021). Figure S2 shows maps of annual GOCI PM across the entire study domain.
Figure 5
Trends in the annual mean PM concentrations for eastern China, South Korea, and Japan. Trends determined from the national surface PM networks (dashed black line) averaged over 6 6 km grid cells, requiring at least 80 % of data for all years plotted, are compared to GOCI PM trends inferred by the random forest (RF) algorithm with continuous temporal and spatial coverage on the 6 6 km grid and weighted either by area (solid colored line) or by population (dashed colored line). Here we use an RF trained on all the data. The gridded population data are from CIESIN (2018). The national PM networks include 413 continuously observed grid cells in eastern China, 74 in South Korea, and 307 in Japan. Trends are initialized at the onset of the surface network for complete years of data; due to the unavailability of the early months of the year, 2011 is discarded for Japan, and 2014 is discarded for eastern China.
[Figure omitted. See PDF]
Figure 6 shows the changes in annual mean PM concentrations over South Korea between 2015 and 2019 as observed from the national network and as inferred from GOCI. We focus on South Korea here because it demonstrates how GOCI PM adds considerable information to a region that already has relatively good network coverage, including the detection of PM hot spots missing from the network, such as the Iksan region on the west coast of South Korea in 2015 that was subsequently added to the network by 2019. Figures S3 and S4 show analogous maps for China and Japan, respectively.
Figure 6
Annual mean PM concentrations in South Korea in 2015 and 2019. GOCI PM (a, b) inferred from an RF trained on all available data are compared to AirKorea network observations (c, d). Network observations are shown only if at least 80 % of the year was observed.
[Figure omitted. See PDF]
Figure 7 depicts the relative 2015–2019 trends of PM concentrations in South Korea derived from a linear regression applied to the annual GOCI PM in each 6 6 km grid cell. Such a spatially resolved trend analysis is uniquely enabled by the GOCI coverage. We find decreases across the country, except in the Seoul metropolitan area, which mostly shows no significant trend except for a few pixels in Incheon. These results are consistent with the spatial patterns calculated from AirKorea data by Yeo and Kim (2019), who found 2015–2018 decreases in Incheon but not Seoul or the surrounding Gyeonggi province. Despite the insignificant changes in Seoul, substantial PM decreases are found over other large urban areas including Busan, Ulsan, Daegu, and Gwangju. The three rapidly decreasing spots on the southern coast are Gwangyang, Sacheon, and Changwon, which house industrial complexes related to the South Korean shipbuilding industry that has recently declined (Jung-a, 2016). Figure S5 shows absolute 2015–2019 trends of GOCI PM concentrations across the entire study domain and demonstrates that the North China Plain has the largest overall PM reductions.
Figure 7
The 2015–2019 trends per year in PM concentrations across South Korea. The trends are obtained by ordinary linear regression of the annual mean GOCI PM in each 6 6 km grid cell, with significant regression slopes () where the RF is trained on all the available data. Grid cells with insignificant trends are plotted in gray.
[Figure omitted. See PDF]
AOD and PM in East Asia tend to have opposite seasonalities driven by boundary layer depth and RH (Zhai et al., 2021). Figure 8 compares GOCI and the surface network monthly mean PM in the Beijing, Seoul, and Tokyo metropolitan areas, with predictions coming from withheld data in the 10-fold cross-validation. Correspondence between GOCI and the network PM may be tighter than the nationwide annual means plotted in Fig. 5 because these urban areas are well observed. We see that the RF algorithm fully captures the observed seasonality in PM, although some observed monthly spikes are underestimated. The figure illustrates the lack of trend in the Seoul metropolitan area over 2015–2019 but also shows that winter and summer PM in the region have opposite and roughly equal trends, with winter growing more polluted while summers become cleaner.
Figure 8
Monthly PM concentrations in the Beijing, Seoul, and Tokyo metropolitan areas. GOCI PM inferred from the RF algorithm for totally withheld sites in the cross-validation are compared to network observations. Beijing is defined by the namesake's province boundary, Seoul by the Seoul and Incheon boundaries, and Tokyo is defined as the Ibaraki, Saitama, Chiba, Tokyo, Kanagawa, and Yamanashi prefectures.
[Figure omitted. See PDF]
3.3 Urban-scale pollution eventsWe examine here the ability of GOCI PM to capture the spatial and temporal variability of PM pollution events on urban scales. Figure 9 shows a map of GOCI PM – produced by a RF trained on all the data, with the surface network PM overlaid – across the Seoul metropolitan area on 24–29 May 2016, corresponding to a severe pollution event sampled during the Korea–United States Air Quality (KORUS-AQ) field campaign (Crawford et al., 2021). The dense PM network for Seoul shows large variability at the sub-6 6 km scale that GOCI does not resolve. However, GOCI PM captures most of the variability in the network data aggregated on the 6 6 km grid (). It also successfully captures the day-to-day variability during the event.
Figure 9
The 24 h PM concentrations during a pollution event in Seoul–Incheon (24–29 May 2016). GOCI PM inferred from the RF algorithm (background; 6 6 km grid scale) trained on all available data is compared to observations from the AirKorea surface network (circles).
[Figure omitted. See PDF]
Figure 10
Same as Fig. 9 but for a pollution event in Beijing on 16–21 December 2016.
[Figure omitted. See PDF]
Figure 10 shows an additional test of the RF algorithm with one of the most severe pollution events in the record, the 16–21 December 2016 Beijing winter haze episode. The 24 h PM concentrations exceeded 400 g m at some of the network sites. While there is a tight correspondence between the GOCI and surface network 24 h PM for Beijing grid cells ( range of 0.74–0.99), the network observations are on average 20 g m higher than the GOCI PM. The difference is most pronounced at the December 21 concentration peak which has mean observed value 396 g m to the predicted 348 g m. This reflects the RF smoothing and AOD underestimate for the high tail of the distribution, as previously illustrated in Fig. 4. It nevertheless illustrates the ability of GOCI, combined with our gap-filling method, to capture severe winter haze episodes that are particularly challenging to observe from space.
3.4 Regional air quality model evaluationRegional air quality model predictions of PM are typically evaluated with observations from surface network sites, but the spatially continuous GOCI PM fields offer more extensive coverage and, hence, a broader opportunity for model evaluation. We demonstrate this capability here with the Community Multiscale Air Quality Modeling System (CMAQ, version 4.7.1) simulations for the Korean peninsula including both South and North Korea at 9 km resolution (Bae et al., 2018, 2021). There are no surface PM data in North Korea to train the RF, so we use the South Korea categorical variable to generate the GOCI PM fields there.
The simulation for South Korea was conducted for 2015–2019 using emissions from the Clean Air Policy Support System (CAPSS) 2016 (Choi et al., 2020) for South Korea and KORUSv5 (Woo et al., 2022) for outside South Korea. The simulation for North Korea was conducted for 2016 using emissions from the Comprehensive Regional Emissions inventory for Atmospheric Transport Experiment (CREATE) 2015 (Woo et al., 2020) and CAPSS 2013. Natural aerosols, including sea salt and mineral dust, are included. To prepare the boundary conditions, a coarse domain at 27 km horizontal grid resolution covering northeastern Asia was used.
Figure 11
Mean PM concentrations in South Korea in 2015–2019 as simulated by CMAQ, measured at the AirKorea sites, and inferred from GOCI. Panels (a), (b), and (c) show scatterplots comparing the CMAQ and GOCI PM fields to the AirKorea measurements (398 sites) and CMAQ to GOCI PM on the 9 9 km CMAQ grid (1353 grid cells to compare). Panels (d), (e), and (f) show maps of the mean 2015–2019 concentrations.
[Figure omitted. See PDF]
Figure 11 illustrates the increased capability for model evaluation in South Korea enabled by the GOCI PM fields. The bottom row shows the mean 2015–2019 PM concentrations in CMAQ compared to the AirKorea network and to GOCI PM, and the top row shows comparison scatterplots. The top left panel compares the CMAQ simulation to 2015–2019 mean PM observations from the 398 AirKorea network sites. The top middle panel compares the GOCI PM to the same AirKorea network data, showing excellent agreement. The GOCI PM fields provide 1353 points for South Korea on the 9 9 km CMAQ grid, and the top right panel shows the resulting increase in capability for evaluation of the CMAQ simulation. It shows, in particular, that CMAQ underestimates PM in coastal environments, possibly because of unaccounted ship emissions.
Figure 12
Mean PM concentrations in North Korea in 2016, as simulated by CMAQ and as represented by the GOCI PM product, assuming that South Korea is a categorical variable. Panel (b) shows surface PM concentrations from the AirKorea and China Ministry of Ecology and Environment (MEE) networks.
[Figure omitted. See PDF]
Figure 12 evaluates the CMAQ simulation with the GOCI PM fields over North Korea. Unlike in South Korea, there are no observation sites in North Korea and GOCI PM offers the only opportunity for local evaluation. CMAQ and GOCI PM show dramatically different patterns. The highest PM in CMAQ is in the Pyongyang capital region, while GOCI shows highest values in the north–central region. The lack of reliable emission inventories for North Korea makes it difficult to arbitrate this difference. The RF is not trained for North Korea, which might lead to positive biases because the AOD PM ratio modeled in the Zhai et al. (2021) GEOS-Chem simulation is higher over North Korea outside the mountainous east (range of 0.010–0.013 m g) than over South Korea (0.008–0.010 m g). However, the difference could also be explained by missing emissions in the inventory. Further evaluation could be done with border sites in South Korea and northeastern China. China Ministry of Ecology and Environment (MEE) sites along the border are consistent with high PM in north–central North Korea.
4 ConclusionsWe used 2011–2019 geostationary aerosol optical depth (AOD) observations from the GOCI satellite instrument, in combination with a random forest (RF) machine learning algorithm trained on air quality network data, to produce a continuous 24 h PM data set for eastern China, South Korea, and Japan at 6 6 km resolution. The resulting gap-free GOCI PM product complements the air quality networks that cover only 1 % of 6 6 km grid cells in eastern China, 7 % in South Korea, and 8 % in Japan. It provides a general dataset for PM mapping to serve regional pollution analysis, air quality monitoring, and public health applications.
We trained the RF algorithm on gap-filled AODs from the GOCI instrument and a suite of 12 meteorological, geographical, and temporal predictor variables. Gap-filling of AODs was done by a weighted combination of nearest-neighbor data and chemical transport model fields, with the weight serving as an additional predictor variable. The RF algorithm is successfully able to exploit information encoded in AOD retrieval failure to produce a continuous product. Testing of the RF algorithm by the prediction of withheld network sites shows single-value precisions in each country of 26 %–32 % for 24 h PM and 12 % for annual mean PM, with negligible mean bias. Accuracy statistics for PM inferred on grid cells with no AOD retrieval (i.e., estimated using Eq. 1) show similar accuracy statistics to that of the entire population, indicating that the gap-filling procedure does not bias the results. The algorithm has only moderate success at predicting NAAQS exceedance events because most of these events are within the single-value precision and also because of some smoothing of the extreme high tail of the PM frequency distribution.
We compared the continuous 24 h GOCI PM fields to spatial and temporal patterns observed at the national network sites. National trends of PM inferred from GOCI and weighted by area or population are consistent with those observed at network sites (2015–2019 in eastern China and South Korea; 2011–2019 in Japan), confirming that the trends observed at these sites are representative. However, the network sites in eastern China and South Korea underestimate population exposure. The GOCI PM fields over South Korea show PM hot spots missing in the early AirKorea network (2015) that are confirmed by subsequent addition of sites to the network (2019). The spatial distribution of GOCI PM trends in South Korea shows decreases everywhere, except in the Seoul metropolitan area where trends are flat. We show with the time series in the capital cities (Beijing, Seoul, and Tokyo) that the RF successfully captures the seasonality of PM even though AOD and PM have different and often opposite seasonalities.
We examined the ability of the RF algorithm to map air quality on urban scales by an analysis of two multi-day pollution episodes in Seoul and Beijing. The algorithm captures the day-to-day temporal variability observed by the surface networks and the spatial variability on the 6 6 km scale. The highest PM concentrations are underpredicted, which reflects the smoothing of the high tail of the distribution.
The continuous spatial coverage of PM provided by the GOCI fields enables improved evaluation of the air quality models used in support of emission control policies. A comparison to a CMAQ simulation for South Korea in 2015–2019 reveals a large model underestimate in coastal environments undersampled by the AirKorea network. A comparison to a CMAQ simulation for North Korea in 2016, where the RF provides the only PM data for model evaluation, shows drastically different patterns, with the RF featuring high PM throughout North Korea. The RF results in North Korea could be affected by errors due to lack of training data but they appear consistent with the PM network observations at Chinese border sites.
More work could be done to improve our GOCI PM product. We find, in our current RF algorithm, consistent with Hu et al. (2017), that the addition of certain predictor variables such as population decreases performance. This motivated our practice of excluding spatially varying but temporally constant fields such as elevation and emissions. However, these variables have been found to be useful in other inferences of PM from AOD data (Kloog et al., 2012; Di et al., 2019), so further investigation is needed on how to accommodate them in our modeling framework. A higher-resolution meteorological reanalysis, such as ERA5-Land (Muñoz-Sabater et al., 2021), could be used for the meteorological predictor variables and enable the inclusion of additional variables such as precipitation. Additional remote sensing products such as Normalized Difference Vegetation Index (NDVI) could also be useful. More work needs to be done to address our underestimate of the high tail of the PM distribution, i.e., extreme pollution events. Such an underestimate is common in RF applications (Zhang and Lu, 2012) but could be addressed by leveraging specialized statistical tools like extreme value theory. Additional training methods could be used to improve the ability of the RF to predict NAAQS exceedances, such as data sampling adjustments. Moreover, it is possible that skill in modeling NAAQS exceedance could be improved by leveraging data that better capture diurnal variations of PM, as high concentrations tend to occur at night. The unique geostationary capability of GOCI to generate hourly AOD data could be used to produce an hourly PM product. A new GOCI AOD product with 2 2 km resolution is expected to become available in the near future and will provide the motivation to explore these improvements in a new version of our RF algorithm.
Data availability
The 24 h 6 6 km resolution daily GOCI
PM data are made freely available on Dataverse at
The supplement related to this article is available online at:
Author contributions
DCP and DJJ designed the study. DCP developed the RF and performed analysis. SZ, MB, and SK ran and analyzed the chemical transport model data. SL aided in satellite data processing. JK, HL, and JHK provided the scientific interpretation and discussion. All authors provided input on the paper for revision.
Competing interests
The contact author has declared that neither they nor their co-authors have any competing interests.
Disclaimer
Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Acknowledgements
This work was funded by the Samsung PM Strategic Research Program and the Harvard-NUIST Joint Laboratory for Air Quality and Climate (JLAQC). GOCI data were provided by the Korea Institute of Ocean Science and Technology (KIOST). Drew C. Pendergrass was funded by U.S. National Science Foundation Graduate Fellowships Program. We thank the two anonymous reviewers for their thoughtful feedback.
Financial support
This research has been supported by the Samsung Advanced Institute of Technology (Samsung PM Strategic Research Program), the Harvard-NUIST Joint Laboratory for Air Quality and Climate (JLAQC), and the U.S. National Science Foundation Graduate Fellowships Program.
Review statement
This paper was edited by Marloes Penning de Vries and reviewed by two anonymous referees.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
We use 2011–2019 aerosol optical depth (AOD) observations from the Geostationary Ocean Color Imager (GOCI) instrument over East Asia to infer 24 h daily surface fine particulate matter (PM
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
2 Department of Atmospheric Sciences, Yonsei University, Seoul, South Korea; Particulate Matter Research Institute, Samsung Advanced Institute of Technology (SAIT), Suwon, South Korea
3 Department of Atmospheric Sciences, Yonsei University, Seoul, South Korea
4 Department of Environmental and Safety Engineering, Ajou University, Suwon, South Korea
5 Jiangsu Key Laboratory of Atmospheric Environment Monitoring and Pollution Control, Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, School of Environmental Science and Engineering, Nanjing University of Information Science and Technology, Nanjing, Jiangsu, China