1 Introduction
Since the Chinese government implemented the Air Pollution Prevention and Control Action Plan in 2013, there has been a notable reduction in emissions (Zheng et al., 2018). However, despite these advancements, the issue of pollution persists and, in certain cases, has shown signs of worsening (Ren et al., 2022). The increase in concentration can be attributed not only to adverse meteorological conditions but also predominantly to unbalanced joint control of non-methane volatile organic compounds (NMVOCs) and nitrogen oxides (; Li et al., 2020). NMVOCs are vital precursors of and have a substantial impact on atmospheric oxidation capacity, thereby altering the lifetimes of other pollutants. Accurately quantifying NMVOC emissions holds significant importance in investigating their impact on chemistry and in formulating emission reduction policies.
Anthropogenic NMVOC emissions have traditionally been estimated using a bottom-up method. However, the accuracy and timeliness of these estimations face challenges owing to the scarcity of local measurements for emission factors, the incompleteness and unreliability of activity data, and the diverse range of species and technologies involved (Cao et al., 2018; Hong et al., 2017). Furthermore, uncertainties arise in model-ready NMVOC emissions due to spatial and temporal allocations using various “proxy” data for different source sectors (Li et al., 2017a). Li et al. (2021) reported substantial discrepancies among emission estimates in various studies, ranging 23 % to 56 %. Biogenic NMVOC emissions are typically estimated using models like the Model of Emissions of Gases and Aerosols from Nature (MEGAN; Guenther et al., 2012) and the Biogenic Emission Inventory System (BEIS; Pierce et al., 1998). NMVOC emissions result from the multiplication of plant-specific standard emission rates by dimensionless activity factors. Nonetheless, apart from inaccuracies in the distribution of plant functional types, empirical parameterization, especially concerning responses to temperature and drought stress, can introduce substantial uncertainties (Angot et al., 2020; Seco et al., 2022; Jiang et al., 2018). Warneke et al. (2010) determined isoprene emission rates through field measurements and conducted a comparison to MEGAN and BEIS estimates, revealing a notable tendency for MEGAN to overestimate emissions while BEIS consistently underestimated them. Similarly, Marais et al. (2014) found that MEGAN's isoprene emission estimates were 5–10 times higher than the canopy-scale flux measurements obtained from African field campaigns.
A top-down approach utilizing observed data has been developed for estimating VOC emissions. For instance, techniques based on aircraft- and ground-based field measurements such as the source–receptor relationships algorithm with Lagrangian particle dispersion model (Fang et al., 2016), mixed layer gradient techniques (Mo et al., 2020), eddy covariance flux measurements (Yuan et al., 2015), and the box model (Wang et al., 2020) have been employed to complement or verify bottom-up results. However, these approaches do not comprehensively consider the complex nonlinear chemical reactions and transport processes that VOCs undergo in the atmosphere. Formaldehyde (HCHO) and glyoxal (CHOCHO) in the atmosphere serve as crucial oxidization intermediates for various VOCs (Hong et al., 2021; Liu et al., 2012). Satellite-based observations can readily detect their presence in the form of vertical column density (VCD) from space, making them widely utilized for estimating NMVOC emissions. A commonly used approach assumes that the observed columns are locally linearly correlated with VOC emission rates (Palmer et al., 2006; Liu et al., 2012). However, this approach does not consider the spatial offset resulting from chemistry reactions and transport processes. Chaliyakunnel et al. (2019) conducted a Bayesian analysis to derive an optimal estimate of VOC emissions using HCHO measurements over the Indian subcontinent. Their results indicated that biogenic VOC emissions modeled by MEGAN v2.1 were overestimated by approximately 30 %–60 %, whereas anthropogenic VOC emissions derived from the REanalysis of the TROpospheric chemical composition (RETRO) inventory were underestimated by 13 %–16 %. Cao et al. (2018) employed the GEOS-Chem model and its adjoint, incorporating tropospheric HCHO and CHOCHO column data from the GOME-2A and Ozone Monitoring Instrument (OMI) satellites as constraints, to quantify Chinese NMVOC emissions. They demonstrated a low bias in the MEGAN model, in contrast to the significant overestimation shown in Bauwens et al. (2016), especially in southern China.
Several investigations have been conducted to explore the implications of inverted VOC emissions on surface . For instance, using the Eulerian box model, Zhou et al. (2023) employed concurrent VOC measurements to constrain anthropogenic VOC emissions. This led to improved simulations of VOCs and , with a reduction in high emissions by 15 %–36 % in the Pearl River Delta (PRD) region. Local model biases in simulating the oxidation of NMVOCs and are closely related to uncertainties in emissions (Wolfe et al., 2016; Chan Miller et al., 2017). To tackle these critical questions, Kaiser et al. (2018) applied an adjoint algorithm to estimate isoprene emission over the southeast US by downwardly adjusting anthropogenic emissions by 50 % to rectify simulations. Their findings indicated that isoprene emissions from MEGAN v2.1 were overestimated by an average of 40 %, slightly lower than the 50 % reduction in Bauwens et al. (2016). Souri et al. (2020) simultaneously optimized NMVOC and emissions utilizing Ozone Mapping and Profiler Suite (OMPS-NM) HCHO and OMI retrievals in east Asia. They found that predominantly anthropogenic NMVOC emissions from the mosaic Asian anthropogenic emission inventory (MIX-Asia) 2010 increased over the North China Plain (NCP), whereas predominantly biogenic NMVOC emissions from MEGAN v2.1 decreased over southern China after the adjustment. Unfortunately, the posterior simulations exacerbated the overestimation of levels in northern China.
Most studies regarding the inversion of NMVOC emissions or its impact on neglected the uncertainties associated with -dependent production or loss of NMVOC oxidation and . An iteratively nonlinear joint inversion of and NMVOCs using multi-species observations is expected to minimize the uncertainties in their emissions and is well-suited to address the intricate relationship among VOCs––. In this study, we extended the Regional multi-Air Pollutant Assimilation System (RAPAS) with the ensemble Kalman filter (EnKF) assimilation algorithm to enhance the optimization of NMVOC emissions over China, utilizing the TROPOspheric Monitoring Instrument (TROPOMI) HCHO retrievals with high spatial coverage and resolution. To more accurately quantify the impact of NMVOC emissions on , emissions were simultaneously adjusted using nationwide in situ observations. Process analysis was subsequently employed to quantify various chemical pathways associated with formation and loss. Through a top-down constraint on both types of emission, this study aims to offer a more scientific insight into the consequences of optimizing NMVOC emissions on and to contribute to the development of appropriate emission reduction policies.
2 Data and methods
2.1 Data assimilation system
The RAPAS system (Feng et al., 2023) has been developed based on a regional chemical transport model (CTM) and on ensemble square root filter (EnSRF) assimilation modules (Whitaker and Hamill, 2002), which are employed to simulate atmospheric compositions and infer anthropogenic emissions by assimilating surface observations, respectively (Feng et al., 2022, 2020). The inversion process follows a two-step procedure within each inversion window, in which the emissions are inferred first and then input into the Community Multiscale Air Quality Modeling System (CMAQ) to simulate initial conditions of the next window. Meanwhile, the optimized emissions are transferred to the next window as prior emissions. The two-step inversion strategy facilitates error propagation and iterative emission optimization, which have proven the superiority and robustness of our system in estimating emissions (Feng et al., 2023). In this study, we extended the data frame to include the assimilation of TROPOMI HCHO retrievals to optimize NMVOC emissions. Concise descriptions of the forecast model, data assimilation approach, and experimental settings follow.
2.1.1 Atmospheric transport model
The Weather Research and Forecasting (WRF v4.0) model (Skamarock and Klemp, 2008) and the CMAQ (v5.0.2; Byun and Schere, 2006) were applied to simulate meteorological conditions and atmospheric chemistry, respectively. WRF simulations were conducted with a 27 km horizontal resolution, covering the entire mainland of China on a grid of cells (Fig. 1). The CMAQ was run over the same domain but with the removal of three grid cells on each side of the WRF domain. The vertical settings in WRF and CMAQ were the same as in Feng et al. (2020). To account for the rapid expansion of urbanization, we updated underlying surface information for urban and built-up land using the MODIS Land Cover Type product (MCD12C1) Version 6.1 from 2022. Chemical lateral boundary conditions for NO, , HCHO, and were extracted from the output of the global CTM (i.e., the Whole Atmosphere Community Climate Model, WACCM) with a resolution of at 6 h intervals (Marsh et al., 2013). Meanwhile, boundary conditions for the other NMVOCs were obtained directly from background profiles. In the first data assimilation (DA) window, initial chemical conditions (excluding NMVOCs) were also derived from the WACCM outputs, whereas in subsequent windows, they were derived through forward simulation using optimized emissions from the previous window. Table S1 in the Supplement lists the detailed physical and chemical configurations. To assess the impact of updated NMVOC emissions on production efficiency, we further decoupled the contribution of the primary chemical processes from the levels using the CMAQ integrated reaction rate (IRR) analysis.
Figure 1
Model domain and observation network (a) and number of TROPOMI HCHO data retrievals during August 2022 in each grid (b). The dashed red frame delineates the CMAQ computational domain; black squares denote surface meteorological measurement sites; navy triangles indicate sounding sites (Text S1 in the Supplement); and red and blue dots represent air pollution measurement sites, where red dots are used for assimilation and blue dots for independent evaluation.
[Figure omitted. See PDF]
2.1.2 EnKF assimilation algorithmThe emissions are constrained using the ensemble square root filter (EnSRF) algorithm introduced by Whitaker and Hamill (2002). This approach fully accounts for temporal and geographical variations in both the transportation and the chemical reactions within the emission estimates. During the forecast step, the background ensembles are derived by applying perturbation to the prior emissions. The perturbed samples are typically drawn from Gaussian distributions with a mean of zero and a standard deviation equal to the prior emission uncertainty in each grid cell. Ensemble runs of the CMAQ were subsequently performed to propagate the background errors with each ensemble sample of state vectors.
In the analysis step, the ensemble mean of the analyzed state is regarded as the best estimate of emissions, which is obtained by updating the background ensemble mean through the following equations: where is the observational vector; represents the observation operator mapping model space to observation space; the expression quantifies the disparities between simulated and observed concentrations; illustrates how uncertainties in emissions relate to uncertainties in simulated concentrations; and the Kalman gain matrix , dependent on background error covariance and observation error covariance , determines the relative contributions to the updated analysis.
State variables for emissions include and NMVOCs. To reduce the degree of freedom in the analysis and avoid the difficulty associated with estimating spatiotemporal variations in background errors for individual species, we focus on optimizing the lumped total NMVOC emissions. During the forecast step, we differentiate individual NMVOC species emissions from the total NMVOC emissions using bottom-up statistical information. For a consistent comparison between simulations and observations, model-simulated was diagnosed at the time and location of surface measurements, whereas model-simulated HCHO was horizontally sampled to align with TROPOMI HCHO VCD retrievals, and subsequently integrated vertically.
In this study, the DA window was set to 1 d and daily TROPOMI HCHO columns were utilized as observational constraints in our inversion framework. The ensemble size was set to 50 to strike a balance between computational cost and inversion accuracy. To reduce the impact of unrealistic long-distance error correlations, the Gaspari–Cohn function (Gaspari and Cohn, 1999) was utilized as covariance localization to ensure the meaningful influence of observations on state variables within a specified cutoff radius while mitigating their negative impacts on distant state variables. The optimal localization scale is interconnected with factors such as the assimilation window, the dynamic system, and the lifetime of chemical species. Given an average wind speed of 2.8 (Table S2 in the Supplement) and a DA window of 1 d, the localization scales for and HCHO, both characterized as highly reactive species with lifespans of just a few hours, were set to 150 and 100 km, respectively.
2.2 Observation data and errors
Considering the availability of HCHO data, we utilized daily offline retrievals of tropospheric HCHO columns from Sentinel-5P (S5P) L3 TROPOMI data obtained through the Google Earth Engine (De Smedt et al., 2018). The S5P satellite follows a near-polar sun-synchronous orbit at an altitude of 824 km with a 17 d repeating cycle. It crosses the Equator at 13:30 local solar time (LST) on the ascending node. The spatial resolution at nadir was refined to on 6 August 2019. Following the recommendations in the S5P HCHO product user manual, we filtered the source data to exclude pixels with a qa_value less than 0.5 for HCHO column number density and 0.8 for aerosol index (AER_AI). The remaining high-quality pixels with minimal snow/ice or cloud interference are averaged to 27 km grids. Figure 1b illustrates the coverage and number of TROPOMI HCHO data retrievals in August 2022 after processing. Although the distribution of filtered data exhibits spatial nonuniformity, most grid cells have observational coverage for over half of the time, particularly in the southern region of China where NMVOC emissions are higher. Based on validation against a global network of 25 ground-based Fourier transform infrared spectroscopy (FTIR) column measurements (Vigouroux et al., 2020), TROPOMI overestimates HCHO emissions by 25 % ( ) in clean regions and underestimates by 30 % ( ) in polluted regions. Therefore, we set the measurement error to 30 %. To evaluate the effect of observational data retrieval errors in emission estimates, we conducted a sensitivity experiment in which HCHO columns were empirically bias-corrected according to the error characteristics described above (Fig. S1 in the Supplement). The posterior emissions increased by 12.8 % compared to those in the base experiment (EMDA), indicating that the existing retrieval error in HCHO measurements likely exerts an influence on the estimation of NMVOC emissions. The representation error can be disregarded because the model resolution significantly surpasses that of the TROPOMI pixels.
To address the chemical feedback among VOCs––, we also simultaneously optimized emissions by assimilating in situ observations. The extensively covered high-precision monitoring network can provide sufficient constraints for emission inversion (Fig. 1a). Hourly averaged surface observations were obtained from national air quality control stations from the Ministry of Ecology and Environment of the People's Republic of China (
2.3 Prior emissions and uncertainties
The prior anthropogenic and NMVOC emissions for China were obtained from the most recent Multi-resolution Emission Inventory for China from 2020 (MEIC,
As previously mentioned, the optimized emissions are transferred to the next DA window as prior emissions for iterative inversion. For biogenic emissions, they are decomposed into hourly scales, based on the daily varying temporal profiles in MEGAN as model inputs. Daily emission variations will largely dominate the uncertainty in emissions. Taking into account compensation for model errors and avoiding filter divergence, we consistently applied an uncertainty of 25 % to each model grid of emissions at each DA window, as in Feng et al. (2020). NMVOC emissions typically exhibit greater uncertainties compared to emissions (Li et al., 2017b). Based on model evaluation, the uncertainty in NMVOC emissions was set to 40 % (Kaiser et al., 2018; Souri et al., 2020; Cao et al., 2018). A sensitivity experiment involving a doubling of the prior uncertainty (80 %) revealed that the differences in posterior NMVOC emissions amounted to a mere 0.2 % (Fig. S2 in the Supplement). The implementation of a two-step inversion strategy allows for the timely correction of residual errors from the previous assimilation window in the current window, thus ensuring that the RAPAS system has relatively low dependence on prior uncertainty settings. This study also addresses uncertainties in emissions for CO, , primary PM, and coarse PM to consider the chemical feedback between different species following Feng et al. (2023).
3 Experimental design
During the summer of 2022, southern China experienced severe heat wave conditions. The combination of high temperatures and drought had a pronounced effect on vegetation growth and NMVOC emissions, thereby influencing production (Wang et al., 2023). Consequently, we opted to focus on August 2022, as it presented an ideal period for testing the capabilities of our DA system. Before implementing the emission inversion, a relatively perfect initial field is generated at 00:00 UTC on 1 August 2022 by conducting a 5 d simulation with 6 h interval 3D-Var data assimilation. Subsequently, daily emissions are continuously updated over the entire month of August (EMDA). Additionally, we designed a sensitivity experiment (EMS) to illustrate the significance of optimizing emissions in quantifying VOC– chemical reactions. In this experiment, emissions were not optimized. To validate the posterior emissions of and NMVOCs in EMDA, we compared two parallel forward simulation experiments, denoted CEP and VEP, corresponding to prior and posterior emission scenarios, respectively, against and HCHO measurements. To investigate the impact of optimizing NMVOC emissions on the secondary production and loss of surface , a forward simulation experiment (CEP1) was conducted with the prior NMVOC emissions and the posterior emissions. Another forward modeling experiment (CEP2) used the posterior emissions of EMS to evaluate its performance. All experiments employ identical meteorological fields, as well as the same gas-phase and aerosol modules. Table 1 summarizes the different emission inversion and validation experiments conducted in this study.
Table 1
The assimilation, sensitivity, and validation experiments conducted in this study.
Exp. type | Exp. name | NMVOC emissions | emissions |
---|---|---|---|
Assimilation | EMDA | MEIC 2020 and MEGAN for August (the first DA window); optimized emissions of the previous window (other DA windows) | MEIC 2020 and MEGAN for August (the first DA window); optimized emissions of the previous window (other DA windows) |
Sensitivity | EMS | Same as EMDA | MEIC 2020 and MEGAN for August |
Validation | CEP | MEIC 2020 and MEGAN for August | MEIC 2020 and MEGAN for August |
VEP | Posterior emissions of EMDA | Posterior emissions of EMDA | |
CEP1 | Same as CEP | Posterior emissions of EMDA | |
CEP2 | Posterior emissions of EMS | Same as CEP |
4.1 Inverted emissions
Figure 2 shows the spatial distribution of temporally averaged prior and posterior NMVOC emissions along with the differences in NMVOC emissions. Hotspots of prior NMVOC emissions were prevalent across much of central and southern China. However, posterior NMVOC emissions were predominantly concentrated in the NCP, Yangtze River Delta (YRD), PRD, and Sichuan Basin (SCB), areas characterized by high levels of anthropogenic activity. High emissions are also located in parts of central and southern China with a warm climate favorable for emitting biogenic NMVOCs. Employing TROPOMI HCHO observations as constraints led to widespread decreases of approximately 60 %–70 % over these areas, indicating substantial biogenic NMVOC emissions. In northwestern China, there was a moderate increase in NMVOC emissions.
Figure 2
Spatial distribution of the time-averaged (a) prior emissions (MEIC 2020 + MEGAN), (b) posterior emissions, (c) absolute difference (posterior minus prior), and (d) relative difference in NMVOCs over China.
[Figure omitted. See PDF]
Potential significant TROPOMI retrieval errors in polluted regions could exacerbate the emission decreases (Text S2 in the Supplement). Additionally, uncertainties in MEGAN parameterization have significant implications for NMVOC emission estimations, particularly concerning the responses of vegetation in MEGAN to temperature and drought stress (Angot et al., 2020; Jiang et al., 2018). Zhang et al. (2021) highlighted that the temperature-dependent activity factor noticeably increases with rising temperatures in MEGAN. P. Wang et al. (2021) pointed out that the lack of a drought scheme is one of the factors causing the overestimation of isoprene emissions in MEGAN. Opacka et al. (2022) optimized the empirical parameter in the MEGAN v2.1 soil moisture stress algorithm, resulting in significant reductions in isoprene emissions and providing better agreement between modeled and observed HCHO temporal variability in the central US. During the study period, China experienced severe heat wave conditions, which may have further hindered MEGAN's ability to effectively capture the impacts of high temperatures and drought on vegetation, thus resulting in significant overestimation in NMVOC emissions (Wang et al., 2022). Ultimately, the biogenic NMVOC emissions decreased by 53.7 %, which was higher than the 43.4 % decrease in anthropogenic NMVOC emissions (Fig. S3 in the Supplement). Overall, the large magnitude of emission decrease of 50.2 % in our inversion is comparable to studies in southern China (Bauwens et al., 2016; Zhou et al., 2023), the southeastern US (Kaiser et al., 2018), Africa (Marais et al., 2014), India (Chaliyakunnel et al., 2019), Amazonia (Bauwens et al., 2016), and parts of Europe (Curci et al., 2010) but opposite to the large-scale emission increase over China in Cao et al. (2018). For (Fig. S4 in the Supplement), the nationwide total emissions decreased by 10.2 %, with the main reductions concentrated in the NCP and YRD, in parts of central China, and in most key urban areas.
Table 2 shows the changes in emissions of biogenic NMVOCs across different land cover types (Fig. S5 in the Supplement) after inversion. The most significant reduction in biogenic emissions occurred within woody savannas, accounting for 26.9 % of the overall reduction, followed by savannas and croplands, accounting for 21.2 % and 17.2 %, respectively. Among all vegetation types, the broadleaf evergreen forests, recognized as the primary source of isoprene emission (H. Wang et al., 2021), presented the greatest uncertainty, with NMVOC emissions experiencing a significant reduction of 66.2 %. Standard emission rates in MEGAN are derived from leaf- or canopy-scale flux measurements and extrapolated globally across regions sharing similar land cover characteristics based on very limited observations (Guenther et al., 1995). This methodology introduces biases due to the large variability in emission rates among plant species.
Table 2Prior and posterior biogenic NMVOC emissions, as well as the differences in different land cover types.
Land cover type | Prior | Posterior | Difference |
---|---|---|---|
(%) | |||
Evergreen needleleaf forests | 955.7 | 549.3 | () |
Evergreen broadleaf forests | 13 985.1 | 4728.2 | () |
Deciduous needleleaf forests | 46.6 | 48.8 | 2.2 (4.7) |
Deciduous broadleaf forests | 8335.5 | 3487.4 | () |
Mixed forests | 8731.0 | 3961.7 | () |
Closed shrublands | 9.7 | 3.7 | () |
Open shrublands | 21.3 | 8.6 | () |
Woody savannas | 39 327.2 | 16 925.2 | () |
Savannas | 28 319.7 | 10 629.4 | () |
Grasslands | 16 912.7 | 14 269.6 | () |
Permanent wetlands | 286.1 | 115.4 | () |
Croplands | 25 537.8 | 11 215.5 | () |
Cropland–natural vegetation mosaics | 10 894.7 | 4289.8 | () |
Sparsely vegetated | 1814.7 | 1644.0 | () |
The emissions were first evaluated by indirectly comparing the forward simulated concentrations with measurements. As shown in Fig. S6 in the Supplement, the CEP with prior emissions exhibited positive biases in eastern China and negative biases in western China. However, when posterior emissions were used in the VEP, a substantial improvement in simulation performance was observed. Biases were limited to within , and correlation coefficients exceeded 0.7 across the entire region. Figure 3 presents the simulated HCHO VCDs using prior and posterior NMVOC emissions, along with their associated biases. Both experiments showed high VCDs over central and eastern China, especially in the YRD and SCB. However, the CEP displayed substantial overestimation across most of mainland China, with the largest bias reaching in central China. Conversely, the VEP demonstrated notable improvements in both the magnitude and spatial distribution of simulated HCHO columns after the inversion compared to TROPOMI retrievals. More than 84 % of the areas exhibited biases of less than , and no significant spatial variation was observed. Overall, the biases in simulated HCHO VCDs decreased by 75.7 % after the inversion. These results emphasize the efficiency of our system in reducing uncertainty in both and NMVOC emissions.
Figure 3
Simulated HCHO vertical column densities using prior (a) and posterior (b) NMVOC emissions, along with their biases (c and d) versus TROPOMI measurements. All model results were sampled at TROPOMI overpass time.
[Figure omitted. See PDF]
4.3Implications for surface
Figure 4 shows the spatial distribution of the mean bias (BIAS), root mean square error (RMSE), and correlation coefficient (CORR) for simulated concentrations in the CEP1 and VEP experiments compared to assimilated observations. Beyond the northwestern region of China, the CEP1 exhibited significant overestimation throughout the entire area, with a BIAS of 20.5 . In the VEP, the modeled chemical production was alleviated, especially in the southern regions of China where NMVOC emissions significantly decreased. Overall, observation-constrained NMVOC emissions resulted in a 49.3 % decrease in the BIAS, bringing it down to 10.4 . Additionally, the RMSE showed noticeable improvement due to the assimilation of HCHO observation, reducing the value from 30.9 to 23.3 . Despite a significant reduction in NMVOC emissions after inversion, notable overestimations persisted in northern provinces such as Liaoning, Hebei, Shanxi, and Shaanxi. This may be attributed to limited NMVOC constraints resulting from insufficient observations during the study period (Figs. 1b and 3d). The remaining discrepancies between simulations and observations can be attributed to the combined results of intricate urban–rural sensitivity regimes and photochemistry reactions, which may not be comprehensively represented by the CMAQ, masking any potential improvement expected from the constrained emissions (See Sect. 4.4). The CORR was comparable between the CEP1 and VEP experiments, reflecting the fact that the CMAQ effectively simulated the temporal variation in concentrations. The biases at the independent sites were similar to those at the assimilated sites (Fig. S7 in the Supplement). In comparison to CEP1, the decreasing ratios in BIAS and RMSE in the VEP were 46.7 % and 23.4 %, respectively.
Figure 4
Spatial distribution of mean bias (BIAS; a and b), root mean square error (RMSE; c and d), and correlation coefficient (CORR; e and f) for simulated using prior (a, c, and e, CEP1) and posterior (b, d, and f, VEP) emissions versus assimilated observations.
[Figure omitted. See PDF]
Figure 5 shows the time series of simulated and observed hourly concentrations and their RMSEs, verified against surface monitoring sites. The VEP achieved better representations of diurnal variations compared with those in the CEP1, especially excelling in reproducing elevated concentrations at noon. Constraining the NMVOC emissions also led to better model simulations in terms of RMSE throughout the entire study period. The time-averaged BIAS and RMSE decreased from 20.6 and 37.3 to 10.6 and 31.0 , respectively. We also evaluated the simulation results for seven key cities (i.e., Beijing, Shanghai, Guangzhou, Wuhan, Chongqing, Yinchuan, and Changchun, which represent key cities in North, East, South, Central, Southwest, Northwest, and Northeast China, respectively), and the biases in the VEP with posterior emissions all showed a significant reduction (Fig. S8 in the Supplement). Overall, the assimilation of HCHO column observations effectively reduced NMVOC emission uncertainties and consequently improved simulations of HCHO and . These improvements hold promise for further research into the implications of emission optimizations on regional photochemistry.
Figure 5
Time series comparison of hourly surface concentrations () and RMSE () from the CEP1 and VEP experiments versus all observations at 1701 monitoring sites. The blue and red values on the graph represent the time-averaged statistics in the CEP1 and VEP experiments, respectively.
[Figure omitted. See PDF]
As crucial precursors, the abundance of NMVOCs plays a significant role in modulating production. Here we employed the IRRs to elucidate changes related to production and loss stemming from constrained and NMVOC emissions at the surface. Figure 6 illustrates comparisons of the simulated maximum daily 8 h average (MDA8) surface levels and net reaction rates before and after the inversion. The CEP1 exhibited an overestimation of levels, with a BIAS of 22.6 % compared to observed concentrations. This overestimation corresponded to the high net chemical rates of in these areas (Fig. S9 in the Supplement). After inversion, net rates decreased in most regions. Consequently, the VEP experiment yielded results that closely aligned with observations, with a BIAS of 9.2 %. Referring to Fig. 6e and f, differences in production rates of closely track the changes in the NMVOC emissions (Fig. 2). The discrepancies in specific regions may be attributed to the complex nonlinear relationships associated with and its precursors, which depend on prevailing chemical regimes and regional transport. Additionally, changes in production predominantly drive the overall decrease in concentrations, outweighing changes in loss.
Figure 6
Comparisons of (a, b) simulated maximum daily 8 h average (MDA8) concentrations, (c, d) net reaction rates, and (e, f) differences in production and loss rates between CEP1 and VEP experiments at the surface. Surface MDA8 values (circles) from the national air quality control stations were overlaid in (a) and (b).
[Figure omitted. See PDF]
Figure 7 shows the differences in the six principal pathways responsible for loss and formation when comparing simulations employing prior and posterior emissions. The reactions of and are treated as the pathways leading to formation, whereas loss involves reactions including , , , and (Wang et al., 2019). Our analysis was focused on the time frame from 12:00 to 18:00 according to China standard time (CST). The differences were computed by subtracting the simulation with posterior emissions from that with prior emissions. Following emission, NMVOCs undergo rapid oxidation by atmospheric hydroxyl (OH) radicals. Due to the substantial decrease in NMVOC emissions, there was a reduction in the production of hydroperoxy radicals () and organic peroxy radicals (; Fig. S10 in the Supplement). Consequently, this reduction in levels, coupled with their reaction with NO, resulted in diminished production (Fig. 7a and b). A strong correlation was observed between changes in production via the reaction and NMVOC emissions (Fig. 2), consistent with the findings of Souri et al. (2020). Typically, in NMVOC-rich environments, a decrease in NMVOC emissions boosts OH concentrations. Consequently, we noted an enhancement in the reaction in the eastern and central regions of China. In response to heightened concentrations over these areas, increased loss through the pathway was observed. Furthermore, we detected a substantial decrease in loss through reactions with NMVOCs, especially in southern China where substantial isoprene emissions are prevalent. This reduction was primarily attributable to the decrease in NMVOC and levels. While the NMVOC reaction proceeds at a substantially slower rate than that of NMVOC OH, this specific chemical pathway remains significant in oxidizing NMVOCs and forming in forested areas (Paulson and Orlando, 1996). The difference in is primarily driven by the decrease in photolysis. Although the rate of loss decreases in some chemical pathways, overall, the rate of production dominates the changes in concentration.
Figure 7
Differences in six major pathways of production and loss between the CEP1 and VEP experiments at the surface. Time period: August 2022, 12:00–18:00 CST. P and L represent the pathways of formation and loss, respectively.
[Figure omitted. See PDF]
4.4 Discussionssimulations over China have a tendency to be overestimated in studies involving chemical transport modeling. For example, by intercomparing 14 state-of-the-art CTMs with observations within the framework of the Model Intercomparison Study for Asia (MICS-Asia) III, Li et al. (2019) identified a substantial overestimation of annual surface in east Asia, ranging from 20 to 60 . Notably, the NCP exhibited substantial overestimations, with most models overestimating by 100 %–200 % in May–October. Despite our optimization of precursor emissions, the posterior simulations still exhibit some degree of overestimation (Fig. 4), suggesting that there may indeed be an effect of systematic bias, such as meteorological fields, spatial resolution, model treatments of nonlinear photochemistry, and other physical processes. The WRF can generally reproduce meteorological conditions sufficiently in terms of temporal variation and magnitude over China (Fig. S11 in the Supplement), with small biases of , %, 0.3 , and m for temperature at 2 m, relative humidity at 2 m, wind speed at 10 m, and planetary boundary layer height, respectively. However, due to the relatively coarse spatial resolution, NO titration effects in urban areas may not be well represented in the model, which can lead to an overestimation of in these areas. Additionally, model-inherent errors arising from model structure, parameterization, and the simplification or lack of chemical mechanisms inevitably affect the simulations. For example, Li et al. (2018) reported that heterogeneous reactions of nitrogen compounds could weaken the atmospheric oxidation capacity and thus reduce surface concentration by 20–40 for polluted regions over China. These reactions have not been fully incorporated in CMAQ chemical mechanisms. However, there is still a lack of reasonable and effective algorithms for addressing model errors through assimilation (Houtekamer and Zhang, 2016). concentration and (VOC) emissions are positively correlated in - (VOC-)limited regions and negatively correlated in VOC- (-)limited regions (Tang et al., 2011). Therefore, the uncertainty in emissions can affect the model diagnosis of ––VOC sensitivity, thereby introducing substantial model errors in the HCHO yield from VOC oxidation. In the base inversion experiment (EMDA), we simultaneously assimilated and HCHO observations to optimize and NMVOC emissions. To evaluate the impact of optimized emissions on –VOC chemistry, EMS disregarded the uncertainty in and focused on optimizing NMVOC emissions. Compared to the EMDA, in areas where is significantly overestimated, NMVOC emissions in the EMS have correspondingly decreased (Fig. 8b). This might be due to the fact that under high conditions, HCHO production occurs promptly, thereby compensating for the substantial amount of HCHO already present in the atmosphere by reducing emissions (Chan Miller et al., 2017). Figure S12 in the Supplement shows comparisons of concentrations and RMSE between the simulations using posterior emissions from EMS and EMDA experiments. Compared to VEP, CEP2 showed a larger RMSE, highlighting the necessity for simultaneous optimization of emissions when evaluating the impact of NMVOC emission optimization on . Additionally, CEP2 using prior emissions exhibited lower levels over parts of NCP and YRD as well as over some urban areas (Fig. 8c) but with larger biases and RMSEs (Fig. 8d). The reduction in NMVOC emissions contributed to a partial decrease in concentration. More significantly, these areas typically align with VOC-limited mechanisms (Wang et al., 2019; W. Wang et al., 2021). Therefore, the overestimation of emissions (Fig. S4) excessively inhibits accumulation due to the titration effect, thereby disrupting the evaluation of NMVOC contributions to . This substantial disparity also seriously affects source apportionment, precursor-sensitive area delineation, and emission reduction policy formulation.
Figure 8
Spatial distribution of (a) posterior emissions in the EMS experiment, (b) differences in posterior emissions between EMS and EMDA, and differences in (c) simulated concentrations and (d) RMSE between CEP2 and VEP experiments. EMS did not optimize emissions compared to EMDA.
[Figure omitted. See PDF]
5 Summary and conclusionsIn this study, we extended the RAPAS assimilation system with the EnKF assimilation algorithm to optimize NMVOC emissions using TROPOMI HCHO retrievals. Taking the MEIC 2020 for anthropogenic emissions and MEGAN v2.1 output for biogenic sources as a priori emissions, NMVOC emissions over China in August 2022 were inferred. Importantly, we implicitly took the chemical feedback among VOCs–– into account by simultaneously adjusting emissions using nationwide in situ observations. Furthermore, we quantified the impact of NMVOC emission inversion on surface pollution using the CMAQ-IRR.
The application of TROPOMI HCHO observations as constraints led to a substantial reduction of 50.2 % compared to the prior emissions for NMVOCs in August 2022. A domain-wide significant decrease was found over areas of central and southern China with abundant forests, especially with the broadleaf evergreen forests, implying a considerable overestimation of biogenic NMVOC emissions. Observation-constrained emissions significantly improved the performance of surface and HCHO column simulations, reducing biases by 97.4 % and 75.7 %, respectively. This highlights the effectiveness of RAPAS in reducing uncertainty in and NMVOC emissions. Isolating the impact of emission changes, the posterior NMVOC emissions significantly mitigated the overestimation in prior simulations, resulting in a 49.3 % decrease in surface biases. This is mainly attributed to a substantial decrease in the reaction rate (a major pathway for production) and an increase in the reaction rate (a major pathway for loss) during the afternoon, resulting in a decrease in the simulated MDA8 surface concentrations of approximately 15 .
Sensitivity inversions demonstrate the robustness of top-down emissions to variations in prior uncertainty settings, yet they are sensitive to HCHO column biases, highlighting the importance of comprehensive validation studies utilizing available remote-sensing data and, if possible, airborne validation campaigns. Moreover, we found that, in comparison to optimizing NMVOC emissions alone, the joint optimization of NMVOC and emissions can significantly improve the overall performance of simulations. Ignoring errors in emissions introduces uncertainty in quantifying the impact of NMVOC emissions on surface , especially in areas where overestimated emissions can unrealistically amplify titration effects, highlighting the necessity of simultaneous optimization of emissions.
Data availability
The observations used for assimilation and the optimized emissions from this study can be accessed at 10.5281/zenodo.10079006 (Feng and Jiang, 2023).
The supplement related to this article is available online at:
Author contributions
SF and FJ conceived and designed the research. SF developed the data assimilation code, analyzed data, and prepared the paper with contributions from all co-authors. FJ supervised and assisted in conceptualization and writing. TQ, NW, MJ, SZ, JC, FY, and WJ reviewed and commented on the paper.
Competing interests
The contact author has declared that none of the authors has any competing interests.
Disclaimer
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Regarding the maps used in this paper, please note that Figs. 1–4, 6–8, and the key figure contain disputed territories.
Acknowledgements
This work is supported by the National Key R&D Program of China (grant no. 2022YFB3904801), the National Natural Science Foundation of China (grant nos. 42305116 and 42377102), the Natural Science Foundation of Jiangsu Province of China (grant no. BK20230801), and the Hangzhou Agricultural and Social Development Scientific Research Project (grant no. 202203B29). The authors also gratefully acknowledge the use of the High-Performance Computing Center (HPCC) blade cluster system of Nanjing University for the performance of the numerical calculations in this paper.
Financial support
This research has been supported by the National Key Research and Development Program of China (grant no. 2022YFB3904801), the National Natural Science Foundation of China (grant no. 42305116 and 42377102), the Natural Science Foundation of Jiangsu Province (grant no. BK20230801), and the Hangzhou Science and Technology Bureau (grant no. 202203B29).
Review statement
This paper was edited by Zhibin Wang and reviewed by two anonymous referees.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2024. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Non-methane volatile organic compounds (NMVOC), serving as crucial precursors of
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details




1 Jiangsu Provincial Key Laboratory of Geographic Information Science and Technology, International Institute for Earth System Science, Nanjing University, Nanjing, 210023, China
2 Jiangsu Provincial Key Laboratory of Geographic Information Science and Technology, International Institute for Earth System Science, Nanjing University, Nanjing, 210023, China; Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing, 210023, China; Frontiers Science Center for Critical Earth Material Cycling, Nanjing University, Nanjing, 210023, China
3 School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing, 210042, China
4 College of Carbon Neutrality Future Technology, Sichuan University, Chengdu, 610207, China
5 Hangzhou Municipal Ecology and Environment Bureau, Hangzhou, 310020, China
6 Jiangsu Provincial Key Laboratory of Geographic Information Science and Technology, International Institute for Earth System Science, Nanjing University, Nanjing, 210023, China; Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing, 210023, China