1 Introduction
Air pollution is a major environmental risk for human health and ecosystems in Europe. Over the past decades the European Union (EU) has put in place several measures to reduce anthropogenic emissions of pollutants. In response to emissions reductions, concentrations of SO, NO and particles measured over Europe have shown a clear decrease since 1990 (EEA, 2018; EMEP, 2016).
The evolution of O trends is less clear despite the decrease in its precursors. The magnitude of high ozone episodes decreased, while annual average ozone levels measured at EMEP (European Monitoring and Evaluation Programme) stations were increasing in the 1990s and show a limited negative trend from 2002 onwards. As shown in the Tropospheric Ozone Assessment Report (Tarasick et al., 2019), this feature is generally attributed to the changing global tropospheric ozone baseline for which further hemispheric control strategies are needed. The same conclusions could be drawn from the Malherbe et al. (2017) study, which focused on France, with significant reductions in NO and particle concentrations and an increase in average O offset by a slight decrease in peak O. Despite these reductions in emissions and pollutant concentrations (with the exception of the annual average O), a proportion of French citizens are still exposed to concentrations above the EU limit and target value, and air quality in EU remains one of the main reasons for premature deaths (IHME, 2013).
As a complement to observations (which provide only partial spatial information), accurate, high-spatial-resolution and up-to-date air pollution maps are important information for assessing air pollution trends and exposure. They should provide geographically detailed information on the concentrations of air pollutants over the whole territory. These maps serve as a basis for informing citizens, designing and stratifying monitoring networks, supporting policy strategies, and measuring their impact. They are also used to estimate population exposure to air pollutants, which is essential for epidemiological studies.
On a European scale, different mapping approaches have been used to produce maps of pollutant concentrations. These maps can be obtained by modeling using a regional chemistry transport model (CTM) that simulates the concentration of pollutants over Europe. However, these models cannot always be used over the whole of Europe with a high resolution and have some biases and limitations in spatial representativeness. Regression methods (Briggs et al., 2000; Beelen et al., 2009) are also used at different scales. These stochastic modeling techniques develop statistical associations between potential “predictor variables” (land use, emission sources, topography) and measured pollutant concentrations in order to predict concentrations at an unsampled site. Other frequently used techniques are kriging techniques. These geostatistical techniques are based on the assumption that the data are spatially autocorrelated and therefore take into account the distances between measurements and the spatial structure of the variable. Different types of kriging are used to map the concentrations of air pollutants. Over France, kriging methods combining information from a regional CTM (CHIMERE; Mailler et al., 2017) and observations are produced daily by the Prev'air operational forecasting and mapping system for air quality (Rouïl et al., 2009). Since 2003 (for ozone) and 2005 (for PM), the concentration maps simulated for the day before in Prev'air have been corrected each morning using observations. The kriging technique used in Prev'Air has evolved over time, and PM and NO concentrations are now also corrected for the day before. Today, a kriging of hourly observations with CHIMERE as external drift is applied to map NO and O concentrations. Since 2017, for the mapping of PM and PM concentrations, the method used is an hourly co-kriging of PM and PM data with CHIMERE as external drift. These choices are the results of successive studies that compared different kriging techniques (Malherbe and Ung, 2009, Beauchamp, 2015). A similar methodology was implemented for an earlier reconstruction of outdoor air pollution in Europe for the period 1989–2008 in Bentayeb et al. (2014). There are also ambient air pollution maps produced at European scale at 1 km resolution by the European Environment Agency but only for selected annual indicators and without consistency for multi-year reconstructions (Horálek et al., 2012, 2020). The Copernicus Atmosphere Monitoring Service has also produced European analyses since 2015, but again there is no multi-year consistency as these European maps are produced on an annual basis with gradually improving methodologies (Marécal et al., 2015). At the global scale, the Global Burden of Disease also makes available air pollution exposure maps; a recent update of the methodology was presented in (Shaddick et al., 2017), but the resolution is 0.1 or about 10 km.
The purpose of this paper and the associated datasets is to present and provide mapped data of O, NO, PM and PM concentrations at high spatial and temporal resolution and associated regulatory indicators covering the French metropolitan territory for the period 2000–2015 (2007–2015 and 2009–2015 for hourly concentrations of PM and PM). The same kriging technique as in the Prev'air system is used to combine modeled and observed concentrations. Hourly concentrations of PM, PM, NO and O are produced and mapped over France, and these hourly data are then used to calculate and map European and French air quality standards.
2 Methods
Model outputs and measurements from the permanent monitoring network were combined by external drift kriging (Malherbe and Ung, 2009; Benmerad et al., 2017) to construct hourly concentration maps over France for a long period: 2000 to 2015. Details on the input data and methods used are described in the following paragraphs. From these corrected hourly concentration data, annual regulatory air quality maps are then constructed over France.
2.1 Monitoring data
Hourly measurements are extracted from validated reference datasets. For France, observations are extracted from the national air quality databases: BDQA (Base de Données de Qualité de l'Air) before 2013 and GEODAIR (
Table 1
Number of background French monitoring sites for the years 2000 to 2015.
2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 | 2007 | 2008 | 2009 | 2010 | 2011 | 2012 | 2013 | 2014 | 2015 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
O | 284 | 310 | 337 | 362 | 378 | 396 | 404 | 405 | 399 | 385 | 376 | 360 | 347 | 318 | 319 | 331 |
NO | 274 | 290 | 299 | 322 | 337 | 353 | 353 | 350 | 352 | 337 | 334 | 316 | 299 | 284 | 282 | 300 |
PM | 119 | 125 | 171 | 212 | 219 | 238 | 126 | 219 | 252 | 241 | 249 | 245 | 240 | 218 | 173 | 251 |
PM | 62 | 69 | 74 | 84 | 89 | 90 | 105 |
Until 1 January 2007, operational monitoring of PM and PM was carried out in France by automatic measuring systems of the TEOM (tapered element oscillating microbalance; PM, PM) or Beta (PM) type. However, compared to the reference method EN 12341 (gravimetry), these systems underestimate the concentrations of particles. This is a known artifact related to the loss of semi-volatile compounds. To correct PM concentrations measured before 2007, a simple approach consists of applying a uniform correcting factor over France. This method is not suitable for correcting hourly or daily concentrations, but it has been shown to work well for annual average PM concentrations (Malherbe et al., 2017, Bessagnet et al., 2008). The factor (1.36) is a median value calculated on the PM data from “reference” sites (Bessagnet et al., 2008). As a consequence, for the period 2000 to 2006, the only PM indicator available is the annual average concentration. Concerning PM, given the few reference measurements available before 2009, the reliability of even annual measurements is low. It was therefore decided to apply the kriging methodology only from the year 2009 onwards, which is when the change in measurement method had become widespread.
Figure 1
PM: statistical indicators calculated using cross-validation technique on daily mean PM values measured and estimated over rural background stations for the years 2007 to 2015. (a) Number of rural stations for each year. (b) Mean bias (black circles), RMSE (colored rectangles), correlation (grey crosses and the associated dashed lines) and mean observation (horizontal lines).
[Figure omitted. See PDF]
Figure 2
PM: statistical indicators calculated using cross-validation technique on daily mean PM values measured and estimated over URBAN background stations for the years 2007 to 2015. (a) Number of rural stations for each year. (b) Bias (black circles), RMSE (colored rectangles), correlation (grey crosses and the associated dashed lines) and mean observation (horizontal lines).
[Figure omitted. See PDF]
Figure 3
PM: statistical indicators calculated using cross-validation technique on daily mean PM values measured and estimated over rural background stations for the years 2009 to 2015. (a) Number of rural stations for each year. (b) Bias (black circles), RMSE (colored rectangles), correlation (grey crosses and the associated dashed lines) and mean observation (horizontal lines).
[Figure omitted. See PDF]
Figure 4
PM: statistical indicators calculated using cross-validation technique on daily mean PM values measured and estimated over URBAN background stations for the years 2009 to 2015. (a) Number of rural stations for each year. (b) Bias (black circles), RMSE (colored rectangles), correlation (grey crosses and the associated dashed lines) and mean observation (dotted horizontal lines).
[Figure omitted. See PDF]
2.2 CHIMERE simulationsThe CHIMERE chemistry transport model (Couvidat et al., 2018) is used to estimate air pollution levels for metropolitan France with a resolution of about 4 km () over the years 2000 to 2015. This model has long been implemented and assessed in France as the main component of the national air quality forecasting and monitoring system PREV'AIR (Honoré et al., 2007). Two types of input data are used to simulate concentrations.
Prior to 2010, a configuration similar to the one used in the EURODELTA-Trends project (Colette et al., 2017) is used. The methodology of Colette et al. (2017) is used to reconstruct the emissions of the main air pollutants (non-methane volatile organic compound (NMVOC), , CO, SO, NH and primary PM): the annual emissions of each country, broken down by SNAP (Selected Nomenclature for reporting of Air Pollutants) sectors, are estimated using the GAINS (Greenhouse gases and Air pollution Interactions and Synergies) model (Amann et al., 2011) for the years 2000, 2005 and 2010 . To derive emissions for intervening years, sectorial results for 5-year periods are linearly interpolated. Meteorological data are simulated with the Weather Research and Forecast Model (WRF version 3.3.1; Skamarock et al., 2008) from 2000 to 2010.
For the period 2011 to 2015, year-to-year emissions of the main pollutants are taken from the cooperative program for the monitoring and evaluation of the long-range transmission of air pollutants in Europe (EMEP) available at
For these two datasets, the spatialization of emissions over France is performed with a 1 km proxy based on the national bottom-up emission inventory (available at
Figure 5
O: statistical indicators calculated using cross-validation technique on daily mean O values measured and estimated over rural background stations for the years 2000 to 2015. (a) Number of rural stations for each year. (b) Bias (black circles), RMSE (colored rectangles), correlation (grey crosses and the associated dashed lines) and mean observation (horizontal lines).
[Figure omitted. See PDF]
2.3 KrigingHourly atmospheric concentration fields are estimated by universal kriging, a geostatistical method. Kriging aims to estimate the value of a random variable (random process which describes the observations) at locations from the measurements. Kriging relies on the concept of spatial continuity, which implies that measurements that are close to each other will be more similar than distant measurements. In addition, kriging requires a good knowledge of the spatial structure of the interpolation domain which is represented by the variogram or co-variogram (second-order properties) of a random function (Goovaerts, 1997; Wackernagel, 1996; Chiles and Delfiner, 2012; Lichtenstern, 2013). Kriging involves deriving the linear combination of the observations which ensures the minimal estimation variance under a non-bias condition. At a point , the concentration estimate is given by Eq. (1).
1 where ), , are the observed concentrations at sampling locations through the entire domain (unique neighborhood) or within a limited neighborhood of (moving neighborhood), and , , are the kriging weights.
Figure 6
O: statistical indicators calculated using cross-validation technique on daily mean O values measured and estimated over URBAN background stations for the years 2000 to 2015. (a) Number of urban stations for each year. (b) Bias (black circles), RMSE (colored rectangles), correlation (grey crosses and the associated dashed lines) and mean observation (horizontal lines).
[Figure omitted. See PDF]
Among the kriging methods, the universal kriging (especially external drift kriging) allows us to consider additional information to make estimates more accurate. This approach is based on a linear regression with auxiliary variables and a spatial correlation of the residuals, and it allows us to combine simultaneously observations and additional information. The main hypothesis is that the global mean of the random variable is not constant through the domain, and it relies on explanatory variables. This kriging technique has been used for several years in monitoring air quality systems for spatial interpolation at the regional scale (PREV'AIR; Malherbe and Ung, 2009). For , which is the pollutant concentration to be estimated at a location , the hypothesis is a linear relation between and the considered auxiliary variables as explained by Eqs. (2) and (3). where is the drift of the mean, are the coefficients of the linear regression, and are the auxiliary variables. corresponds to the stationary random process which is associated with a semi-variogram. In addition, the kriging weights must satisfy the drift condition described in Eq. 4. 4
In this work, kriging is performed with surface monitoring observations, and the drift is described by the outputs from the CHIMERE chemistry transport model. European stations located outside the French domain are included in the kriging to increase accuracy at the borders. The kriging is performed using a moving neighborhood as this allows for local adjustment of the relationship between the measurements and CHIMERE. The concentration at each grid point is estimated within a window of 80 monitoring sites. This number has been adjusted in previous studies by sensitivity tests (Benmerad et al., 2017; Beauchamp et al., 2017). In addition, smoothing is applied to avoid discontinuities in the map (Beauchamp et al., 2015); the smoothing methodology was adapted from Rivoirard and Romary (2011). The final output resolution is the same as for the CHIMERE model: approximately 4 km resolution ().
For PM (particles with a radius 10 ) and PM (particles with a radius 2.5 ) a co-kriging with external drift is applied. Co-kriging is an extension of kriging to the multivariate case. It allows the estimate of PM or PM concentrations by a linear combination of the two-variable data. The particularity of co-kriging is the use of the cross variance or semi-variance between the principal variable and the secondary variable. In the case of co-kriging with external drift, the simple and cross variograms are built based on residuals (De Fouquet et al., 2007). Co-kriging allows us to take into account the correlation between PM and PM and to improve consistency between PM and PM estimates (Beauchamp et al., 2015). This co-kriging also allows the PM estimate to benefit from the higher density of PM monitoring stations.
2.4 Output: regulatory air quality indicatorsFrom the hourly kriged concentrations, several air quality indicators (regulatory and used in health impact assessments) are calculated and mapped over France. The complete list and definition of these indicators are given in Table 2.
Table 2
Yearly regulatory air quality indicators from EU legislation or French legislation and usual indicators.
ID | Pollutant | Statistics | Threshold | Threshold origin | Target toprotect |
---|---|---|---|---|---|
NO2_avgannual | Yearly average | 40 m | Limit value (EU) | Human health | |
O3_avgannual | Yearly average | ||||
O3_AOT40 | AOT40 from May to July | 6000 m | Long-term objective | Vegetation | |
O3_AOT40_5years | AOT40 from May to July (5-year average) | 18 000 m | Target value (EU) | Vegetation | |
O3_SOMO35 | Sum of excess of max daily 8 h averagesover 35 ppb ( m) calculated forall days in a year; SOMO35 (sum of means over 35 ppb) | Health impactassessment | Human health | ||
O3_T120 | Number of days for which the runningaverage over 8 h exceeds 120 m | Quality objective (EU) | Human health | ||
O3_T120_3years | Number of days for which the running 8 h average exceeds 120 m (averaged over 3 years) | Not to exceed more than 25 d a year | Target value (EU) | Human health | |
O3_T180 | Number of hours exceeding the average value of 180 m | Recommendation and informationthreshold (France) | Human health | ||
O3_T240 | Number of hours exceeding the averagevalue of 240 m | Alert threshold(France) | Human health | ||
PM10_avgannual | PM | Yearly average | 40 m | Limit value (EU) | Human health |
PM10_t50 | PM | Number of days exceeding the averagevalue of 50 m | Not to exceed more than 35 d a year | Limit value (EU) | Human health |
PM10_t80 | PM | Number of days exceeding the averagevalue of 80 m | Alert threshold(France) | Human health | |
PM25_avgannual | PM | Yearly average | 25 m | Limit value (EU) | Human health |
AOT40 (expressed in m h) means the sum of differences between hourly concentrations greater than 80 m ( ppb or parts per billion) and 80 m for a given period using only the 1 h values measured daily between 08:00 and 20:00 CET.
3 Data validationUsually the quality of the estimated concentration maps is assessed using statistical indicators that compare observations and estimated concentrations at the monitoring stations in the domain. Here, information of all background stations in the domain is already used to produce the maps. Therefore, for a fair comparison, the cross-validation method is used. The cross-validation method calculates the quality of the spatial interpolation for each measurement station point from all available information except the selected station point; i.e., it retains one data point and then makes a prediction at the spatial location of this point. This procedure is repeated for all measurement points in the available set, thus allowing the quality of the predicted values to be assessed at locations without measurements (provided they are within the area covered by the measurements).
It was noticed that the scores are systematically different at rural and urban stations (even though the kriging technique used here is not differentiated by the type of station). This is why the results of the cross-validation are described by pollutant and differentiated by station type (rural and urban types are presented here). Three statistical indicators are calculated on the basis of the daily average concentration: the mean bias, the root mean squared error (RMSE) and the Pearson correlation (). For each year, they are first calculated for the “left out” station, and then the median values over all stations are calculated.
Leave-one-out validation is a commonly used method in the air quality community (see, for example, ETC reports on air quality mapping (Horálek et al. 2017)) which is presently recommended by FAIRMODE (Janssen and Thunis, 2020). However, scores derived from the results of the leave-one-out validation might be influenced by areas where the density of sampling points is highest. For this reason, during the FAIRMODE project (Riviere et al., 2019), for which a kriging method similar to the one conducted here was conducted, a comparison has been performed between cross-validation results obtained by the leave-one-out cross-validation and cross-validation results obtained by the 5-fold cross-validation (cross-validation leaving 20 % of stations out). Results and related scores were very similar. We therefore decided to continue with the leave-one-out cross-validation process for the validation of our kriging results.
Figure 7
NO: statistical indicators calculated using cross-validation technique on daily mean NO values measured and estimated over rural background stations for the years 2000 to 2015. (a) Number of rural stations for each year. (b) Bias (black circles), RMSE (colored rectangles), correlation (grey crosses and the associated dashed lines) and mean observation (horizontal lines).
[Figure omitted. See PDF]
Figure 8
NO: statistical indicators calculated using cross-validation technique on daily mean NO values measured and estimated over URBAN background stations for the years 2000 to 2015. (a) Number of urban stations for each year. (b) Bias (black circles), RMSE (colored rectangles), correlation (grey crosses and the associated dashed lines) and mean observation (horizontal lines).
[Figure omitted. See PDF]
3.1PM
The scores show a good representation of the observations by the kriged data with correlations between 0.77 and 0.86 and RMSE of about 7 m, i.e., between 30 % and 50 % of the annual mean PM concentration. The mean biases are particularly low for urban stations with values below %. For rural stations the average bias is less than 3 m, i.e., less than 15 %. The proportion between rural and urban stations varies between and . The larger number of urban stations allows a better capture of the spatial variability in concentrations in urban environments.
Looking at the evolution of the scores over the years for rural stations, the number of stations available first increases from 2009 to 2012 before decreasing until 2014. In 2015 a new increase in the number of stations in France begins. For urban stations, the decrease starts earlier (2010), but the evolution is the same. The temporal evolution of the scores generally follows the number of stations with higher correlations and smaller relative mean biases and RMSE when more stations are available. Indeed, the greater the number of stations there are, the more representative the kriging technique will be of the real spatial variability. There are exceptions, however, like in 2015 for rural stations, which had the second worst scores even though that year has the largest number of stations.
3.2PM
There are between a half and a third as many PM stations as PM stations. However, by using a co-kriging technique, the PM mapping also benefits from PM information so that the correlations, mean bias and RMSE are almost similar to the PM scores. The mean biases for rural stations do not exceed 20 % of the mean concentrations and are very low for urban stations (between 0 and 3 %). As for PM, this bias is systematically positive for rural stations (overestimation) and slightly negative over urban stations (underestimation). This is mainly related to the resolution of the data which smoothes the concentration gradients, giving a unique value for each grid (about 4 km horizontal resolution). For urban station, located close to PM precursor emissions and generally having high concentration values, this smoothing effect leads to an underestimation. For rural areas far from emission precursors, the opposite is observed.
The correlation is generally higher than 0.8, and the RMSE does not exceed 7 m (at most 50 % of the annual mean concentration).
Figure 9
[Figure omitted. See PDF]
Figure 9
PM annual mean concentrations from 2000 to 2015. Concentrations are obtained through a combination of regional modeling and observations.
[Figure omitted. See PDF]
3.3O
The comparison between estimated and observed ozone at rural stations shows good correlations (0.8 to 0.87), small relative mean negative biases ( % to %) and low RMSE (around 20 % of the annual mean concentration). Between 2000 and 2007, the number of rural stations increased, resulting in improved modeled concentration maps. The small decrease in the number of stations after 2007 does not penalize the scores for these years.
The same conclusions can be drawn for the urban ozone scores. The higher number of urban stations even leads to slightly better scores, with correlations above 0.9 for all years and relative mean positive biases not exceeding 5 %. A satisfactory RMSE is also obtained for all years with values around 20 % of the annual mean concentration. It can be seen that the positive and negative biases are reverse with respect to the PM scores. Indeed, the highest O values are generally observed in rural areas where precursors have had time to produce O and where O destruction is lower than in urban areas. Therefore, the smoothing effect has the opposite effect to that of PM.
Figure 10
Trends in PM annual mean concentration. Sen slope coefficient ( m yr) calculated over the period 2000–2015.
[Figure omitted. See PDF]
Figure 11
PM annual mean concentrations from 2009 to 2015. Concentrations are obtained through a combination (kriging) of regional modeling and observations.
[Figure omitted. See PDF]
Figure 12
Trends in PM annual mean concentration. Sen slope coefficients ( m yr) calculated over the period 2009–2015.
[Figure omitted. See PDF]
Figure 13
[Figure omitted. See PDF]
Figure 13
SOMO35 indicator for the period 2000 to 2015. Ozone concentrations are obtained through a combination (kriging) of regional modeling and observations.
[Figure omitted. See PDF]
Figure 14
Trends in annual mean O concentrations (in m yr) (a), as well as SOMO35 (in m d yr) (b) and AOT40 (in m h yr) (c) indicators. Sen slope are calculated over the period 2000–2015.
[Figure omitted. See PDF]
3.4NO
Rural scores for NO are worse for particles and O. The correlations are between 0.55 and 0.7, but above all, strong positive biases are observed for all years with an overestimation of the observations of 60 % to 80 %. This also affects RMSE scores that can exceed 100 % of the annual mean concentration. This poor performance can be explained by the strong spatial gradients in NO concentrations due to its shorter atmospheric lifetime than O or particles. There are too few rural stations to properly capture this variability in the kriging technique used here, so the urban stations have too much weight, and the raw model concentrations also overestimate rural concentrations.
The urban scores for NO are much better than the rural scores. The correlations are around 0.8, the biases do not exceed , and the RMSE is between 10 and 12 m (less than 25 % of the annual mean concentration). The high number of urban background stations seems satisfactory to allow the kriging technique to correctly reproduce the spatial variability in NO in urban background environments. It should be noted, however, that traffic stations are not used in the present analysis (whether as observational data to be compared with or included in kriging).
Figure 15
[Figure omitted. See PDF]
Figure 15
NO annual mean concentrations for the period 2000 to 2015. NO concentrations are obtained through a combination of regional modeling and observations.
[Figure omitted. See PDF]
3.5 Comparison with other scoresIn order to evaluate the added value of the kriging technique compared to the raw CHIMERE model simulations, the cross-validation scores can be compared to the raw model scores. Table 3 shows the scores averaged over all years and all observations without distinction of typology.
Table 3
Validation scores for the raw data and the kriged concentrations (cross-validation). Annual scores (bias, RMSE and the Pearson correlation coefficient ) are calculated over France for all years and all stations and are averaged.
NO | O | PM | PM | |
---|---|---|---|---|
Raw | ||||
Bias | 3.46 | |||
RMSE | 12.97 | 17.26 | 12.63 | 8.73 |
0.55 | 0.73 | 0.71 | 0.75 | |
Kriged concentration | ||||
Bias | ||||
RMSE | 10.41 | 12.54 | 7.64 | 5.83 |
0.81 | 0.92 | 0.85 | 0.87 |
All scores are strongly improved by the kriging method of observations with CHIMERE in external drift. However, as can be seen in the previous figures, this improvement is more pronounced in urban areas than in rural areas due to the much larger number of stations in urban areas, which makes the kriging more representative of these areas.
The cross-validation scores can also be compared with those obtained in Europe with other mapping methods. Chen et al. (2019) compared 16 algorithms to develop Europe-wide spatial models of PM and NO, included linear stepwise regression, regularization techniques and machine learning methods. Those models were developed based on the 2010 routine monitoring data from the Airbase dataset, satellite observations, dispersion model estimates and land-use variables as predictors. De Hoogh et al. (2018) also performed cross-validation of their fine-spatial-scale land-use regression models (also based on Airbase dataset, satellite observations, dispersion model estimates and land-use variables as predictors) used in Europe for the year 2010. Results from their cross-validation are compared to our own cross-validation results in Table 4.
The comparison of performance in these three studies is of course limited by the fact that the spatial coverage differs: in De Hoogh et al. (2018) and Chen et al. (2019), the cross-validation is computed over the whole of Europe. In this study, the performances are assessed over France.
For all pollutants the spatial correlation () is better in our study. At the same time, higher RMSEs are also found for our study. This may be due to a larger bias, but we also demonstrated in our paper that the bias was very small, except at rural stations. Since the RMSE score also depends on the absolute concentrations, the different spatial coverage may also play a role. The lower RMSE over Europe could be an artifact of including areas where absolute concentrations of , PM or are lower than over France.
The validation scores obtained, as well as the comparison with raw data and with other mapping methods, allow us to be confident about the validity of the concentrations obtained and their good representativeness of background concentrations, in particular in urban areas. A crucial step appears, however, when it comes to the representativeness of rural concentrations which are overestimated in our results.
4 Results and discussionAfter ensuring the validation of the kriged concentration data, yearly indicators, trend over years and human exposition are calculated. Hourly concentrations fields are produced from 2000 to 2015 for NO, O and PM; however, as explained in Sect. 2, for PM only annual mean indicator maps are produced before 2007. PM hourly concentrations are calculated for the years 2009 to 2015 due to the limited number of background stations available before 2009.
4.1 Concentration maps and trends
All the indicators presented in Sect. 2 are calculated, but the following sections focus on the annual averaged concentrations of PM, PM, NO and O, as well as SOMO35 and AOT (two indicators associated with O), for which mapped data are presented. These indicators are presented in this paper and available on a Zenodo repository and on an online map library (see Sect. 5). A total of 34 trend analyses over the period are performed by calculating the Sen–Theil regression slope for each grid point on the domain. To characterize the significance of these trend slopes, the 95 % confidence interval is calculated. This confidence interval represents the lower and upper values above or below which there is 95 % confidence that the trends will occur. The smaller the confidence interval is, the more statistically significant the trend will be. Large confidence intervals are considered as unrepresentative, especially those containing 0. Trend slopes and confidence intervals are calculated for each grid point in the domain, and country-averaged values are also given in Table 5.
4.1.1
PM
Maps of annual average PM concentration maps are presented in Fig. 9 for the period 2000–2015. The resolution of the grid (around 4 km) allows us to see patterns such as interconnected cities, especially in the latest years during which the patterns of large inter-regional concentrations are decreasing. The impact of meteorological conditions is also visible through the interannual variability. For example, the 2003 heat wave year is associated with higher PM levels due to the increased formation of secondary aerosols.
Figure 16
Trends in yearly mean NO concentrations. Sen slope coefficients ( m yr) are calculated over the period 2000–2015.
[Figure omitted. See PDF]
Figure 10 shows the mapped trends in annual average PM expressed as Sen–Theil regression slope in micrograms per cubic meter per year and calculated over the period 2000–2015.
There is a downward trend in PM annual mean concentrations everywhere in France and in particular in the regions with the highest PM concentrations at the beginning of the period: the south of France (east and west), the Auvergne–Rhône–Alpes region, the east (Grand Est) and the extreme north of France. A country-averaged downward trend in PM concentrations of m per year is estimated over the period 2000–2015 (spatial average of the trends calculated on each grid point). This trend is statistically significant on average over France with a narrow 95 % confidence interval – [, ] – that does not include zero (see Table 5) and applies to almost all grid points (maps of confidence interval not shown here). Taking the year 2000 as the base year, this amounts to a 39 % reduction. In a study conducted for France over the period 2000–2010, Malherbe et al. (2017) estimated a downward trend that was twice as small (0.4). This reflects the accelerated decline in concentrations in France in recent years.
This significant downward trend is the result of the decrease in primary pollutant emissions over these 16 years in response to emission reduction measures. From 2000 to 2015, primary PM emissions over France were reduced by 39 %, as well as emissions of PM precursors such as emissions ( %) and emissions ( %) (data calculated by CITEPA (Interprofessional Technical Centre for Studies on Air Pollution) and extracted from the 2015 French national air quality report: https://www.statistiques.developpement-durable.gouv.fr/sites/default/files/2018-10/datalab-bilan-de-la-qualite-de-l-air-en-france-en-2015-octobre-2016-c.pdf, last access: 19 May 2022).
4.1.2PM
The highest PM values are observed at the beginning of the period and are more concentrated in the main source regions than PM. Significant reductions in annual average background concentrations are observed over the years. The Sen slope coefficients calculated for the annual average PM (Fig. 12) over the period show negative trends over the whole territory and more pronounced ones over the southeast region, the Auvergne–Rhône–Alpes region, northern France and Brittany. A downward trend of m per year from a national average is calculated, again with statistical significance (95 % interval of [, ] which does not contain zero). Taking 2009 as a reference year, this amounts to a 35 % decrease in 7 years. As for PM, this negative trend is associated with the reduction in primary PM emissions and in PM precursor emissions (, and volatile organic compounds (VOCs)).
4.1.3 OzoneThe SOMO35 indicator shows strong interannual variability. O is a photochemical pollutant produced by secondary reactions in the presence of , VOCs and sunlight. The hot year 2003 is distinguished by a very high SOMO35 over almost the entire territory. For each year, the highest SOMO35 is found in southeastern France and to a lesser extent in the Alsace region. The trends in SOMO35, annual average O and AOT40 over the years are shown in Fig. 14 for the period 2000–2015.
Figure 17
Yearly evolution of the country-averaged population-weighted (a) NO concentration, (b) PM concentration and (c) SOMO35. Trends and 95 % confidence intervals are calculated (d).
[Figure omitted. See PDF]
For the O average annual concentration, small positive trends are found over France. Two exceptions are the southeast (Provence–Alpes–Côte d'Azur, or PACA, region) and the Grand Est region (east of France), i.e., the regions with the highest O concentrations, showing negative trends. Averaging over France, this leads to a positive trend of 0.32 m yr which corresponds to an increase of 6.5 % over 16 years. The same order of magnitude was found for the period 2000–2010 by Malherbe et al. (2017). Both negative (in the south of France) and positive trends are significant according to the mapped 95 % confidence interval (not shown). SOMO35 and AOT40 indicators, which are indicators with a threshold value below which concentrations are not taken into account, show mostly negative trends. However, according to the value of the mapped 95 % confidence interval (not shown here) on most grid points, the confidence interval is wide and contains zero, indicating a lack of significance of the calculated trends. These results are consistent with other European studies (EMEP, 2016; Malherbe et al., 2017) that show an increase in background concentrations and a decrease in O peaks.
Table 4Validation scores for De Hoogh et al. (2018), Chen et al. (2019), and this study. The following scores are calculated by cross-validation for the three studies: Pearson correlation coefficient , the bias, and the root mean square error (RMSE).
De Hoogh et al. | Chen et al. | This | ||
---|---|---|---|---|
(2018) | (2019) | study | ||
NO | 0.57 | 0.57–0.62 | 0.81 | |
RMSE | 9.51 | 9–9.5 | 10.41 | |
Bias | ||||
PM | 0.58–0.68 | 0.48–0.63 | 0.87 | |
RMSE | 2.97–3.3 | 3.1–3.9 | 5.83 | |
Bias | ||||
O | 0.63 | 0.92 | ||
RMSE | 6.87 | 12.54 | ||
Bias |
Country-averaged slope and its 95 % confidence interval.
Indicator | Mean tendency slope | Mean 95 % |
---|---|---|
(or mean trend; in | confidence interval | |
m yr) | (in m yr) | |
PM – avg annual | [, ] | |
PM – avg annual | [, ] | |
O – avg annual | 0.32 | [0.005, 0.59] |
O – SOMO35 | [, 76.7] | |
O – AOT | [, 315] | |
NO – avg annual | [, ] |
NO
NO is mainly emitted by road transport. All maps show the same pattern, with cities and interconnected major roads showing the highest NO concentrations. Trends over the period 2000–2015 are shown in Fig. 15. Decreases in NO concentrations are observed in both rural and urban areas throughout the country. However, we recall that rural levels were found to be overestimated with our approach (see Sect. 3.4). The decrease is more important when NO concentrations are high. As with PM, these results highlight the combined benefit of large-scale emission management policies that target emission sectors and locally oriented policies.
On average, a significant negative trend of m is calculated over France with a narrow 95 % confidence interval (see Table 3). This downward trend is slightly stronger than that calculated in Malherbe et al. (2017) over the period 2000–2010 over France ( m yr) and corresponds to a reduction of about 30 % (taking 2020 as the base year).
4.2 Exposure trendsPopulation-weighted annual average concentrations are good estimates of population exposure as they give more weight to the air pollution where people mainly live. Here, the country-averaged population-weighted concentrations of NO, PM and SOMO35, which are the three main indicators used to calculate health impacts, are calculated for each year from the hourly kriged mapped data over France. For one pollutant, it is obtained by adding the result of multiplying the concentration by the population on all the country's grids and then dividing by the total population of the country. The population database used in this study is the LCSQA (L'expertise au service de la qualité de l'air) national population database (Létinois et al., 2014) established for the year 2015. It is based on detailed files from the French Ministry of Finance with information at building level. It is important to note that the French population used here has not varied over the years. The French population increased by about 10 % between 2000 and 2015. However, if we considered that the demographic evolution is homogeneous over the country (the urbanrural ratio has only increased by about 2.5 % in France over the same period), the population-weighted concentration on national average should be the same whatever the year of the population database.
As for the concentrations, a very clear downward trend is observed for population-weighted NO with a trend of m yr and a narrow 95 % confidence interval: [, ], i.e., a reduction of about 30 % in 16 years. A downward trend of m yr is also clearly calculated for PM (95 % confidence interval: [, ]) over the period 2009–2015, i.e., a reduction of about 31 % in 7 years. In contrast, there is no clear trend for the SOMO35 indicator over the period 2000–2015.
When the abovementioned indicators are multiplied by the total population (to obtain the total exposure, i.e., the sum of the population-weighted concentrations over a country), the outcome indicators are those used to calculate the health impact assessment based on dose-response functions, as suggested by the WHO review “Health Risks of Air Pollution in Europe”, described in Holland (2014a, b). Exposure to SOMO35, anthropic PM and NO (with or without threshold depending on the health impact indicator) contributes to both morbidity and mortality impacts. For example in France, they were used in the PREPA (National Air Pollutant Emissions Reduction Plan) evaluation study for which about 50 political measures to be implemented in France were evaluated and ranked on different criteria, such as air quality impact, health impact and cost–benefit assessment (Schucht et al., 2018). At constant population evolution, the trends are similar between both indicators (total exposure and population-weighted average concentration). However, the evolution in population (even if it is homogeneous over the territory) has an impact on the total exposure of the population. Therefore, we expected a reduced impact on health compared to those on population-weighted concentrations.
5 Data availability
Mapped regulatory indicators and exposure data for all 15 years and the four pollutants described here are available on a Zenodo repository in the Netcdf format (version no. 4) and csv format for data at the municipal or regional level. The DOI link for the dataset is 10.5281/zenodo.5043645 (Real et al., 2021). It is also available through a web-based map library (
6 Conclusions
A 16-year datasets of mapped air pollution concentrations and indicators over France was constructed using a data fusion technique (kriging) that combines measurements from background surface monitoring stations and modeling from the regional model CHIMERE. The resulting data are hourly concentrations at a resolution of about 4 km over France for the period 2000–2015 (shorter period for PM and PM hourly indicators).
The kriging technique implemented combines kriging with external drift for NO and O and co-kriging with external drift for particulate matter, allowing the PM estimation to benefit from the highest density of PM monitoring stations. These datasets have been evaluated over several years using a cross-validation process that takes into account the incorporation of measurements in the correction process by retaining a data point before calculating the score. The kriging technique significantly improves the validation scores, especially in urban areas with very low biases and high correlations. However, a crucial point appears concerning the representativeness of concentrations in rural areas which are overestimated by the model. A new methodology is being developed to better map concentrations in these rural areas. It should be noted that the performance increases with the number of measurements taken into account until a threshold is reached at which the addition of stations no longer seems to improve performance. This threshold depends on the pollutant and is higher for pollutants with a strong spatial gradient (i.e., NO which has a shorter lifetime).
The main annual indicators (mean NO, PM, PM, O, SOMO35 and AOT40) are analyzed in this article, and the annual trends are calculated. Significant downward trends are calculated over the whole period for annual average concentrations of PM, PM and NO. They reflect the reductions in precursor emissions that have taken place in Europe since the 1990s. The trends for O over the 16 years are less significant. In general, the background O level is increasing, mainly due to large-scale pollution, and high (peaks) O levels are decreasing due to reductions in local O precursor emissions. This results in a positive trend for the annual average O concentration over most of France, but a small downward trend is also observed in the regions with the highest O levels (southeast and east). No significant trend is calculated for the two O indicators detailed here (SOMO35 and AOT40). Population exposure is also calculated over France. The average weight of NO and PM in the population of the country decreases respectively by 30 % in 16 years and 31 % in 7 years. No clear trend was found for the population weight of SOMO35.
Author contributions
Data kriging, results evaluation by cross-validation process, and maps and graph production for the paper were performed by ER. The CHIMERE modeling concentration data over the period 2000–2015 were produced by FC. Software developments for the kriging and cross-validation methods were provided by AU, LM and AG. The web-based map library used to store and visualize the data was developed by BR. All work has been supervised and conceptualized by AC. The manuscript draft was mainly written by ER with contributions from all co-authors.
Competing interests
The contact author has declared that neither they nor their co-authors have any competing interests.
Disclaimer
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Acknowledgements
This work was supported by the French ministry in charge of ecology. Part of the simulations was carried out in the XENAIR project funded by the ARC Foundation for Cancer Research.
Financial support
This research has been supported by the Ministère de l'Écologie, du Développement Durable et de l'Énergie (grant no. program DRC16).
Review statement
This paper was edited by Bo Zheng and reviewed by two anonymous referees.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
This paper describes a 16-year dataset of air pollution concentrations and air quality indicators over France. Using a kriging method that combines background air quality measurements and modeling with the CHIMERE chemistry transport model, hourly concentrations of NO
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer