1 Introduction
Nitrogen oxides () play a significant role in the atmosphere, as they catalyse tropospheric ozone formation via a suite of chemical reactions, impact the oxidizing capacity of the atmosphere and, thus, influence the atmospheric burdens of major pollutants like methane and carbon monoxide . In addition, they are responsible for secondary aerosol formation . Fossil fuel combustion is the dominant source of the global emission budget ( %), followed by natural emissions from soils, lightning and open vegetation fires . High ozone, aerosol and have adverse effects on human health , and the recommended limits from the European Union (EU) and the World Health Organization (WHO) are often exceeded, especially in densely populated and industrialized regions . Therefore, emissions of have been the main target of abatement strategies worldwide
Observations from satellite nadir-viewing sensors are essential for mapping the global multiyear picture of the distribution and trend. However, the quality of these data sets needs to be carefully assessed using ground-based measurements at different sites
The EU Seventh Framework Programme (FP7) QA4ECV (Quality Assurance for Essential Climate Variables) project (
-
The development of a quality assurance framework for climate data records (CDRs; ), covering aspects such as product traceability, uncertainty description, validation and documentation, following international standards . Among its components are a generic validation protocol (, building upon ), a compilation of recommended terminology for CDR quality assessment and a validation server ; the latter is a prototype for the operational validation servers for S5P-MPC (Sentinel-5P Mission Performance Center) and CAMS (Copernicus Atmosphere Monitoring Service).
-
The establishment of multi-decadal CDRs for six ECVs following the guidelines of the quality assurance framework; among them are the QA4ECV and the version 1.1 satellite products, which are available for several sensors.
-
The development of an and long-term ground-based data set for 10 MAX-DOAS instruments, harmonized with respect to measurement protocol and data format and with an extensive uncertainty characterization .
In this work, we report a comprehensive validation of the QA4ECV version 1.1 data product on the OMI sensor using the ground-based measurements acquired by DOAS (differential optical absorption spectroscopy) UV–Vis instrument networks developed in the context of the Network for the Detection of Atmospheric Composition Change (NDACC) as a reference. Zenith-scattered light DOAS (ZSL-DOAS) data obtained routinely as part of NDACC monitoring activities are used to validate the stratospheric vertical column density (VCD), while multi-axis DOAS (MAX-DOAS) data, either from NDACC or further harmonized within the QA4ECV project, are used to validate the tropospheric VCD. We focus on how well the ex ante
An ex ante quantity does not rely on a statistical comparison with external data . This is to be contrasted with ex post quantities like the mean difference of satellite data vs. reference data.
uncertainties and comparison errors explain the observed discrepancies, making use of the framework and methodology developed within the QA4ECV and GAIA-CLIM projects.This paper is structured as follows. In Sect. , the satellite and reference data sets are described. Section provides details about the validation methodology. In Sect. , we outline how the quality screening of QA4ECV OMI , notably the exclusion of cloudy scenes, leads to underestimated early afternoon tropospheric VCDs. Section presents the comparison of the QA4ECV OMI stratospheric VCD with ZSL-DOAS. In Sect. , the satellite tropospheric VCD is compared with measurements from 10 MAX-DOAS instruments. The differences are analysed in relation to the uncertainties and the comparison errors. Potential causes of the discrepancies (e.g. horizontal smoothing difference error, low-lying clouds or aerosols, and profile shape uncertainty) and attempts to resolve the discrepancies are then discussed. Finally, the conclusions are formulated in Sect. .
2 Description of the data sets
2.1 Satellite data
2.1.1
QA4ECV OMI
The QA4ECV OMI version 1.1 data product is retrieved from Level 1 UV–Vis spectral measurements (OMI-Aura_L1-OML1BRVG radiance files) from the Dutch–Finnish UV–Vis nadir-viewing OMI (Ozone Monitoring Instrument) spectrometer on NASA's Earth Observing System Aura (EOS-Aura) polar satellite. The nominal footprint of the OMI ground pixels is (across along track) at nadir to at the edges of the 2600 swath, and the ascending node local time is 13:42 LT. For more details on the instrument, see . The data product provides a Level 2 (L2) tropospheric, stratospheric and total VCD.
The QA4ECV algorithm includes the following steps: (i) retrieving the total slant column density (SCD) using differential optical absorption spectroscopy (DOAS), (ii) estimating the stratospheric SCD from data assimilation using the TM5 (Tracer Model, version 5) chemistry transport model (CTM), (iii) obtaining the tropospheric contribution by subtraction and (iv) calculating the tropospheric air mass factors (AMFs) by converting the SCD to a VCD (see Table ). The retrieval equation is as follows: 1 More information can be found in the “Product Specification Document for the QA4ECV ECV precursor product” and in and . A preliminary evaluation of the data indicated that QA4ECV values are 5 %–20 % lower than the earlier version of the OMI data product, DOMINO v2, over polluted regions, and that they agree slightly better with MAX-DOAS VCD measurements in Tai'an (China) and De Bilt (the Netherlands) than the DOMINO v2 VCDs .
The data product files contain a comprehensive amount of metadata. For each pixel, the satellite data product provides a total ex ante uncertainty on the retrieved tropospheric VCD as well as a breakdown of the uncertainty into an ex ante uncertainty budget with the following uncertainty source components: uncertainty in total SCD ; stratospheric SCD ; and tropospheric AMF , which contains contributions from uncertainties in surface albedo , cloud fraction (CF) , cloud pressure and a priori profile shape ; and an albedo-CF cross-term, with representing the error correlation coefficient between both properties
However, the reference data in the current work are column retrievals or profile retrievals with a limited vertical resolution and are based on an a priori profile that is different from the satellite retrieval. Before smoothing, satellite and reference retrievals should be adjusted such that they use the same a priori profile ; therefore, knowledge of the satellite a priori profile is relevant. These can be derived from the TM5-MP data files , which are available upon request
In this work, we considered data from 2004 up to and including 2016 for the tropospheric VCD and up to and including 2017 for the stratospheric VCD.
2.1.2OMI STREAM stratospheric
The Stratospheric Estimation Algorithm From Mainz (STREAM; ) was included as an alternative stratospheric estimation scheme in the QA4ECV data files. In STREAM, the estimate of stratospheric columns is based on satellite observations with a negligible tropospheric contribution, i.e. generally over regions with low tropospheric levels, and for satellite pixels with high clouds, where the tropospheric column is shielded. The stratospheric field is then smoothed and interpolated globally, assuming that the spatial pattern of stratospheric does not feature strong gradients.
2.1.3 NASA OMNO2 data productAlthough not the main focus of this work, we do include NASA's OMI data – OMNO2 version 3.1 – as a benchmark comparison of an alternative retrieval product with QA4ECV MAX-DOAS. Like QA4ECV OMI , OMNO2 is also based on the DOAS approach, although nearly all retrieval steps are different between the QA4ECV and NASA OMI algorithms (Table ). A detailed comparison of the QA4ECV and NASA fitting approaches showed small differences between SCDs ; thus, differences between the spectral fitting approaches only explain a small part of the differences in the tropospheric VCDs. The stratospheric correction approach differs between the two algorithms. Although the QA4ECV and NASA stratospheric SCDs have not been compared directly, previous evaluations suggest that differences between the approaches typically lead to small but spatially widespread differences of up to 0.5– in tropospheric VCDs. This leaves differences between the tropospheric AMF calculations (and especially the prior information used in their calculations) as the most likely explanation for the lower NASA values compared with QA4ECV VCDs
Table 1
The OMI satellite data products considered in this work.
| Data product | Spectral fitting | Stratospheric correction | Tropospheric AMF |
|---|---|---|---|
| OMI QA4ECV v1.1 | Data assimilation in TM5-MP | Surface albedo from 5-year climatology at 0.5 0.5; clouds from OMI – algorithm | |
| OMI STREAM | Weighted (observations with a negligible tropospheric contribution – clean regions and cloudy pixels) convolution | ||
| OMNO2 v3.1 | Three-step (interpolation, filtering and smoothing) stratospheric field reconstruction to fill in the tropospheric contaminated scenes | Surface albedo from 5-year climatology at 0.5 0.5; clouds from OMI – algorithm (OMCLDO2 data product), a priori profiles from monthly Global Modelling Initiative data at 1 1.25 |
OMI STREAM stratospheric VCD is contained in the OMI QA4ECV v1.1 data files.
2.2 Ground-based data2.2.1 Zenith-scattered light DOAS
The ZSL-DOAS data are part of the Network for the Detection of Atmospheric Composition Change (NDACC;
Zenith-sky measurements are performed during twilight at sunrise and sunset. Due to this measurement geometry with a long optical path in the stratosphere, the measured column is about 14 times more sensitive to stratospheric than to tropospheric . Moreover, it allows for usable measurements to also be made during cloudy conditions. Processing followed the NDACC standard operation procedure (
Figure 1
Global distribution of the ZSL-DOAS instruments used in this study. Red markers indicate SAOZ instruments, and blue markers indicate other NDACC ZSL-DOAS instruments.
[Figure omitted. See PDF]
2.2.2 Multi-axis DOASThe tropospheric VCD data used as a reference are a long-term record of MAX-DOAS (multi-axis DOAS) measurements from 10 instruments, reprocessed by different teams for the QA4ECV project (see Table ). MAX-DOAS instruments measure scattered sunlight under different viewing elevations from the horizon to the zenith . The observed light travels a long path (the length is dependent on the elevation angle) in the lower troposphere, while the stratospheric contribution is removed by a reference zenith measurement. Two different MAX-DOAS data processing methods were used for the current validation study, QA4ECV MAX-DOAS and bePRO (Belgian Profiling) MAX-DOAS , with the latter being part of NDACC.
Thanks to an extensive harmonization effort within the QA4ECV project, reference QA4ECV MAX-DOAS data sets were produced by the different teams for all 10 instruments. These data sets are available at
The second processing method, bePRO MAX-DOAS , is available for three BIRA-IASB instruments (at Bujumbura, Uccle and Xianghe). This approach, which is based on the optimal estimation method
There is a clear distinction between the QA4ECV MAX-DOAS and bePRO retrieval algorithms. In the QA4ECV MAX-DOAS algorithm, the VCD is obtained by dividing a differential SCD by a differential AMF at a single elevation angle (see Sect. 1.3 of ). In the bePRO approach , a VCD is obtained by integrating a vertical profile retrieved by an optimal estimation method using measurements at several elevation angles.
MAX-DOAS instruments probe the lower troposphere, with the highest sensitivity (described by the column averaging kernel) close to the surface, typically in the lowest 1.5 of the atmosphere. Nevertheless, the vertical grid extends to for QA4ECV MAX-DOAS and for bePRO MAX-DOAS.
Table 2
Overview of contributing sources for the QA4ECV MAX-DOAS reference data set.
| Station | Location | Start and end time | Class | Contributor |
|---|---|---|---|---|
| Bremen (DE) | 53.10 N, 8.85 E | Feb 2005–Dec 2016 | Urban | IUP-UB |
| De Bilt (NL) | 52.10 N, 5.18 E | Mar 2011–Nov 2017 | Suburban | KNMI |
| Cabauw (NL) | 51.97 N, 4.93 E | |||
| Uccle (BE) | 50.80 N, 4.36 E | Apr 2011–Jun 2015 | Urban | BIRA-IASB |
| Mainz (DE) | 49.99 N, 8.23 E | Jun 2013–Dec 2015 | Urban | MPG |
| Observatoire de Haute-Provence (FR) | 43.94 N, 5.71 E | Feb 2005–Dec 2016 | Rural/background | BIRA-IASB |
| Thessaloniki (GR) | 40.63 N, 22.96 E | Jan 2011–May 2017 | Urban | AUTH |
| Xianghe (CHN) | 39.75 N, 116.96 E | Apr 2010–Jan 2017 | Suburban | BIRA-IASB |
| Athens (GR) | 38.05 N, 23.86 E | Sep 2012–Oct 2016 | Urban | IUP-UB |
| Nairobi (KEN) | 1.23 S, 36.82 E | Jan 2004–Nov 2014 | Rural/urban | IUP-UB |
| Bujumbura (BU) | 3.38 S, 29.38 E | Jan 2014–Dec 2016 | Suburban | BIRA-IASB |
Contributing teams: the Aristotle University of Thessaloniki (AUTH), the Royal Belgian Institute of Space Aeronomy (BIRA-IASB), the Institute of Environmental Physics at the University of Bremen (IUP-UB), the Max Planck Institute (MPG) and the Royal Netherlands Meteorological Institute (KNMI). For this sensor, bePRO MAX-DOAS data (providing profile data) are also available. The same instrument was operated at two different locations, De Bilt and Cabauw, which are approximately 30 km apart. An AERONET instrument, measuring aerosol optical depth, is located at this site or in close proximity.
The MAX-DOAS sites span a wide range of levels, from relatively low at Observatoire de Haute-Provence (OHP) and Bujumbura, with a mean tropospheric MAX-DOAS VCD around the OMI overpass time of , to strongly polluted at Xianghe, with a mean MAX-DOAS value of (see Fig. c, black boxplots), whereas the other sites are moderately polluted (mean value of between 5.6 and 11 ).
The MAX-DOAS tropospheric VCD is provided with an ex ante uncertainty in the GEOMS data files. Unfortunately the uncertainty estimation approach employed is not harmonized among all data providers. Therefore, we set the total uncertainty at 22.2 % of the retrieved VCD for QA4ECV MAX-DOAS instead, following the QA4ECV deliverable D3.9 recommendation . Using sensitivity tests, aerosol effects (20 %) and the a priori profile shape (8 %) were identified as the main contributors to the MAX-DOAS uncertainty, whereas the uncorrelated instrument noise was only 2 %. However, we did not follow D3.9 in its recommended division of the uncertainty into the random error and systematic error contributions
In D3.9, the systematic error uncertainty is set at 3 %, arising from absorption cross-section-related systematic error uncertainty on the SCD, whereas the random error uncertainty is set at 22 %, arising from uncertainty on the AMF. However, the assumption that an error in a priori profile shape, for example, would translate to a random error on the retrieved column is not evident in our opinion. In a later analysis , a comparison of QA4ECV MAX-DOAS with more advanced MAX-DOAS profiling methods was performed. This highlighted systematic differences between % and %, which are considerably larger than the systematic error uncertainty of 3 % recommended by the D3.9. This suggests that a larger part of the total uncertainty is due to systematic error. Therefore, in this work, we only consider a total uncertainty of 22.2 %, which is derived from the sum of the recommended systematic and random components in quadrature.
and consider only a total uncertainty. Regarding bePRO MAX-DOAS, we consider a 12 % total uncertainty for Uccle and XiangheWe note that the bePRO profile retrieval algorithm has recently been compared to several other retrieval algorithms . In future validation work, the consideration of other retrieval algorithms that performed well in the intercomparison exercises of and would be of high interest.
As the accuracy of satellite or ground-based remote sensing can be affected by the presence of aerosol, tracking aerosol optical depth (AOD) is useful. The bePRO MAX-DOAS provides AOD measurements at the same temporal sampling resolution as the measurements. The QA4ECV MAX-DOAS provides an AOD climatology based on AERONET (Aerosol Robotic Network) data ; however, we found that the precision of this climatological data set was inadequate for the current work, especially for urban sites. Instead, we considered AOD directly from AERONET (;
A limitation when investigating AOD dependencies in satellite MAX-DOAS comparisons using AERONET AOD with QA4ECV MAX-DOAS tropospheric VCD data (compared with using bePRO AOD with bePRO data) is that it implies subsetting: for part of the QA4ECV MAX-DOAS data, no co-located AERONET AOD data are available. Moreover, as opposed to the bePRO /bePRO AOD combination, co-located QA4ECV MAX-DOAS AERONET AOD data pairs have a temporal co-location mismatch and (where instruments are at different locations) a spatial co-location mismatch. A test was performed (results not shown) using the bePRO AERONET AOD combination, and it was generally found that the results are less clear than for the bePRO /bePRO AOD combination.
3 Validation3.1 Validation methodology
The generic validation protocol is similar to that outlined by , and it is tailored within the QA4ECV project for the ECVs , and . Terms and definitions applicable to the quality assurance of ECV data products have been agreed upon within QA4ECV ; the full set can be found in . The discussion and analysis of comparison error follows the terminology and framework detailed within the GAIA-CLIM project .
In the following sections, we detail the baseline validation methodology.
3.1.1 Screening criteria
Filters are applied to the satellite data product following the recommendations in the QA4ECV product specification document (PSD; ) as well as to minimize comparison error with MAX-DOAS.
Following the QA4ECV PSD , satellite data are kept for tropospheric validation if the following conditions are met:
(1)
no raised error flag;
- (2)
a satellite solar zenith angle (SZA) less than ;
- (3)
the so-called “snow–ice flag” indicates “snow-free land”, “ice-free ocean” or a sea ice coverage below 10 %;
- (4)
the ratio of the tropospheric AMF over the geometric AMF, , must be higher than 0.2 in order to avoid scenes with a very low tropospheric AMF (which typically occur when the TM5 model predicts a large amount of close to the surface in combination with aerosols or clouds, effectively screening this from detection);
- (5)
an effective cloud fraction (CF) less than . This last filter is comparable to the PSD recommendation of a cloud radiance fraction (CRF) below , and it was chosen because the effective cloud fraction is a more general property than the CRF. Note that the satellite-retrieved cloud fraction and cloud height are effective properties that are sensitive to both aerosol and cloud . It should be mentioned that cloudy pixel retrievals – although subject to larger errors than clear-sky pixels – can still be used (e.g. in data assimilation), provided that the averaging kernel is taken into account .
- (6)
This condition is not mentioned in the PSD, but it is applied by and is a filter to limit the impact of aerosol haze and low clouds. In , this was accomplished by excluding ground pixels with a high retrieved cloud pressure, i.e. . Unfortunately, this filter can remove a substantial portion of the data; therefore, a less strict filter was required in the current work. Low cloud can lead to a high uncertainty in the retrieved tropospheric value when it is uncertain if the cloud is located above the trace gas (a high screening effect and, therefore, a low AMF) or is at similar height (a partial screening effect and partial surface albedo effect and, therefore, a higher AMF). This is registered in the uncertainty component due to the cloud pressure available within the data product. Data analysis reveals that a relatively small number of ground pixels are responsible for an important contribution to the root mean square (RMS) of the ex ante satellite uncertainty for several sites (Xianghe, Uccle, De Bilt, Bremen and Athens) via the cloud pressure component . Most of these high-uncertainty ground pixels have a low retrieved effective cloud pressure (Fig. S1 in the Supplement), which is indicative of aerosol haze or low-lying cloud. The aforementioned cloud pressure filter used by would effectively remove these suspicious ground pixels but many other pixels with a low as well. Therefore, we chose to apply filter (6) instead, which is a one-sided sigma-clipping on : data where are removed. This sigma-clipping removes a smaller percentage of the data, while still achieving its goal of limiting and . After this filtering step, is only a minor contributor to the OMI uncertainty budget.
- (7)
Finally, satellite ground pixels with a footprint greater than 950 , corresponding to the five outermost rows at each swath edge of the OMI orbit, are removed in order to limit the horizontal smoothing difference error with the MAX-DOAS data. Filter (7) is not a filter on satellite data quality but rather a limit on the scope of the validation.
Regarding the OMNO2 data product, we followed the recommendation of by only including ground pixels for which the least significant bit of the VcdQualityFlags variable is zero (indicating good data). Furthermore, the effective cloud fraction must be less than 0.2 and the pixel area must be less than .
No screening was applied to the ground-based reference data sets. In particular, filtering is not applied on the MAX-DOAS cloud flag as a baseline, as it is not available for all data sets. It should be noted that clouds can impact the quality of MAX-DOAS retrievals (see e.g. radiative transfer simulations of , and ).
3.1.2 Co-location criteria and processingStratospheric column
The air mass to which a ZSL-DOAS measurement is sensitive spans over many hundreds of kilometres towards the rising or setting Sun
Figure 2
Illustration of a single co-location between OMI and a sunrise ZSL-DOAS measurement using the dedicated observation operator. The red dot marks the location of the ground instrument, the cyan lines indicate the coastlines of this part of the Mediterranean, the greyscale background contains the full orbit data and the coloured pixels are those that have their centre within the observation operator (black polygon), i.e. those that are averaged to obtain a satellite measurement comparable with that of the ZSL-DOAS instrument.
[Figure omitted. See PDF]
To account for effects of the photochemical diurnal cycle of stratospheric , the ZSL-DOAS measurements, obtained twice daily at twilight at each station, are adjusted to the OMI overpass time using a model-based factor. The latter is extracted from LUTs that are calculated with the PSCBOX 1-D stacked-box photochemical model initiated by daily atmospheric composition and meteorological fields from the SLIMCAT chemistry transport model . The amplitude of the adjustment depends strongly on the effective SZA assigned to the ZSL-DOAS measurements; it is taken here to be 89.5. The uncertainty related to this adjustment is of the order of 10 % or 1 to 2 .
Tropospheric column
Regarding the tropospheric column validation, satellite data are kept if the satellite ground pixel covers the MAX-DOAS instrument location and if a MAX-DOAS measurement is within a 1 interval centred on the satellite measurement time. The average of all MAX-DOAS measurements within this 1 interval is taken. The typical number of MAX-DOAS measurements taken within this time interval was between two and four for most sites. This procedure was applied to both QA4ECV OMI and the OMNO2 comparisons.
3.2 Impact of quality screeningQuality screening is a necessary step before a satellite data product can be used, but it can be a limit to the data product's scope. Figure a presents the remaining fractions of the satellite overpass data at the MAX-DOAS sites at each of the seven successive filter steps described in Sect. . Note that the Cabauw and De Bilt sites are not included, as the results are very close to that of Uccle.
With respect to the data filtering conditions mentioned earlier, the error flag (1) removes %–30 % of the data; filters on the SZA and the snow–ice flag (2 and 3) have a relatively small impact; the filter on the AMF ratio (4) has a large impact on the Bremen, Mainz, Cabauw, De Bilt, Uccle and Xianghe sites (35 %–40 % of data removed); and the filter on CF (5) has an important screening impact at all sites (see Fig. ), removing up to 60 % of the data at the Bujumbura site. As an alternative to the CF filter, we also tested the filter; for most sites the CRF and CF filters have a near-identical impact, although for Bujumbura and Nairobi the CRF filter is more restrictive (results not shown). In combination, the quality filters recommended by the PSD (filters 1–5) remove between 56 % (Athens) and 90 % (Bremen) of the data.
Figure 3
Starting from satellite data with a ground pixel covering the MAX-DOAS site, panel (a) shows the remaining data fraction after the application of each of the seven filter criteria. The criteria are explained in Sect. . The Cabauw and De Bilt sites are not included here, as the fractions are very close to that of Uccle. Panel (b) shows the remaining fraction per month after the application of all filters. Panel (c) shows boxplots of QA4ECV MAX-DOAS data (“MXD”) co-located with QA4ECV OMI for each site, before the application of the filters (black), after the application of the filters (blue) and QA4ECV OMI co-located with MAX-DOAS after the application of the filters (“SAT”, red). The sites are sorted according to the median MAX-DOAS value before filtering. The box edges represent the 1st and 3th quartiles, the orange line represents the median, the green cross represents the mean, and the whiskers represent the 5th and 95th percentiles.
[Figure omitted. See PDF]
Filter (6), the filter on the uncertainty component due to cloud pressure , removes 5 % of the data at most at the Xianghe site, whereas the alternative filter on cloud pressure would have removed 15 % of the data (Fig. S1). The filter on ground pixel size (7) removes an additional 3 %–16 % of the data.
The screening can have a strong seasonal effect; for example, the winter months are strongly underrepresented for the western European urban sites (Fig. b). Figure c presents box plots of co-located MAX-DOAS tropospheric measurements for each MAX-DOAS site before (black) and after (blue) screening. Both the mean and median values decrease due to the filtering step. We conclude that the quality screening tends to reject scenes with a high tropospheric VCD, i.e. the restriction to quality-screened scenes leads to a negative sampling bias with respect to the ensemble of all scenes. On an absolute scale, the screening effect is strongest at the Xianghe site, leading to a decrease in the yearly mean tropospheric from 24 to 15 (40 % decrease). At Nairobi, Thessaloniki, Bremen, De Bilt and Cabauw, the tropospheric VCD is reduced by several peta molecules per square centimetre (). The cloud filter is a main contributor to this sampling bias. This is in accordance with the results of , who found that higher tropospheric was measured by MAX-DOAS in Beijing under cloudy conditions compared with clear-sky conditions. Indeed, cloudy conditions lead to less photochemical loss of tropospheric , as explained by the model results . In comparisons of OMI tropospheric with independent data, care should be taken to ensure that the independent data are also sampled for clear-sky conditions . A systematic influence of clouds on the MAX-DOAS retrievals might contribute to the observed sampling bias effect.
It can be argued that the AMF ratio filter (filter 4) is too restrictive. In Sect. S2 in the Supplement results are presented for the less restrictive . The remaining data fraction is slightly increased at the Bremen, Mainz, Uccle, De Bilt and Cabauw sites (from % to %), and the winter months are better represented (see Fig. S2). The negative sampling bias at De Bilt and Bremen is reduced, whereas it is removed at Mainz. As will be shown in Sect. , this adapted filtering generally has no negative impact on the satellite vs. MAX-DOAS comparisons.
Figure 4
(a) Time series of OMI and SAOZ stratospheric above the Kerguelen NDACC station in the Indian Ocean, which is typical of clean background conditions. Panel (b) is similar to (a) but for the Observatoire de Haute-Provence (OHP), which shows more significant tropospheric columns in winter due to anthropogenic pollution.
[Figure omitted. See PDF]
3.3Comparison of OMI stratospheric with ZSL-DOAS
Figure contains time series of stratospheric columns, from both satellite (QA4ECV product) and ground-based instruments, at two illustrative ground sites: Kerguelen in the southern Indian Ocean, which is representative of very clean background conditions, and the Observatoire de Haute-Provence in France, which is affected by significant tropospheric pollution in local winter that often exceeds the wintertime stratospheric column. The graphs show the well-known seasonal cycle in stratospheric , which is captured similarly by satellite measurements and the ZSL-DOAS instrument. It is already evident from perusal of the results at OHP that the stratospheric comparison is hardly affected by the peaks in tropospheric pollution, e.g. in winter 2005–2006, indicating a good separation between the troposphere and stratosphere in the QA4ECV OMI retrievals.
Figure 5
Climatological, i.e. all years mapped to a single year and with a 1-month smoothing function applied, comparison between QA4ECV OMI stratospheric and the ZSL-DAOS instruments at Kerguelen and the Observatoire de Haute-Provence (OHP), revealing overall good agreement.
[Figure omitted. See PDF]
To better reveal differences in the representation of the seasonal cycle, Fig. presents the full time series at these two stations as a function of the day of the year (DoY), with a 1-month moving mean applied. While the seasonal cycle is generally well represented, with accurate levels in local summer, the QA4ECV OMI stratospheric column does appear to be a little lower than the ground-based value in local winter at these two sites. However, this is not a network-wide feature; this is illustrated in Fig. , which shows the median difference for each day of the year for every station, ordered by latitude, where the median is taken over the entire 14-year time series.
Figure 6
Median difference for each station (ordered by latitude) and for each day of the year, taken over the entire 14-year record, between QA4ECV OMI stratospheric and the co-located, photochemically adjusted, sunset ZSL-DOAS measurements.
[Figure omitted. See PDF]
From this figure, it is clear that the agreement is poorer at high latitudes, owing to more difficult measurement conditions (such as a high SZA) and at times a highly variable atmosphere (e.g. vortex dynamics), which amplify errors due to imperfect co-location. At more moderate latitudes, some seasonal features can be observed, but their sign varies from station to station, e.g. for Lauder and Kerguelen. A potential source of seasonal errors lies in the use of cross sections at a fixed temperature. The QA4ECV retrieval includes a second-order a posteriori temperature correction to adjust for the difference in the absorption cross section between the assumed 220 and the true effective temperature . However, the ZSL-DOAS data were not temperature corrected, and estimate the impact to range between a 2.4 % overestimation in local winter and a 3.6 % underestimation in local summer for ZSL-DOAS measurements at Jungfraujoch. In other words, the amplitude of the seasonal cycle should be about 6 % larger than that currently reported by the ZSL-DOAS at mid-latitudes for an assumed effective stratospheric temperature of 220 . Therefore, this effect could explain part of the discrepancy between satellite and ground-based seasonal cycles at sites such as Kerguelen, but it requires confirmation with a proper ZSL-DOAS temperature correction. Developmental work on this is ongoing (François Hendrick, personal communication, 2019), but it is beyond the scope of the current paper. The excellent agreement between sunrise and sunset ZSL-DOAS measurements after mapping them to the OMI overpass time at Kerguelen suggests that the photochemical adjustment works well, but it does not exclude the presence of biases that are common to sunrise and sunset measurements. At OHP, the wintertime agreement between sunrise and sunset after photochemical adjustment is not as good. Contamination by tropospheric pollution is expected to be similar for both sunrise and sunset measurements, as it contributes to the air mass below the scattering altitude, i.e. the column above the station, as opposed to the large and offset area of sensitivity in the stratosphere. Differences between sunrise and sunset contamination could still be caused by a diurnal cycle in the tropospheric column, but an analysis of that diurnal cycle (e.g. from MAX-DOAS data) is beyond the scope of this work.
Figure presents the network-wide results in terms of the bias and comparison spread for each station as a function of latitude. On average, QA4ECV OMI stratospheric seems to have a minor negative bias ( ) with respect to the ground-based network. In view of the station-to-station scatter of the order of 0.3 and the uncertainties on the ground-based data, this is hardly significant and is roughly in line with validation results for other OMI stratospheric data sets
Figure 7
Meridian dependence of the mean (the circular markers) and standard deviation ( error bars) of the individual differences between QA4ECV (a) and STREAM (b) OMI stratospheric column data and ZSL-DOAS reference data represented at individual stations from the Antarctic to the Arctic. The values in the legend correspond to the mean and standard error of all mean (for each station) differences.
[Figure omitted. See PDF]
3.4Comparison of OMI tropospheric with MAX-DOAS
A key issue in the geophysical validation of satellite data sets with respect to suborbital reference measurements is the additional uncertainty that appears when comparing different perceptions of the inhomogeneous and variable atmosphere, i.e. when comparing data sets characterized by different sampling and smoothing properties, both in space and time, which is a main topic of the European GAIA-CLIM project . Potential comparison error sources for satellite vs. MAX-DOAS are discussed in Sect. –, following the framework and terminology of and . The impact of the horizontal smoothing difference error on the bias is presented in a qualitative way in Figs. and S5–S8.
The results of the comparison of QA4ECV OMI with MAX-DOAS are provided in Sect. . The overall bias and dispersion are provided in boxplots of the differences for each site (Fig. ); comparisons of the NASA OMI data product OMNO2 with MAX-DOAS are also shown. The seasonality of the bias for each site is shown in Figs. and . Figure presents the overall discrepancy between QA4ECV OMI and MAX-DOAS as given by the mean-squared deviation (MSD), which is split into bias, seasonally cyclic and residual components. This figure also presents the consistency of the RMSD with the combined ex ante uncertainty. The impact of adapting screening criteria on bias and dispersion is shown in Figs. S9–S13. A priori profile harmonization and vertical smoothing is presented in Fig. for the bePRO sites at Uccle and Xianghe. The discussion of these figures is given point by point in Sect. . Table gives an overview of the error source attributions.
3.4.1 Sources of comparison errors: overviewPart of the discrepancies between the OMI and the MAX-DOAS data sets are due to comparison errors. Starting from the general comparison equation , the difference between satellite and reference measurements can be approximated in this specific case as
4 where and are the tropospheric VCD values measured by satellite and reference ground-based sensors respectively, and are the errors in both measurements, is the horizontal smoothing difference error (as the horizontal projection of the probed air mass of satellite and ground-based measurements is different), and and are the horizontal, temporal and vertical sampling difference error respectively (as satellite and ground-based measurement are not taken at exactly the same space and time).
3.4.2 Temporal sampling difference errorThe temporal sampling difference error and the MAX-DOAS uncorrelated random error are already mitigated by averaging the MAX-DOAS measurements within a 1.0 interval. We found that using larger time intervals can lead to an increase in the bias, which is likely due to the photochemical evolution and transport of the molecule, but at this small time window the temporal sampling difference error has a random character
This is checked by comparing MAX-DOAS measurements before and after the satellite overpass time for the different overpasses.
. The residual uncertainty can be estimated by taking the uncertainty of the mean of the MAX-DOAS values within each time interval. Subtracting this component in quadrature from the RMSD, the – discrepancies at the different sites would be reduced by less than 0.1 for the OHP, Bujumbura, Athens and Nairobi sites, and by 0.1 to 0.5 at most for the other sites. Therefore the temporal sampling difference error and the MAX-DOAS uncorrelated random error can be considered to be insignificant contributions to the – discrepancies, and they are not discussed further here. In agreement with this, found that the impact of the temporal sampling difference error on satellite vs. MAX-DOAS tropospheric VCD comparisons was negligible.3.4.3 Horizontal sampling difference error
Tropospheric has a large spatial variability, especially at polluted sites; therefore, random and systematic features in the true field at the scale of the distance between the MAX-DOAS location and the co-located satellite ground pixel (typically a few kilometres to a few tens of kilometres, –14 on average) can be expected. However, one must realize that (i) there is no directional preference in the co-locations, meaning that directional features are averaged out in the comparison, and (ii) the satellite measurements are strongly spatially smoothed.
To estimate the impact of the horizontal sampling difference error, we compare two sets of QA4ECV OMI tropospheric VCDs. Regarding the first set (), it is required that its ground pixel covers the MAX-DOAS site and its pixel centre is within 5 of the site. The second set () has its ground pixel second-nearest to the site, within the same overpass. pixels are within 3–4 of the site on average, pixels are within 11–12 of the site, and the distance between and pixels is typically 13.6 , i.e. comparable to the mean distances encountered in the OMI vs. MAX-DOAS comparisons. Note that the discrepancy between and is due to both the horizontal sampling difference error and the random noise error.
Details on the analysis are given in Sect. S3 in the Supplement. The main conclusions are as follows:
-
The bias caused by the horizontal sampling difference error reaches at most (at Athens, Bremen and Mainz) and is therefore only a very minor contributor to the observed bias between OMI and MAX-DOAS (discussed later in Sect. ).
-
The dispersion of can, in principle, be due to variation in the total slant column, in the AMF or in the stratospheric slant column (see Eq. ). It is shown in the Supplement that it is largely due to variation in the slant column. It follows that uncorrelated random noise error mainly originates from the slant column, not from AMF or stratospheric column (as these do not vary much between neighbouring pixels). This then justifies the use of the ex ante uncertainty component due to SCD uncertainty, , as an estimate of the total random error uncertainty. Note that was scaled such that it only accounts for the random error of the slant column , not for systematic error.
-
At the Bujumbura and Nairobi sites, exceeds the variance of the difference, indicating that is sometimes overestimated.
-
The standard deviation caused by the horizontal sampling difference error (obtained by subtracting the dispersion due to random noise in quadrature) is minor compared with the discrepancies encountered in the OMI vs. MAX-DOAS comparisons.
3.4.4 Vertical sampling difference error
Two sources of vertical sampling difference error can be identified. First, the surface altitude of the ground-based MAX-DOAS sensor and the mean surface altitude of the OMI ground pixel are not exactly the same. To estimate a correction, we applied a VMR-conserving vertical shift of the satellite a priori profile, described by . The ground levels are shifted by 0.03 on average (Cabauw and De Bilt) to 0.4 (Athens and Bujumbura). This hardly changed (bias changes of 0.3 or less). This VMR-conserving approach probably underestimates the discrepancy at the Athens and Bujumbura sites which have a complicated orography. The MAX-DOAS instrument at Athens is located on one of the hills surrounding the city at an altitude of 527 , while the mean surface altitude of the co-located satellite pixels is . Therefore, the MAX-DOAS measurement misses the lowest part of the tropospheric column, and correcting for this would increase the already negative bias. The MAX-DOAS instrument at Bujumbura is at an altitude of 860 at the edge of the city, which is located in a valley surrounded by 2000–3000 high mountains ; this causes the mean surface altitude of the co-located satellite pixels ( ) to be higher than the MAX-DOAS instrument.
A second source of vertical sampling difference error is the fact that the MAX-DOAS only measures the lower tropospheric VCD, whereas the satellite measures the full tropospheric VCD. This is, in principle, a source of positive bias in and, therefore, cannot explain the observed negative bias in the comparison. A proper quantification of this bias source depends critically on the assumed vertical profile shape and is outside the scope of the current work.
3.4.5 Horizontal smoothing difference error
Ideally, subpixel variation in the tropospheric VCD would be estimated using a high-resolution model with grid cell area comparable to the MAX-DOAS horizontally projected area of the probed air mass. Instead, we employ two semi-quantitative approaches here to estimate the bias from horizontal smoothing difference error.
In the first approach, the horizontal smoothing effect is estimated from the QA4ECV OMI data. The “superpixel” OMI tropospheric VCDs are constructed by averaging OMI VCDs of individual pixels of a relatively small size ( ) within a 20 radius centred on the MAX-DOAS site. For each overpass, a superpixel VCD is compared with the individual ground pixel VCD covering the MAX-DOAS site. Using this procedure, a superpixel consists of three individual ground pixels on average. The mean difference per season, from 2004 to 2016, is presented in Fig. . The second approach is similar, but uses S5P TROPOMI data, from May 2018 to May 2019, RPRO (reprocessed) OFFL (offline) data with processor version 01.02.00–01.02.02, and the superpixel tropospheric VCD is constructed by averaging VCDs of individual pixels that are within a latitude, longitude box of , centred on the MAX-DOAS site for each overpass. TROPOMI has a similar overpass time to OMI (early afternoon) and a considerably finer resolution ( at nadir). The area of this superpixel corresponds to –900 , i.e. about the size of a bigger OMI pixel, and typically contains 20 TROPOMI ground pixels.
The OMI-based approach has as the advantage that the time range is appropriate, but it is limited by the large ground pixel size. Regarding the finer resolution TROPOMI data, one should keep in mind that its ground pixel size is still large compared with the horizontally projected area of the probed air mass of the MAX-DOAS
The horizontal distance of the QA4ECV MAX-DOAS measurements is small compared with a TROPOMI pixel in both the viewing and the perpendicular direction. Regarding bePRO MAX-DOAS, although it has a small field of view, its probed distance in the viewing direction ( ) is of similar or slightly larger magnitude compared with the cross section of a TROPOMI ground pixel.
; hence, the contribution of the horizontal smoothing difference error to the bias and comparison might still be underestimated. Another limitation is that the TROPOMI time range considered does not overlap with the time range considered for OMI. Both approaches suggest a negative bias contribution due to the horizontal smoothing difference error at the Mainz and Thessaloniki sites and no such bias contribution at OHP, whereas the results are mixed for other sites (the bias varies over the seasons, and/or different results are found between the OMI- and TROPOMI-based calculations). Differences between the OMI- and TROPOMI-based calculations are likely caused by (i) the much larger central pixel of OMI compared with TROPOMI, which leads to a lower sensitivity to fine-scale variations in Fig. a, and (ii) the evolution in parameters such as concentration patterns, which are captured differently by the different temporal ranges used in Fig. a and b. A case in point is the positive mean differences in JFM (January–February–March) and OND (October–November–December) captured in the TROPOMI-based calculation but not in the OMI-based calculation. Both MAX-DOAS sensors are not located in urban centres, although pollution centres are located nearby. Therefore, the positive mean differences in JFM and OND captured by TROPOMI are likely due to fields in the periphery of the TROPOMI superpixel. This is in agreement with the very recent work by on the horizontal smoothing effect. The estimated “horizontal dilution factors” in Fig. S3 of are positive for Cabauw and Xianghe, indicating that is higher on the periphery than at the MAX-DOAS location.Figure 8
(a) Mean difference for each season between the QA4ECV OMI superpixel (ground pixels averaged within 20 of the central site) and the central OMI ground pixel using data from 2004 to 2016. (b) Similar to (a) but using TROPOMI data from April 2018 to May 2019, and the superpixel is defined within a latitude, longitude box of , centred on the MAX-DOAS site.
[Figure omitted. See PDF]
A tropospheric monthly field with subpixel variability is derived from the QA4ECV OMI data using a variant of the temporal averaging approach of
Here, the arithmetic average of covering satellite ground pixels is taken for each grid cell rather than using a weighted average, as done by . Only ground pixels with an area less than are considered.
(as shown in Figs. S5–S8) for months with a minimal (left column) and maximal (right column) OMI vs. MAX-DOAS bias (as derived from Figs. –). Fields are constructed for each month by averaging over the 2004–2016 period. The resulting field is horizontally smoothed; the variability is an underestimate of the true horizontal variability. Subpixel enhanced tropospheric approximately centred on the MAX-DOAS site can be identified in high-bias months at Nairobi, Thessaloniki and Mainz, whereas this is clearly not the case for the OHP, Bujumbura, Uccle, De Bilt/Cabauw and Xianghe sites. In Athens the pollution peak centre is some 10 from the sensor, and for Bremen no clear peak is identified.The contribution of the horizontal smoothing difference error to the (OMI–MAX-DOAS) bias at Mainz is consistent with the results of , who achieved a significant bias reduction by adjusting the OMI data with factors derived from air quality simulations at a high spatial resolution of 2 .
Similar maps have been constructed in studies such as and to estimate the impact of the horizontal smoothing effect on satellite vs. DOAS comparisons.
3.4.6 Comparison resultsBias and dispersion
Figure (black boxplots) presents boxplots of the difference of QA4ECV OMI with co-located QA4ECV MAX-DOAS for each MAX-DOAS site. At all sites, the bias of QA4ECV OMI with respect to QA4ECV MAX-DOAS is negative. On an absolute scale, it is the lowest at the less polluted OHP and Bujumbura sites (mean difference of and respectively), and highest at the Thessaloniki and Mainz sites (mean difference of ). On a relative scale, the bias is lowest (median relative difference between % and %) at the Uccle, Cabauw, De Bilt and Xianghe sites and highest (median relative difference of %) at Bujumbura and Nairobi. The difference dispersion, expressed as the interquartile range (IQR), is lowest at the Bujumbura, OHP and Nairobi sites (–2 ) and largest at the Mainz and Xianghe sites (–6 ).
Figure 9
Boxplots for each site showing QA4ECV OMI vs. QA4ECV MAX-DOAS (black boxes), QA4ECV OMI vs. bePRO MAX-DOAS (green boxes, only for three sites), OMNO2 vs. QA4ECV MAX-DOAS (blue boxes) and OMNO2 vs. QA4ECV MAX-DOAS, but for the subset of OMNO2 pixels where QA4ECV OMI has an accepted pixel only (red boxes). Panel (a) displays boxplots of SAT–REF tropospheric VCD differences, and panel (b) displays boxplots of (SAT–REF)/REF. The same boxplot conventions are used as in Fig. . Outlying mean relative differences (green crosses) can occur when low REF values are present.
[Figure omitted. See PDF]
As discussed in Sect. –, among the different comparison error components, only the horizontal smoothing difference error is expected to induce an important negative bias, and this is only true for some sites (e.g. Thessaloniki and Mainz), whereas for other sites (e.g. OHP and Xianghe) this is not expected. This means that the bias is (at least in some cases) due to error in the satellite and/or MAX-DOAS measurement, not due to comparison error.
We present in the same figure boxplots of the tropospheric VCD difference between OMNO2 data and QA4ECV MAX-DOAS measurements (blue boxplots). The bias of OMNO2 vs. QA4ECV MAX-DOAS is comparable to that of QA4ECV OMI vs. QA4ECV MAX-DOAS, although slightly more negative at all sites except Cabauw. If one only considers the subset of the OMNO2 pixels where QA4ECV OMI has an accepted pixel, the OMNO2 bias becomes closer to that of QA4ECV OMI for most sites. Although bePRO MAX-DOAS has a better correction for aerosols and vertical profile effects compared to QA4ECV MAX-DOAS in principle, the bias of QA4ECV OMI with respect to bePRO MAX-DOAS (Fig. , green boxes, only for the Bujumbura, Uccle and Xianghe sites) is comparable to that of QA4ECV OMI vs. QA4ECV MAX-DOAS.
In most cases, we conclude that mutual differences between the tropospheric VCD of the two OMI satellite data products and between both MAX-DOAS processing methods are small compared with the differences between the satellite OMI data products and the MAX-DOAS measurements. The main exception is at OHP, where the median difference and relative difference of OMNO2 vs. QA4ECV MAX-DOAS ( , %) is considerably more negative than that of QA4ECV OMI vs. QA4ECV MAX-DOAS ( , %). The observation of higher MAX-DOAS tropospheric VCD compared with satellite measurements is a common finding in the literature
Seasonal cycle of the bias
Figures and present a seasonal plot (i.e. all data mapped to 1 year) of QA4ECV OMI tropospheric VCD, of QA4ECV MAX-DOAS, and of the difference for each site. Also indicated are the rolling monthly mean and median as well as outliers identified by iterative 4 clipping.
A seasonal cycle in the bias, with a larger underestimation in seasons with high , is a recurring feature (Fig. ). This is the case at the more polluted sites such as Xianghe, Mainz and Thessaloniki in winter months and is in agreement with several literature results . Note, however, that we also find a seasonal cycle in the bias at the relatively clean OHP site. A very strong seasonal cycle in bias (tenfold increase) is present at Nairobi, where the MAX-DOAS sensor measures a strongly elevated , peaking in July and August, which is either not detected or hardly detected by the satellite. This is likely a (spatially) local phenomenon, which would be consistent with the locally enhanced in Fig. S5. This site is characterized by local traffic. The enhanced concentrations in July and August (as measured by MAX-DOAS) are possibly related to meteorology. This season is characterized by low precipitation, low wind speeds (see
Figure 10
Seasonal cycle plots for the OHP, Bujumbura, Athens, Nairobi, Thessaloniki and Bremen sites. The top panel for each site shows the tropospheric VCD of QA4ECV OMI and QA4ECV MAX-DOAS as well as the rolling monthly mean and median of both. The bottom panel for each site shows the differences between QA4ECV OMI and QA4ECV MAX-DOAS, the outliers indicated by 4 clipping, and rolling monthly mean and median of the difference.
[Figure omitted. See PDF]
Figure 11
As in Fig. but for the Uccle, Mainz, Cabauw/De Bilt and Xianghe sites.
[Figure omitted. See PDF]
Overall discrepancy and consistency with ex ante uncertainty
The discrepancy, as measured by the root-mean-squared difference (RMSD) between satellite measurements and MAX-DOAS data, exceeds the combined ex ante uncertainty for all sites
Although the root-mean-squared error (RMSE) and uncertainty are not exactly equivalent, they should be roughly comparable if all error sources are well characterized. If all error is purely random, the RMSE equals the standard deviation of errors, of which the uncertainty is an ex ante estimate. If the error is fully systematic and constant, the RMSE equals the absolute value of the bias, which is expected to be smaller than twice the uncertainty with a 95 % probability.
(see Fig. , for the squared quantities). Clearly, comparison error contributes significantly to the RMSD, and/or there are underestimated/unrecognized errors in the satellite or reference data.Figure 12
Two stacked bar plots are provided for each site. The left bar shows the mean-squared difference of QA4ECV OMI vs. QA4ECV MAX-DOAS, which is split into three components: (i) the square of the mean difference, (ii) the variance of the rolling monthly mean difference and (iii) the variance of the residual difference. The right bar shows the combined ex ante uncertainty of QA4ECV OMI –QA4ECV MAX-DOAS, which is split into four components: (i) the MAX-DOAS squared uncertainty, (ii) the QA4ECV OMI squared uncertainty contribution from the total SCD and (iii) from the stratospheric SCD, and (iv) the QA4ECV OMI squared uncertainty contribution from the tropospheric AMF. Also shown is the AMF component of the methodological uncertainty, derived by intercomparing the retrieval methodologies by , which is referred to as structural uncertainty in this work. The right axis provides a square-root scaling of the corresponding RMS.
[Figure omitted. See PDF]
The mean-squared difference in Fig. is split into three additive components: (i) the squared mean difference (bias component), (ii) the variance of the rolling monthly mean difference (seasonal cycle component) and (iii) the variance of the residual difference. The first two components can be attributed to systematic error, whereas the third component can be attributed to random error and any uncharacterized systematic error. The leading component can be different for each site (e.g. the bias component at Bujumbura, the seasonal component at Nairobi, and the residual at Mainz and Xianghe).
The satellite and reference data products do not provide the information to split the squared uncertainty according to the random or systematic nature of the error source. Instead, the squared uncertainty in Fig. is separated into additive components according to origin: (i) uncertainty in the MAX-DOAS measurement , (ii) uncertainty in the satellite measurement due to error in the SCD (expected to be mainly random in nature) , (iii) stratospheric SCD and (iv) uncertainty in satellite measurement due to error in tropospheric AMF . For the sites with the lowest levels (OHP and Bujumbura), uncertainty in the SCD is the main contributor, whereas the MAX-DOAS uncertainty becomes the leading component for the other sites.
By analysing and intercomparing the tropospheric AMF calculation methods between different retrieval groups, concluded that the uncertainty due to differences in retrieval methodology (i.e. methodological uncertainty – termed structural uncertainty by ) is 32 % under unpolluted conditions and 42 % under polluted conditions, which is mostly due to the different choices regarding ancillary data surface albedo, a priori profile and cloud parameters by different groups. In Fig. , this AMF component of the methodological uncertainty, , is presented as an alternative to the ex ante obtained by uncertainty propagation, where we classified OHP and Bujumbura as unpolluted sites and the others as polluted. In all cases, the methodological uncertainty exceeds the ex ante uncertainty . At four sites, the discrepancy between OMI and MAX-DOAS can be explained for the most part (Uccle and Cabauw/De Bilt) or even completely (Xianghe) using this methodological uncertainty, but this is not the case at the other sites.
Modifying screening criteria
Applying a more strict screening protocol can, at least in principle, mitigate discrepancies in bias and dispersion, at the expense of data loss. In the case at hand, results are mixed for the different sites (see Figs. S9–S13); stricter criteria do not resolve bias or dispersion for all sites. For the Uccle, Mainz, Cabauw and Xianghe sites strong reductions in bias and/or dispersion (–2 ) can be achieved by filtering more strictly on the effective cloud properties cloud fraction, cloud pressure, the uncertainty component due to cloud pressure , the MAX-DOAS cloud flag (removing scenes with thick or broken clouds) or the AOD. This suggests that part of the discrepancy is caused by clouds and/or aerosol. More minor reductions in the bias and/or dispersion are achieved for the Bujumbura, Nairobi, Athens, Bremen and De Bilt sites.
Screening more strictly on the ground pixel area leads to improvements in the bias for Mainz and Thessaloniki, confirming (see Sect. ) that the horizontal smoothing difference error is a component of the bias. Improvements in dispersion are found for Mainz, Thessaloniki, Uccle and Xianghe.
Using a stricter filter on effective cloud properties, the RMSD can be made consistent with the ex ante uncertainty for the Uccle and Cabauw-De Bilt sites (results not shown). For Mainz, this can be achieved if ground pixels larger than 400 are also excluded (keeping only 25 % of the data). Finally, we note that the RMSD and uncertainty are consistent in the months from May to (and including) August (when values are low) at the OHP site without the need for stricter filtering.
For most sites, additional screening (within reasonable limits) cannot lower the RMSD enough that it matches the uncertainty. Some uncertainty components in either OMI or MAX-DOAS data are likely underestimated or not included.
While we found that stricter screening using the uncertainty component due to cloud pressure, , often leads to better results, the threshold values obtained are quite low. This indicates that is underestimated in the satellite data product. As expected, relaxing the cloud fraction filter beyond the baseline can lead to an increase in bias and/or dispersion (see e.g. Bujumbura, Nairobi and Uccle in Figs. S9–S13), motivating the CF (or almost equivalently CRF ) recommendation. On the other hand, relaxing the AMF ratio filter beyond the baseline has no large impact on the comparison, whereas further restricting it sometimes has a negative impact (e.g. an increase in the bias and/or dispersion at Uccle, Xianghe and Cabauw). Therefore, the current recommended baseline lower bound () can be replaced by a lower value (e.g. 0.1 or 0.05).
Vertical smoothing
The nonuniform vertical sensitivity of the satellite measurement, combined with an approximate a priori profile shape, is a source of error in the satellite measurement. The bePRO MAX-DOAS provides not only column but also profile shape information (albeit with a limited vertical resolution) and, therefore, allows for this error source to be assessed separately. Figure shows the impact of directly applying Eq. () on the bePRO MAX-DOAS profile (after vertical alignment using the method from ) on the mean-squared deviation (MSD) as well as its bias, seasonal cycle and residual components for the Uccle and Xianghe sites. While direct smoothing of the MAX-DOAS profile improves the MSD for Uccle, it increases it for Xianghe because the seasonal cycle component increases. The increase in seasonal variance is caused by the interplay of the seasonal variation in the MAX-DOAS vertical profile and of the satellite vertical averaging kernel. Specifically, for the Xianghe case, it is found that averaging kernels have higher values close to the surface in wintertime, while MAX-DOAS profiles can also be peaked at the surface. This combination causes increased MAX-DOAS columns upon vertical smoothing. This is also seen in cases such as the comparison of the GOME-2 AC SAF GDP 4.8 product with MAX-DOAS at Xianghe (see Fig. 7.14 and 7.15 of , and Figs. S3 and S5 of ). While the a priori harmonization seems to mitigate this effect, it does not resolve it. Thus, whether the situation could be improved by improved MAX-DOAS a priori profiles and/or improved satellite averaging kernels should be a focus of future research.
Figure 13
Mean-squared deviation of QA4ECV OMI vs. bePRO MAX-DOAS tropospheric VCD at Uccle and Xianghe split into the squared mean difference (blue), variance of the rolling monthly mean difference (orange) and variance of the residual difference (green) components. For each site, from left to right, (i) baseline comparison, (ii) MAX-DOAS profile smoothed by the OMI averaging kernel, (iii) MAX-DOAS a priori replaced with that of the satellite and (iv) a priori harmonization followed by smoothing are shown. Details of the operations are provided in Sect. S6. At the baseline (i), the squared ex ante uncertainty (divided into components) is also provided. The same squared ex ante uncertainty minus the satellite profile shape uncertainty contribution is provided in (iv).
[Figure omitted. See PDF]
However, one should consider that the retrieved bePRO profiles have a low vertical resolution and depend on their own a priori profile shape. As is well known (Eq. 10 of , see also the general profile harmonization overview of ), a priori profiles of satellite and reference data should be harmonized before comparison and smoothing. Here, we aligned the surface levels of the profiles following and changed the a priori shape profile of the bePRO data to that of the satellite while keeping the bePRO a priori VCD size (which is actually obtained from measurement, see ) intact. More detail on the operations applied is provided in Sect. S6. The harmonization operation reduces all components of the MSD (bias, seasonal cycle and residual component) for the Xianghe site. When smoothing is also applied after the a priori harmonization, the bias component (blue bar in Fig. ) is almost completely removed, but the other two components increase. Therefore, application of the averaging kernel does not necessarily lead to an improvement in all aspects of a comparison; this should be the focus of further research. As the bias component is almost completely removed after harmonization and smoothing at Uccle and Xianghe, one can conclude that – at least at these two sites – the bias is largely due to errors in the a priori profile shape. Therefore, using better quality a priori profiles in both satellite and MAX-DOAS data (e.g. from regional-scale models) is recommended.
When the averaging kernel is applied, it is recommended that the satellite a priori shape component is removed from the uncertainty budget . This component was tentatively assigned 10 % of the VCD value. This only leads to a modest reduction in the combined uncertainty in Fig. (compare the non-hatched and hatched pink bars), as the dominant contribution to the OMI AMF uncertainty component is related to surface albedo rather than profile shape. For example, the combined uncertainty at Uccle decreases from 2.8 to 2.7 . However, the smoothing operation decreases the RMSD at Uccle by about 2 , which strongly suggests that the current 10 % uncertainty assignment is an underestimate.
Table 3Overview of discrepancies and error sources studied in this work. (MXD refers to MAX-DOAS.)
| Contribution | Description |
|---|---|
| Full discrepancy | Negative bias ranging from (OHP) to (Mainz and Thessaloniki). RMSD ranging from 2 (OHP and Bujumbura) to (Xianghe). RMSD dominated by bias in Bujumbura and Thessaloniki, by seasonal cycle dispersion in Nairobi, and by residual dispersion otherwise. |
| Comparison errors | |
| Temporal sampling diff. error | Mitigated by averaging MXD within a 1 interval. No systematic component. Impact on dispersion is (low pollution) to 0.1 to 0.5 (high pollution). |
| Horizontal sampling diff. error | Mitigated by excluding ground pixels not covering the site. Systematic component between 0 and . Impact on dispersion is (low pollution) to (high pollution). |
| Vertical sampling diff. error , surface level | The alignment of the satellite a priori profile to the MXD surface level using the method from changes the bias by . Bujumbura's complicated orography might lead to a higher bias. In Athens, the MXD's location on a hill is a likely source of positive bias |
| Vertical sampling diff. error , top grid level | The MXD VCD is restricted to the lower troposphere. The correction is estimated from the satellite upper tropospheric a priori profile which increases the bias. |
| Horizontal smoothing diff. error | Qualitatively assessed. Contributes to bias in Nairobi, Thessaloniki and Mainz, but does not contribute (significantly) to the bias in OHP, Cabauw or Xianghe. For other sites, the results are mixed. |
| Measurement/retrieval errors | |
| OMI total SCD error | Impact of the noise term on the dispersion is for low pollution sites and negligible for high pollution sites. |
| OMI strat. SCD error | Bias in the stratospheric VCD of translates (via ) to in tropospheric VCD. |
| OMI trop. AMF error | Error of between 32 % and 42 % , dominated by the choice of a priori profile, cloud parameters and surface albedo. This could explain (most or all) of the discrepancy at Uccle, Cabauw/De Bilt and Xianghe. |
| Error due to cloud or aerosol (OMI or MXD) | Strong reduction in bias and/or dispersion due to stricter filtering for Uccle, Mainz, Cabauw and Xianghe. Simulations indicate that cloud or aerosol can cause a factor of 2 underestimation for satellite data and up to a 20 % overestimation for MXD data. |
| Error due to vertical smoothing | Only assessed with bePRO MXD at Uccle and Xianghe, applying a priori harmonization and smoothing. The mean difference decreases from to , and the median difference decreases from to 0 . The RMSD shows a small reduction. |
“Impact on dispersion” refers to the potential reduction in the standard deviation of if the estimated standard deviation due to this particular error source was subtracted in quadrature.
4 ConclusionIn this work, stratospheric and tropospheric VCDs of the QA4ECV OMI 1.1 data product are validated using ground-based NDACC ZSL-DOAS data and MAX-DOAS data respectively. Two MAX-DOAS processing methods are used: the NDACC bePRO profile retrieval and the harmonized QA4ECV column retrieval.
Quality screening according to the data product provider's recommendations is an essential step before the satellite product can be used. However, users (e.g. developers of L3-type temporally averaged data) should be aware that this leads to a preference of cloud-free scenes for tropospheric VCD and, therefore, to a negative sampling bias, especially at polluted sites (a strong reduction in mean VCD from 24 to 15 at Xianghe, and a reduction of a few peta molecules per square centimetre () at Nairobi, Bremen, Thessaloniki and De Bilt/Cabauw). This sampling bias is reduced at De Bilt and Bremen by relaxing the lower bound filter on from 0.2 to 0.05.
The QA4ECV OMI stratospheric VCD has a small (mostly wintertime) bias with respect to the ZSL-DOAS measurements of the order of (5 %–10 %) and a dispersion of 0.2–1 , with good representation of the seasonal cycle.
QA4ECV OMI tropospheric VCD is negatively biased vs. the MAX-DOAS data. This is not unique to this data product: the same conclusion is reached for NASA's OMI OMNO2 data product and for several other tropospheric data products in the literature. The overall discrepancy exceeds the combined ex ante uncertainty of satellite and MAX-DOAS data. This is a contrasting conclusion to that of , who states that uncertainties seem to be overestimated, although their findings were for a single site over a 1-month time period (Tai'an, China, June 2006).
We studied a wide range of potential error sources of the discrepancy in tropospheric VCD between satellite and MAX-DOAS data. An overview is provided in Table .
At several sites, the MAX-DOAS instrument is located close (within satellite pixel distance) to an emission source; therefore, the horizontal smoothing difference error explains (part of) the bias, but there are also a few cases (OHP, Cabauw and Xianghe) where this does not hold. Sampling difference errors were found to be either minor (temporal and horizontal), or they were found to contribute in the opposite direction (vertical).
Measurement/retrieval error in satellite and MAX-DOAS data are other potential sources of discrepancy. Errors in the satellite total SCD and stratospheric SCD do not contribute much, leaving errors in satellite tropospheric AMF or MAX-DOAS data as candidate error sources. Part of the discrepancy is caused by errors in either the satellite or MAX-DOAS measurement induced by (low) clouds and/or aerosol (e.g. at the Mainz and Xianghe sites). According to radiative transfer simulations , these effects impact the satellite tropospheric VCD measurements (factor of decrease) more than the MAX-DOAS measurements (overestimation of 20 % at most). Moreover, the nonuniform vertical sensitivity of OMI and the uncertainty in the a priori profile shape contributes to the discrepancy, as shown here with the QA4ECV OMI vs. bePRO MAX-DOAS comparison. This is in agreement with the work of , who showed that the uncertainty in the retrieval method (due to inter-team retrieval setting differences; shorthand methodological uncertainty) in the tropospheric AMF is dominated by differences in the a priori profile, cloud parameters and surface albedo. Moreover, using this uncertainty estimate for the AMF instead of the ex ante, one can explain the SAT–REF tropospheric VCD discrepancies for three sites (Uccle, Cabauw/De Bilt and Xianghe). For these three sites, consistency can also be reached by filtering cloud parameters more strictly.
Finally, for some of the discrepancies there is no straightforward explanation. An example of this is the negative bias at OHP in wintertime. This is possibly related to a lower tropospheric AMF in wintertime, as the planetary boundary layer is shallower and the SZA is higher. As a result, comparisons become more sensitive to factors such as errors in the profile shape. Another example of the unexplained discrepancy is the negative bias at Nairobi, even when focusing on the months from December to March when tropospheric VCD values measured by MAX-DOAS are relatively low.
The potential impact of the horizontal smoothing difference error was analysed in a rather qualitative way in this work. Analysis using “Observing System Simulation Experiments” at a fine spatial resolution or other experimental set-ups (e.g. sensors measuring in multiple azimuth directions; ) can improve on this.
The inter-team harmonization of MAX-DOAS data within the QA4ECV project is an important step forward for satellite validation, although some issues remain regarding factors such as the harmonization of reported uncertainties. The FRM4DOAS project (
The availability of an ex ante uncertainty for each measurement as well as its decomposition in source components greatly facilitates the validation. However, information on how individual measurement uncertainties should be combined is incomplete in the satellite and MAX-DOAS data files. This limits the ability to check certain things, such as if the respective overall bias, dispersion or seasonal cycle of the bias are within expectations; in this work, we only checked the consistency of the overall discrepancy (expressed as the RMSD) with the combined total uncertainty. It is recommended that information on the systematic/random nature and error correlation is included in the satellite data product.
The ex ante uncertainty for each pixel in the QA4ECV satellite data product is likely underestimated. A solution for this could be to explicitly account for the methodological uncertainty on the AMF in a similar fashion to the process carried out for the QA4ECV data product . Alternatively, the uncertainty component due to the profile shape in the OMI product could be increased, as tests in this work show that the current 10 % assignment is an underestimate. The QA4ECV recommended filter on the AMF ratio can be made less restrictive (e.g. 0.05 lower bound), reducing data loss and sampling bias without compromising the comparisons with MAX-DOAS. Furthermore, the replacement of the coarsely resolved TM5 profiles with high spatial resolution profiles from regional air quality analyses (e.g. CAMS regional,
Code and data availability
The QA4ECV OMI data are available from
Part of the validation processing was performed using the HARP data harmonization toolset (© s[&]t, the Netherlands), which is available at
The supplement related to this article is available online at:
Author contributions
SC coordinated the paper and carried out the validation analysis. TV carried out the stratospheric VCD validation analysis. GP contributed insights into the tropospheric VCD validation analysis. DH, AK, JCL and TV contributed validation expertise. JG, SN and BR created software tools for validation. JG performed general data collection and format harmonization. FH coordinated the creation of the QA4ECV MAX-DOAS improved data sets. AB, JPB, FH, AP, JR, AR, MVR and TW are principal investigators for the QA4ECV MAX-DOAS measurements. FH and MVR are principal investigators for the bePRO MAX-DOAS measurements. FG, AP and JPP are principal investigators for the SAOZ ZSL-DOAS measurements. FB, HE, AR, IDS, AL, JvG, EP, MVR and TW are the authors of the QA4ECV OMI data set. JCL is the coordinator of this research. All authors reviewed and commented on the paper.
Competing interests
The authors declare that they have no conflict of interest.
Acknowledgements
This research was carried out in the framework of the EU FP7 “Quality Assurance for Essential Climate Variables” (QA4ECV; grant no. 5 607405) project with support from the EU H2020 “Gap Analysis for Integrated Atmospheric ECV CLImate Monitoring” (GAIA-CLIM; Ares (2014)3708963/grant no. 640276) project. The authors are particularly grateful to QA4ECV for the generation of harmonized OMI and MAX-DOAS data sets within a rigorous quality assurance framework and to GAIA-CLIM WP3 (Comparison Error Budget Closure) for dedicated metrology support.
Several validation analysis tools were funded by the Belgian Science Policy Office (BELSPO) and ESA through the ACROSAT PRODEX-10 project.
We are grateful to Marina Zara (KNMI) for clarification regarding the different QA4ECV OMI uncertainty fields.
Ground-based ZSL-DOAS data and other MAX-DOAS data used in this publication were obtained as part of the Network for the Detection of Atmospheric Composition Change (NDACC) and are publicly available (see
Financial support
This research has been supported by the European Commission, FP7 (QA4ECV (grant no. 607405)) and Horizon 2020 (GAIA-CLIM (grant no. 640276)), and by the Belgian Federal Science Policy Office (BELSPO), and ESA (ProDEx-10 ACROSAT grant).
Review statement
This paper was edited by Ralf Sussmann and reviewed by two anonymous referees.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2020. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
The QA4ECV (Quality Assurance for Essential Climate Variables) version 1.1 stratospheric and tropospheric
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
; Verhoelst, Tijl 1
; Pinardi, Gaia 1
; Granville, José 1 ; Hubert, Daan 1
; Keppens, Arno 1
; Niemeijer, Sander 2 ; Bruno Rino 2 ; Bais, Alkis 3
; Beirle, Steffen 4
; Boersma, Folkert 5
; Burrows, John P 6
; De Smedt, Isabelle 1
; Eskes, Henk 7
; Goutail, Florence 8
; Hendrick, François 1 ; Lorente, Alba 9
; Pazmino, Andrea 8 ; Piters, Ankie 7 ; Peters, Enno 6
; Pommereau, Jean-Pierre 8
; Remmers, Julia 4 ; Richter, Andreas 6
; Jos van Geffen 7
; Michel Van Roozendael 1 ; Wagner, Thomas 4 ; Jean-Christopher, Lambert 1 1 Royal Belgian Institute for Space Aeronomy (BIRA-IASB), Uccle, Belgium
2 s[&]t Corporation, Delft, the Netherlands
3 Aristotle University of Thessaloniki, Laboratory of Atmospheric Physics (AUTH), Thessaloniki, Greece
4 Max Planck Institute for Chemistry (MPIC), Mainz, Germany
5 Royal Netherlands Meteorological Institute (KNMI), De Bilt, the Netherlands; Wageningen University, Meteorology and Air Quality Group, Wageningen, the Netherlands
6 Institute of Environmental Physics, University of Bremen (IUP-B), Bremen, Germany
7 Royal Netherlands Meteorological Institute (KNMI), De Bilt, the Netherlands
8 Laboratoire Atmosphères, Milieux, Observations Spatiales, CNRS, Guyancourt, France
9 Wageningen University, Meteorology and Air Quality Group, Wageningen, the Netherlands





