Validation of tropospheric NO2 column

Full text

Turn on search term navigation

1 Introduction

Nitrogen dioxide ( ${NO}_{2}$ ) is a key species for atmospheric chemistry, present both in the stratosphere and in the troposphere. In the troposphere, nitrogen oxides ( ${NO}_{x} = NO + {NO}_{2}$ ) together with volatile organic compounds are key ingredients for ozone and photochemical smog formation in polluted regions. By reaction with the hydroxyl radical (OH), ${NO}_{2}$ forms nitric acid ( ${HNO}_{3}$ ), which leads to acid rain and consequently acidifies soils and waterbodies with negative impacts on the environment. In addition to its important role in air quality (human health and environmental acidification), ${NO}_{2}$ is also relevant for climate processes at high concentrations, contributing to direct radiative forcing and the extension of atmospheric lifetimes of gases such as ${CH}_{4}$ . The main sources of ${NO}_{x}$ include anthropogenic and natural emissions, such as fossil fuel combustion, biomass burning, lightning and microbial soil emissions. There is a need for accurate ${NO}_{2}$ measurements to assess and forecast its impact on air quality.

${NO}_{2}$ can be measured by several methods, such as in situ sampling and active or passive remote sensing. The differential optical absorption spectroscopy (DOAS) technique (Platt and Stutz, 2008) is widely used to retrieve ${NO}_{2}$ in the atmosphere from measurements taken from satellites, from balloons and from the ground. Since the mid-nineties, ${NO}_{2}$ has been measured from space by mid-morning low earth orbit (LEO) nadir satellite instruments, such as GOME on ERS-2 (1996–2003; Burrows et al., 1999), SCIAMACHY on ENVISAT (2002–2012; Bovensmann et al., 1999) and GOME-2 on MetOp A, B and C (since 2006, 2012 and November 2018 respectively; Munro et al., 2016). From 2004 onwards, ${NO}_{2}$ measurements in the early afternoon have also been performed from the OMI imaging spectrometer on the EOS-Aura platform (Levelt et al., 2006) and since the end of 2017 from the Sentinel-5P TROPOMI instrument (Veefkind et al., 2012). In the last 15 years, ground-based MAX-DOAS (multi-axis differential optical absorption spectroscopy) instruments have been developed to measure tropospheric trace gases (Hönninger and Platt, 2002; Hönninger et al., 2004; Sinreich et al., 2005). Combined with profiling algorithms, this technique has been successfully applied to retrieve tropospheric columns and information on the vertical distribution of ${NO}_{2}$ , $HCHO$ , ${SO}_{2}$ , $BrO$ , $IO$ , $HONO$ , $CHOCHO$ and aerosols (e.g., Bobrowski et al., 2003; Wittrock et al., 2004; Wagner et al., 2004; Heckel et al., 2005; Frieß et al., 2006, 2016; Sinreich et al., 2007; Theys et al., 2007; Irie et al., 2008b, 2009; Clémer et al., 2010; Galle et al., 2010; Hendrick et al., 2014). Direct sun observations in the UV–visible, which provide total column measurements (Cede et al., 2006; Wenig et al., 2008; Herman et al., 2009; Wang et al., 2010), are also used for monitoring atmospheric ${NO}_{2}$ . In particular, the recently developed Pandora instrument (SciGlob, http://www.sciglob.com/, last access: 29 October 2020) operationally provides direct sun measurements of $O_{3}$ and ${NO}_{2}$ and ${SO}_{2}$ and $HCHO$ in a scientific mode (Herman et al., 2009, 2018, 2019; Wang et al., 2010; Tzortziou et al., 2015; Fioletov et al., 2016; Spinei et al., 2018) at a growing number of sites.

One of the strengths of LEO nadir satellite instruments with wide swath width, like OMI and GOME-2, is their daily global coverage. Their main drawback is their limited revisit frequency and associated sampling of the diurnal cycle (typically one overpass per day for midlatitudes) and coarse spatial resolution (from a few to several hundreds of kilometers). The accuracy of the different satellite datasets is also of concern, e.g., for trend analysis or diurnal variation studies. Validation activities, which are an essential part of any satellite program, aim at deriving independently a set of indicators characterizing the quality of the data product. They encompass the monitoring of instrumental stability as well as the inter-sensor consistency needed to ensure continuity between different satellite missions. Satellite validation also contributes to the improvement of retrieval algorithms through investigation of the accuracy of the data products and their sensitivity to retrieval parameter choices. Tropospheric satellite data products depend on various sources of ancillary data, e.g., a priori vertical distribution of the absorbing and scattering species, surface albedo and information on clouds and aerosols (Boersma et al., 2004; Lin et al., 2015; Lorente et al., 2017; Liu et al., 2019a). In the case of ${NO}_{2}$ , separation between stratospheric and tropospheric contributions is an additional source of complexity in the retrieval, and there is considerable debate on the importance of the role of free tropospheric (background) ${NO}_{2}$ in the retrieval process (Jiang et al., 2018; Silvern et al., 2019). As discussed by Richter et al. (2013), the validation of tropospheric reactive gases (such as ${NO}_{2}$ , $HCHO$ and ${SO}_{2}$ ) is also challenging because short atmospheric lifetimes, local emission sources and transport can lead to a large variability of their concentrations in time and space (both vertically and horizontally). Active photochemistry and transport processes lead to important diurnal variations cycles (Boersma et al., 2008) that need to be considered for validation studies. MAX-DOAS and direct sun remote-sensing techniques have large potential capacities for the validation of satellite trace gas observations, as they measure all day long and provide accurate measurements of integrated column amounts (i.e., a quantity close to that measured by spaceborne instruments). Remote sensing measurements also match the horizontal resolution of satellite observations better than e.g., surface in situ monitoring networks. The spatial averaging of MAX-DOAS measurements has been quantified and shown to range from a few kilometers to tens of kilometers depending on aerosol content and measurement wavelength (Irie et al., 2011, 2012; Wagner et al., 2011; Wang et al., 2014; Gomez et al., 2014; Ortega et al., 2015).

In the last decade, several studies have compared different SCIAMACHY, GOME-2 and OMI ${NO}_{2}$ data products (generated by both operational and scientific prototype processors) to MAX-DOAS measurements at various stations (e.g., Brinksma et al., 2008; Hains et al., 2010; Vlemmix et al., 2010; Irie et al., 2008a; Ma et al., 2013; Lin et al., 2014; Wang et al., 2017; Drosoglou et al., 2017, 2018; Liu et al., 2019a, b, 2020). JAMSTEC data from the MADRAS network have been used in Kanaya et al. (2014) for the validation of the OMI DOMINO and NASA tropospheric ${NO}_{2}$ data. BIRA-IASB MAX-DOAS stations have been regularly used for the validation of GOME-2 GDP (GOME Data Processor) products from MetOp-A and MetOp-B (Valks et al., 2011; Pinardi et al., 2011, 2014, 2015; Liu et al., 2019b) as part of the AC SAF activities (Hassinen et al., 2016; see also https://cdop.aeronomie.be/validation/valid-results, last access: 29 October 2020). Pandora datasets have also been used in satellite validation of total and tropospheric ${NO}_{2}$ columns (Herman et al., 2009; Tzortziou et al., 2014, 2015; Judd et al., 2019, and a recent study of Herman et al. (2019) presented an overview at 14 Pandora sites showing that NASA OMI ${NO}_{2}$ overpass data consistently underestimate the Pandora-derived ${NO}_{2}$ amounts. One general conclusion of these exercises was to find a low bias of the satellites tropospheric ${NO}_{2}$ columns in urban conditions and, in contrast, a better agreement with ground-based data in background and pristine locations (Celarier et al., 2008; Halla et al., 2011; Kanaya et al., 2014). However Irie et al. (2012) also reported low OMI ${NO}_{2}$ column values over China in summer, when the spatial distribution of ${NO}_{2}$ was likely homogeneous.

In the present study, we validate GOME-2A and OMI tropospheric ${NO}_{2}$ column measurements using data from a large number of MAX-DOAS and direct sun instruments operating in Europe, Asia, North America and Africa under a wide variety of atmospheric conditions and pollution patterns. Some of these datasets have already been used in the past for tropospheric ${NO}_{2}$ validation of different satellites and products and participated in the CINDI-1 and/or 2 intercomparison campaigns (Piters et al., 2012; Kreher et al., 2020). In the present study we combine them in a coordinated way, allowing for a global approach to satellite validation, sampling different ${NO}_{2}$ levels in various locations around the globe. In addition the smearing (or dilution) of the ${NO}_{2}$ field due to the limited horizontal resolution of satellite measurements is investigated. A method for the quantification and correction of the dilution effect is proposed, and its impact on validation results is quantitatively evaluated. Our validation approach is applied to operational OMI DOMINO and AC SAF GOME-2A products as well as to climate data record OMI and GOME-2A ${NO}_{2}$ data products generated within the EU QA4ECV project.

The paper is structured as follows: Sects. 2 and 3 describe the OMI and GOME-2A sensors and datasets as well as the reference ground-based measurements. Section 4 presents the comparison methodology, and comparison results are discussed in Sect. 5. In Sect. 6, we concentrate on the quantification of horizontal dilution effects in satellite measurements performed around the measurement sites, and we show how these effects impact the validation results in urban conditions. Section 7 presents a summary of the validation results, and conclusions are detailed in Sect. 8.

Satellite tropospheric ${NO}_{2}$ datasets

Tropospheric ${NO}_{2}$ data products from spaceborne sensors are generally retrieved via three main steps: firstly, a DOAS spectral analysis, yielding the total column amount of ${NO}_{2}$ along the slant optical path; secondly an estimation of the stratospheric ${NO}_{2}$ column, to be subtracted from the total column to derive the tropospheric contribution (so-called “residual” technique); and finally a conversion from slant (SCD) to vertical (VCD) column densities. The last step is based on air mass factor (AMF) calculations which require a priori knowledge of the ${NO}_{2}$ vertical distribution, pressure and temperature, surface albedo and aerosols and information on (effective) cloud cover and height (Boersma et al., 2004). The retrieval of tropospheric ${NO}_{2}$ is given by 1 ${VCD}_{tropo} = \frac{(SCD - {AMF}_{strato} \cdot {VCD}_{strato})}{{AMF}_{tropo}} .$ Different data products have been generated for each satellite instrument, using different assumptions for each of the three aforementioned steps (see Boersma et al., 2004; Richter et al., 2011; Lin et al., 2014; Bucsela et al., 2013; Lamsal et al., 2014; van Geffen et al., 2015; Krotkov et al., 2016; Lorente et al., 2017; Liu et al., 2019a, b, 2020). In addition to instrument-specific differences, structural uncertainties arising from the application of different retrieval methodologies to the same satellite observations (sometimes also called forward model uncertainties) can introduce differences in the retrieved tropospheric ${NO}_{2}$ columns ( ${VCD}_{tropo}$ ) of 10 %–50 % (e.g., van Noije et al., 2006; Lorente et al., 2017; Zara et al., 2018). SCD structural uncertainties generally do not exceed $1 \times 10^{15}$ $molecules {cm}^{- 2}$ , while the AMF calculation leads to more significant uncertainties (Boersma et al., 2004), which can be separated into implementation differences (when different groups use identical ancillary data for the calculation of tropospheric ${NO}_{2}$ AMFs) of about 6 % and structural differences, due to ancillary data selection, which can reach 31 %–42 % (Lorente et al., 2017). The uncertainty in separating the stratospheric and tropospheric columns is about $0.5 \times 10^{15}$ $molecules {cm}^{- 2}$ (Dirksen et al., 2011; Lorente et al., 2017).

In the present study, we focus on the ground-based validation of the mid-morning GOME-2A and the early afternoon OMI data. Illustration of the validation method and step-by-step results along the paper are given for the GOME-2A GDP (GOME Data Processor) 4.8 ${NO}_{2}$ operational data product (Valks et al., 2011) and the OMI DOMINO v2.0 data product (Boersma et al., 2011), while final validation results and discussion also gather results for the GOME-2A and OMI QA4ECV products (Boersma et al., 2018; Zara et al., 2018). All products are briefly presented in Table 1 and in the following subsections.

Table 1

Description of the satellite retrievals algorithms involved in this study.

	GOME-2A		OMI
	GOME-2A GDP 4.8	GOME-2A QA4ECV v1.1	OMI QA4ECV v1.1	OMI DOMINO v2.0
Instrument information
Resolution at nadir
(across $\times$ along track)	$80 km \times 40$ $km$ $^{*}$		$24 km \times 13$ $km$
Solar local time at
Equator crossing node	09:30		13:30
${NO}_{2}$ retrieval information
Version	GOME-2A GDP 4.8	GOME-2A QA4ECV v1.1	OMI QA4ECV v1.1	OMI DOMINO v2.0
Reference	Valks et al. (2011, 2017)	Boersma et al. (2018),	Boersma et al. (2018),	Boersma et al. (2011)
		Zara et al. (2018)	Zara et al. (2018)
SCD retrieval	DOAS fitting window:	DOAS fitting window:	DOAS fitting window:	DOAS fitting window:
	425–450 $nm$	405–465 $nm$	405–465 $nm$	A 405–465 $nm$
	Absorbers: ${NO}_{2}$ , $O_{3}$ ,	Absorbers: ${NO}_{2}$ , $O_{3}$ ,	Absorbers: ${NO}_{2}$ , $O_{3}$ ,	Absorbers: ${NO}_{2}$ , $O_{3}$ ,
	$O_{2} - O_{2}$ , $H_{2} O$	$O_{2} - O_{2}$ , $H_{2} O$ ,	$O_{2} - O_{2}$ , $H_{2} O$ ,	$H_{2} O$ and Ring
	and Ring	$H_{2} O$ liq and Ring	$H_{2} O$ liq and Ring
Stratospheric	Spatial filtering/	Assimilated ${NO}_{2}$	Assimilated ${NO}_{2}$	Assimilated ${NO}_{2}$
Correction	masking of	stratospheric slant	stratospheric slant	stratospheric slant
	polluted	columns with	columns with	columns with
	fields	the TM5-MP (selected	the TM5-MP (selected	the TM4
		as default) STREAM	as default) STREAM	chemistry transport
		(Beirle et al., 2016)	(Beirle et al., 2016)	model
Tropospheric AMF calculation
Radiative transfer model	LIDORT	DAK 3.0	DAK 3.0	DAK 3.0
${NO}_{2}$ a priori	Monthly profiles for 1997	Daily profiles from	Daily profiles from	Daily profiles from
profile	from MOZARTv2	TM5-MP model	TM5-MP model	TM4 model
	(Horowitz et al., 2003),	(Williams et al., 2017),	(Williams et al., 2017),	(Huijnen et al., 2010),
	$1.875^{\circ} \times 1.875^{\circ}$ resolution	$1^{\circ} \times 1^{\circ}$ resolution	$1^{\circ} \times 1^{\circ}$ resolution	$2^{\circ} \times 3^{\circ}$ resolution
Cloud treatment	IPA correction based on	IPA correction based on	IPA correction based on	IPA correction based on
	OCCRA/ROCINN cloud	FRESCO $+$ cloud algorithm	OMCLDO2 cloud algorithm	OMCLDO2 cloud algorithm
	scheme v3	(Wang et al., 2008)	(Veefkind et al., 2016)	(Acarreta et al., 2004;
	(Loyola et al., 2018)			Stammes et al., 2008)
Aerosol	Implicitly corrected	Implicitly corrected	Implicitly corrected	Implicitly corrected
	by cloud treatment	by cloud treatment	cloud treatment	by cloud treatment
Albedo	1.25 $^{\circ}$ long $\times$ 1 $^{\circ}$ lat surface	Climatology from	Updated 5-year climatology	$0.5^{\circ} \times 0.5^{\circ}$ OMI
	LER climatology	Tilstra et al. (2017)	(Kleipool et al., 2008)	climatology
	derived from combined			(Kleipool et al., 2008)
	TOMS–GOME measurements
	(Boersma et al., 2004)
Overall estimated	$1.0 \times 10^{15}$ $molecules {cm}^{- 2}$	Average of 35 % to 45 %	Average of 35 % to 45 %	$1.0 \times 10^{15}$ $molecules {cm}^{- 2}$
uncertainty of	( $\pm 25$ %) in rural environments	single pixel uncertainties	single pixel uncertainties	( $\pm 25$ %)
tropospheric ${NO}_{2}$	and from 40 % to 80 % under	in polluted regions	in polluted regions	(Boersma et al., 2011;
vertical column	polluted conditions	(Boersma et al., 2018)	(Boersma et al., 2018)	Lin et al., 2014;
densities	(Valks et al., 2011)			Lamsal et al., 2014)

$^{*}$ Since 15 July 2013 GOME-2A has been operating in a reduced swath mode, corresponding to a ground pixel size of $40 km \times 40 km$ .

2.1 GOME-2 products

The second Global Ozone Monitoring Instrument (GOME-2) is a nadir-looking UV–visible spectrometer measuring the solar radiation backscattered by the atmosphere and reflected by the Earth and clouds in the 240–790 $nm$ wavelength interval, with a spectral resolution of 0.2–0.5 $nm$ full width at half maximum (FWHM; Munro et al., 2016). There are three versions of GOME-2 instruments flying on a sun-synchronous polar orbit on board the Meteorological Operational satellites (MetOp-A, MetOp-B and MetOp-C, launched respectively in October 2006, September 2012 and November 2018). They have an Equator crossing time of 09:00–09:30 local time in the descending node. In this study we concentrate on the GOME-2A instrument (that is on MetOp-A), which presents the longest data record. The default swath width of the GOME-2A across-track scan is 1920 $km$ , allowing global Earth coverage within 1.5–3 $d$ at the Equator, with a nominal ground pixel size of $80 km \times 40$ $km$ . Since 15 July 2013, GOME-2A has been measuring in a reduced swath mode of 960 $km$ , with a ground pixel size of $40 km \times 40$ $km$ .

Operational products are retrieved from GOME-2 measurements in the framework of the Atmospheric Composition Satellite Application Facility AC SAF (https://acsaf.org/, last access: 29 October 2020; formerly O3M SAF; see also Hassinen et al., 2016). Total, tropospheric and stratospheric ${NO}_{2}$ columns are operationally retrieved with the GOME Data Processor (GDP, and a description of this algorithm can be found in Valks et al. (2011) and Liu et al. (2019b). Within the QA4ECV (Quality Assurance for Essential Climate Variables) project, a coherent offline ${NO}_{2}$ dataset has been created for GOME, SCIAMACHY, GOME-2A and OMI (Boersma et al., 2018; Zara et al., 2018; Lorente et al., 2017), and comparisons with this dataset are also included at the end of this study.

Table 1 summarizes the main retrieval steps for the various tropospheric ${NO}_{2}$ products considered here. The main differences are related to the methods to obtain the stratospheric ${NO}_{2}$ column, the cloud parameters and the a priori information used to calculate the tropospheric air mass factor. In the Q4ECV case, stratospheric columns are derived using two different approaches (assimilation in TM4 and STREAM). The stratospheric separation method has an estimated uncertainty in the 0.15– $0.3 \times 10^{15}$ $molec {cm}^{- 2}$ range (Valks et al., 2011). The typical overall uncertainty for individual retrievals of tropospheric ${NO}_{2}$ vertical column densities is estimated to be $1.0 \times 10^{15}$ $molecules {cm}^{- 2}$ ( $\pm 25$ %) in rural environments and from 40 % to 80 % under polluted conditions (Valks et al., 2011).

Previous validation of GOME-2A GDP 4.8 data can be found in Valks et al. (2011), Hassinen et al. (2016) and Liu et al. (2019b) for a few MAX-DOAS stations, and results of regular validation exercises can be found at http://cdop.aeronomie.be/validation/valid-results (last access: 29 October 2020). Satellite-to-satellite comparisons of the GOME-2A QA4ECV data have been performed by Zara et al. (2018), Lorente et al. (2017) and Liu et al. (2019b). Previous GOME-2 validation highlighted the effect of GOME-2 large pixels and the aerosol shielding effect, leading, e.g., to differences of 5 % to 25 % over China (Ma et al., 2013; Wu et al., 2013; Wang et al., 2017; Drosoglou et al., 2018). Liu et al. (2019b) showed possible improvements of the GDP 4.8 product, leading to reduced discrepancies of the satellite-to-ground-based biases of the order of 10 % to 25 % for several MAX-DOAS stations.

2.2 OMI products

OMI (Ozone Monitoring Instrument) is a nadir-viewing imaging spectrometer with a spectral resolution of about 0.5 $nm$ FWHM (Levelt et al., 2006). The light entering the telescope is depolarized using a scrambler and split into two spectral bands: a UV channel (wavelength range 270–380 $nm$ ) and a visible channel (wavelength range 350–500 $nm$ ). The 114 $^{\circ}$ viewing angle of the telescope corresponds to a 2600 $km$ wide swath on the Earth's surface distributed over 60 cross-track positions, which enables quasi-global coverage in 1 d. In the nominal global operation mode, the OMI ground pixel size varies from $13 km \times 24 km$ at true nadir to $28 km \times 150 km$ on the edges of the swath. OMI is on board the EOS-Aura satellite that was launched in July 2004, in a sun-synchronous polar orbit crossing the Equator around 13:45 LT (in ascending node). The radiometric stability of the OMI instrument is exceptionally good (Schenkeveld et al., 2017); however, since June 2007, several rows of the detector have been affected by a signal reduction, the so-called “row anomaly” (http://www.knmi.nl/omi/research/product/rowanomaly-background.php, last access: 29 October 2020), reducing the usable swath coverage (see Boersma et al., 2018).

The DOMINO (Derivation of OMI tropospheric ${NO}_{2}$ ) product is distributed in NRT via the TEMIS (Tropospheric Emission Monitoring Internet Service; http://www.temis.nl, last access: 29 October 2020) project (Boersma et al., 2011). The offline OMI QA4ECV v1.1 product (Boersma et al., 2018) is very similar to the GOME-2A product, as can be seen in Table 1. For OMI, the stratospheric separation is performed using a data assimilation scheme based on the TM4 or TM5-MP chemistry transport models. Its uncertainty is estimated to be about 0.2– $0.3 \times 10^{15}$ $molec {cm}^{- 2}$ (Boersma et al., 2004; Dirksen et al., 2011). Stratospheric ${NO}_{2}$ vertical columns used in our study are derived from assimilated stratospheric slant columns divided by a geometrical air mass factor, as described in Hendrick et al. (2012). For the OMI QA4ECV dataset, two estimates of the stratospheric column are reported (data assimilation and STREAM), and Boersma et al. (2018) illustrated the differences for both approaches, with differences of up to $1 \times 10^{15}$ $molec {cm}^{- 2}$ . Compernolle et al. (2020) showed best agreement with ZSL-DOAS NDACC measurements for the STREAM stratospheric dataset, with mean differences between the two datasets of the order of $0.2 \times 10^{15}$ $molec {cm}^{- 2}$ on average.

OMI DOMINO v2.0 has been widely used in the past, and several validation exercises (Brinksma et al., 2008; Hains et al., 2010; Vlemmix et al., 2010; Irie et al., 2008a, 2012; Lin et al., 2014; Wang et al., 2017; Drosoglou et al., 2017, 2018; Liu et al., 2019a) found underestimation of the OMI tropospheric ${NO}_{2}$ columns in urban conditions and a better agreement in background locations (Celarier et al., 2008; Halla et al., 2011; Kanaya et al., 2014). Kanaya et al. (2014) showed close correlations with MAX-DOAS observations at seven stations but found low biases of up to $\sim 50$ %. Regarding the OMI QA4ECV product, Boersma et al. (2018) reported a first validation at the Tai'an station (China) in 1 summer month, finding good agreement (bias of $- 2$ %) with respect to MAX-DOAS ${NO}_{2}$ columns (better than the agreement found for DOMINO v2 of $- 11$ % bias). Liu et al. (2019a) investigated the impact of correcting for aerosol vertical profiles in the OMI data and compared four OMI datasets (POMINO and POMINO v1.1, DOMINO v2.0 and QA4ECV) with respect to data of three Chinese stations. Results suggested a significant improvement of the OMI ${NO}_{2}$ retrieval when correcting for aerosol profiles, in general and for hazy days. This is consistent with the previous finding that the accuracy of DOMINO v2.0 is reduced for polluted, aerosol-loaded scenes (Boersma et al., 2011; Kanaya et al., 2014; Lin et al., 2014; Chimot et al., 2016). Liu et al. (2019a) also established discrepancies in DOMINO v2.0 for very high ${NO}_{2}$ values (>70 $\times 10^{15}$ $molec {cm}^{- 2}$ ). For 18 cloud-free days, they found smaller differences between the four products with respect to MAX-DOAS, with the QA4ECV dataset having the highest $R^{2}$ (0.63) and the lowest bias ( $- 5, 8$ %). An extended validation of the QA4ECV OMI product is reported in the recent Compernolle et al. (2020) study, showing a negative bias (from $-$ 1 to $-$ 4 $\times 10^{15}$ $molec {cm}^{- 2}$ ) with respect to 10 MAX-DOAS instruments, a feature also found for the OMI OMNO2 standard data product. They also found that the tropospheric VCD discrepancies between satellite and ground-based data exceed the combined measurement uncertainties and that, depending on the site, this discrepancy could be attributed to a combination of comparison errors (horizontal smoothing difference error, error related to clouds and aerosols and differences due to a priori profile assumptions).

3 Ground-based datasets: MAX-DOAS and direct sun measurements

3.1 MAX-DOAS technique

A MAX-DOAS instrument measures the scattered sunlight under a sequence of viewing elevation angles extending from the horizon to the zenith (Fig. 1a). At low elevation angles, the observed sunlight travels a long path in the lower troposphere (under aerosol-free conditions, the lower the elevation angle, the longer the path), while all observations have approximately the same light path in the stratosphere, independently of viewing elevation. By taking the difference in SCD between off-axis observations and a (nearly) simultaneously acquired zenith reference spectrum (the differential slant column), the stratospheric contribution can therefore be eliminated. Tropospheric absorbers can be measured along the day, generally up to a solar zenith angle (SZA) of approximately 85 $^{\circ}$ (Hönninger et al., 2004; Sinreich et al., 2005).

Figure 1

Sketches illustrating the MAX-DOAS and direct sun viewing geometries.

[Figure omitted. See PDF]

Radiance spectra acquired at different elevation angles are analyzed using the DOAS method (Platt and Stutz, 2008), which gives integrated trace gas concentrations along the atmospheric absorption path. The resulting differential slant columns (dSCDs) can be converted to vertical columns and/or vertical profiles using methods of different levels of complexity. Table 2 presents details about the retrieval strategy adopted by different teams. They generally belong to one of the following categories:

-
Geometrical approximation (GA). The vertical column is determined under the assumption that a single-scattering approximation can be made for moderately high elevation angles $α$ (typically 30 $^{\circ}$ ) so that a simple geometrical air mass factor ( $AMF α \equiv SCD / VCD = 1 / sin⁡ (α)$ ) (Hönninger et al., 2004; Brinksma et al., 2008; Ma et al., 2013) can be used.
-
QA4ECV datasets. The vertical column is calculated using tropospheric AMFs based on climatological profiles and aerosol situations as developed during the QA4ECV project (http://uv-vis.aeronomie.be/groundbased/QA4ECV_MAXDOAS/QA4ECV_MAXDOAS_readme_website.pdf, last access: 29 October 2020). These data are less sensitive to the relative azimuth angle than the purely geometric approximation presented above.
-
Vertical profile algorithms using an optimal estimation method (OEM; Rodgers, 2000). These make use of a priori vertical profiles and associated uncertainties (Frieß et al., 2006; Clémer et al., 2010; Hendrick et al., 2014; Wang et al., 2017; Friedrich et al., 2019; Bösch et al., 2018).
-
Vertical profile algorithms based on parameterized profile shape functions. These make use of analytical expressions to represent the trace gas profile using a limited number of parameters (Irie et al., 2008a, b; 2011; Li et al., 2010; Vlemmix et al., 2010; Wagner et al., 2011; Beirle et al., 2019).

Table 2

MAX-DOAS tropospheric ${NO}_{2}$ datasets included in this study (23 stations, 15 with profiles). GA stands for geometrical approximation, OEM for optimal estimation method and PP for parameterized profiling.

Station/country (lat/long)	Station type	Owner/group	Time period	Instrument type	Retrieval type	Reference
Bremen/Germany (53 $^{\circ}$ N, 9 $^{\circ}$ E)	Urban	IUPB	January 2007–August 2018	Custom-built MAX-DOAS	VCD from QA4ECV	QA4ECV dataset
De Bilt/the Netherlands (52.10 $^{\circ}$ N, 5.18 $^{\circ}$ E)	Urban	KNMI	November 2007–August 2018	miniDOAS	VCD with fixed profile shape	Vlemmix et al. (2010)
						QA4ECV and
						NID-FORVAL datasets
Uccle/Belgium (50.78 $^{\circ}$ N, 4.35 $^{\circ}$ E)	Urban	BIRA-IASB	April 2011–February 2016	miniDOAS	VCD and profiles from OEM	Gielen et al. (2014)
Mainz/Germany (50 $^{\circ}$ N, 8 $^{\circ}$ E)	Urban	MPIC	June 2013–August 2018	Custom-built MAX-DOAS	VCD from QA4ECV	QA4ECV dataset
Thessaloniki/Greece (40.63 $^{\circ}$ N, 22.96 $^{\circ}$ E)	Urban	AUTH	January 2011–August 2018	Phaethon	VCD from QA4ECV	Kouremeti et al. (2013),
						Drosoglou et al. (2017)
						QA4ECV datasets
Beijing/China (39.98 $^{\circ}$ N, 116.38 $^{\circ}$ E)	Urban	BIRA-IASB/IAP	July 2008–April 2009	Custom-built MAX-DOAS	VCD and profiles from OEM	Clémer et al. (2010),
						Hendrick et al. (2014),
						Vlemmix et al. (2015)
Beijing/China (39.95 $^{\circ}$ N, 116.32 $^{\circ}$ E)	Urban	CAMS	August 2008–September 2011	miniDOAS	VCD from GA at 30 $^{\circ}$ elev	Ma et al. (2013)
Athens/Greece (38.05 $^{\circ}$ N, 23.86 $^{\circ}$ E)	Urban	IUPB/NOA	September 2012–August 2018	Custom-built MAX-DOAS	VCD from QA4ECV	QA4ECV datasets
Chiba/Japan (35.63 $^{\circ}$ N, 140.10 $^{\circ}$ E)	Urban	ChibaU	June 2012–July 2017	CHIBA-U MAX-DOAS	VCD and profiles from PP	Irie et al. (2011, 2012, 2015, 2019)
Yokosuka/Japan (35.32 $^{\circ}$ N, 139.65 $^{\circ}$ E)	Urban	JAMSTEC	October 2007–December 2015	MADRAS MAX-DOAS	VCD and profiles from PP	Kanaya et al. (2014)
Gwangju/South Korea (35.23 $^{\circ}$ N, 126.84 $^{\circ}$ E)	Urban	JAMSTEC	January 2008–December 2015	MADRAS MAX-DOAS	VCD and profiles from PP	Kanaya et al. (2014)
Nairobi/Kenya (1 $^{\circ}$ S, 36.50 $^{\circ}$ E)	Urban	IUPB	February 2011–December 2013	Custom-built MAX-DOAS	VCD from QA4ECV	QA4ECV datasets
Bujumbura/Burundi (3 $^{\circ}$ S, 29 $^{\circ}$ E)	Urban	BIRA-IASB	November 2013–July 2017	Custom-built MAX-DOAS	VCD and profiles from OEM	De Smedt et al. (2015),
						Gielen et al. (2017)
Zvenigorod/Russia (55.70 $^{\circ}$ N, 36.78 $^{\circ}$ E)	Suburban	JAMSTEC	October 2008–December 2012	MADRAS MAX-DOAS	VCD and profiles from PP	Kanaya et al. (2014)
Xianghe/China (39.75 $^{\circ}$ N, 116.96 $^{\circ}$ E)	Suburban	BIRA-IASB	March 2010–August 2018	Custom-built MAX-DOAS	VCD and profiles from OEM	Hendrick et al. (2014),
						Vlemmix et al. (2015)
Tsukuba/Japan (36.05 $^{\circ}$ N, 140.12 $^{\circ}$ E)	Suburban	ChibaU	January 2007–April 2014	CHIBA-U MAX-DOAS	VCD and profiles from PP	Irie et al. (2011, 2012, 2015, 2019)
Kasuga/Japan (33.52 $^{\circ}$ N, 130.48 $^{\circ}$ E)	Suburban	ChibaU	December 2013–July 2017	CHIBA-U MAX-DOAS	VCD and profiles from PP	Irie et al. (2011, 2012, 2015, 2019)
Cabauw/the Netherlands (51.97 $^{\circ}$ N, 4.93 $^{\circ}$ E)	Remote	KNMI	March 2011–August 2018	miniDOAS	VCD from QA4ECV	QA4ECV and
						NID-FORVAL datasets
Hohenpeissenberg/Germany (47.80 $^{\circ}$ N, 11.67 $^{\circ}$ E)	Remote	IUPH/DWD	March 2012–December 2012	Custom-built MAX-DOAS	VCD and profiles from OEM	Yilmaz (2012),
						Niebling (2010)
OHP/France (43.94 $^{\circ}$ N, 5.71 $^{\circ}$ E)	Remote	BIRA-IASB	February 2005–December 2016	Custom-built MAX-DOAS	VCD from QA4ECV	Valks et al. (2011)
						QA4ECV datasets
Fukue/Japan (32.75 $^{\circ}$ N, 128.68 $^{\circ}$ E)	Remote	JAMSTEC	March 2009–December 2015	MADRAS MAX-DOAS	VCD and profiles from PP	Kanaya et al. (2014)
Cape Hedo/Japan (26.87 $^{\circ}$ N, 128.25 $^{\circ}$ E)	Remote	JAMSTEC	April 2007–December 2015	MADRAS MAX-DOAS	VCD and profiles from PP	Kanaya et al. (2014)
Réunion Le Port/Réunion Island (20.9 $^{\circ}$ S, 55.36 $^{\circ}$ E)	Remote	BIRA-IASB	April 2016–January 2018	Custom-built MAX-DOAS	VCD and profiles from OEM	Theys et al. (2007)

MAX-DOAS profile inversion algorithms use a two-step approach: in the first step, aerosol extinction profiles are retrieved from the measured absorption of the oxygen dimer $O_{4}$ (Wagner et al., 2004; Frieß et al., 2006). In a second step, trace gas profiles are retrieved from the measured trace gas absorptions, taking into account the aerosol extinction profiles retrieved in the first step. Both OEM and parameterized profiling approaches provide vertical profiles of aerosols and ${NO}_{2}$ with a sensitivity typically in the 0–4 $km$ altitude range, with generally between 1.5 and 3 independent pieces of information in the vertical dimension (Vlemmix et al., 2015; Frieß et al., 2016, 2019; Tirpitz et al., 2020). This complementary information on the vertical distribution of gases and aerosols in the atmosphere has been used in some studies to test some key assumptions made in the satellite data retrieval, in particular the a priori ${NO}_{2}$ profile and aerosols content, providing therefore more insight into the quality of the satellite data (e.g., Wang et al., 2017b; Liu et al., 2019b, 2020; Compernolle et al., 2020). Recent intercomparison studies (Vlemmix et al., 2015; Frieß et al., 2019; Tirpitz et al., 2020) show that both OEM and parameterized inversion approaches lead to consistent results in terms of tropospheric vertical column but larger differences in terms of profiles. In this study, every data provider submitted data retrieved with their own tools and formats, without any harmonization. Our study focuses therefore only on the vertical column, which is the more robust and reliable retrieved quantity. The time coverage of the different datasets used in this study is presented in Fig. S1 in the Supplement.

The accuracy of the MAX-DOAS technique depends on the SCD retrieval noise, the uncertainty of the ${NO}_{2}$ absorption cross sections and most importantly the uncertainty of the tropospheric AMF calculation. The estimated total error on ${NO}_{2}$ VCD is of the order of 7 %–17 % in polluted conditions. This includes both random (around 3 % to 10 % depending on the instruments) and systematic (11 % to 14 %) contributions (e.g., Irie et al., 2008, 2011, 2012; Wagner et al., 2011; Hendrick et al., 2014; Kanaya et al., 2014). In extreme cases, the error can however reach $\sim 30 %$ depending on geometry and aerosols.

3.2 Direct sun technique

Equipped with a 2-axis positioner, direct-sun-capable DOAS instruments measure non-scattered photons. Such instruments are equally sensitive to both tropospheric and stratospheric absorptions (Fig. 1b). They have a very small uncertainty in AMF and can provide accurate total column measurements with a minimum of a priori assumptions.

Direct sun (DS) observations are routinely available from Pandora spectrometer instruments. A standardized Pandora network has been set up by NASA (Herman et al., 2009; Tzortziou et al., 2014; Pandora project: http://pandora.gsfc.nasa.gov, last access: X29 October 2020) and extended by ESA and LuftBlick to form the PGN (Pandonia Global Network; http://www.pandonia-global-network.org/, last access: 29 October 2020). Pandora data used in this study originate mostly from the original NASA network, which includes more than 60 different sites covering different time periods (mostly campaign-based). In total, 15 Pandora direct sun instruments delivering at least 3 months of data have been considered here. They are listed in Table 3 with an indication of their location, ownership, availability (see also Fig. S2 in the Supplement) and references. Pandora instruments are generally operated in polluted areas (urban or suburban); however the network also contains a few background/remote sites located in Europe, Asia and the United States. Valid data were selected for a normalized root-mean square of weighted spectral fitting residuals (WRMS) of less than 0.005; uncertainty in ${NO}_{2}$ retrievals less than 0.05 DU was kept (Alexander Cede, personal communication, 2015).

Table 3

Direct sun instruments measuring total ${NO}_{2}$ VCD included in this study (16 stations).

Station/country (lat/long)	Station type	Owner/group	Time period	Instrument type	Reference
FMI, Helsinki/Finland (60.20 $^{\circ}$ N, 24.96 $^{\circ}$ E)	Urban	NASA and FMI	September 2011–June 2013	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
Harvard/USA (42.67 $^{\circ}$ N, 71.12 $^{\circ}$ W)	Urban	NASA	November 2014–August 2015	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
Thessaloniki/Greece (40.63 $^{\circ}$ N, 22.96 $^{\circ}$ E)	Urban	AUTH	January 2011–May 2014	PHAETON (direct sun mode)	Kouremeti et al. (2013), Drosoglou et al. (2017)
Boulder/USA (39.99 $^{\circ}$ N, 105.26 $^{\circ}$ W)	Urban	NASA	December 2013–August 2015	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
Beijing/China (39.98 $^{\circ}$ N, 116.38 $^{\circ}$ E)	Urban	BIRA-IASB	July 2008–April 2009	MAX-DOAS (direct sun mode)	Clémer et al. (2010), Hendrick et al. (2014), Vlemmix et al. (2015)
GSFC/USA (38.99 $^{\circ}$ N, 76.84 $^{\circ}$ W)	Urban	NASA	May 2009–August 2015	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
NASA HQ/USA (38.88 $^{\circ}$ N, 77.01 $^{\circ}$ W)	Urban	NASA	August 2012–August 2015	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
Seoul/South Korea (37.59 $^{\circ}$ N, 126.93 $^{\circ}$ E)	Urban	NASA	March 2012–August 2015	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
Busan/South Korea (35.24 $^{\circ}$ N, 129.08 $^{\circ}$ E)	Urban	NASA	March 2012–May 2015	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
UHMT/USA (29.72 $^{\circ}$ N, 95.34 $^{\circ}$ W)	Urban	NASA	March 2012–April 2015	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
Xianghe/China (39.75 $^{\circ}$ N, 116.96 $^{\circ}$ E)	Suburban	BIRA-IASB	March 2010–August2018	MAX-DOAS (direct sun mode)	Hendrick et al. (2014), Vlemmix et al. (2015)
Langley/USA (37.10 $^{\circ}$ N, 76.39 $^{\circ}$ W)	Suburban	NASA	January 2010–June 2014	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
SERC/USA (38.88 $^{\circ}$ N, 76.55 $^{\circ}$ W)	Remote	NASA	September 2010–January 2013	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
Four Corners NM/USA (36.80 $^{\circ}$ N, 108.48 $^{\circ}$ W)	Remote	NASA	June 2012–July 2015	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
Izaña/Spain (28.31 $^{\circ}$ N, 16.50 $^{\circ}$ W)	Remote	NASA	January 2013–August 2015	Pandora	Herman et al. (2009), Tzortziou et al. (2014)
Mauna Loa/USA (19.48 $^{\circ}$ N, 155.60 $^{\circ}$ W)	Remote	NASA	November 2014–May 2015	Pandora	Herman et al. (2009), Tzortziou et al. (2014)

Recent detailed studies in US and South Korean sites during DISCOVER-AQ have shown good agreement of Pandora instruments with aircraft in situ measurements, within 20 % on average, although larger differences are observed for individual sites (Choi et al., 2020), the largest discrepancies being found in Texas (Nowlan et al., 2018). Good agreement of a few percent between Pandora and GeoTASO has been reported by Judd et al. (2019), while differences increase when resampling the comparisons for larger simulated pixel sizes, up to about 40 % bias for $18 km \times 18 km$ , similar to the bias found with OMI (50 %).

The Pandora spectrometers provide ${NO}_{2}$ total vertical column observations, with a random uncertainty of about $2.7 \times 10^{14}$ $molec {cm}^{- 2}$ and a systematic uncertainty of $2.7 \times 10^{15}$ $molec {cm}^{- 2}$ (Herman et al., 2009). These account for DOAS fit systematic errors, random noise and uncertainties related to the estimation of the residual gas amount in the reference spectra. In the present study, direct sun tropospheric VCDs are derived from the measured total ${NO}_{2}$ content after subtraction of the stratospheric part estimated using satellite data (SAT) (alone or within assimilation scheme; see Sect. 2), interpolated to the geolocation of the Pandora spectrometer: 2 ${VCD}_{tropo} (DS) = {VCD}_{tot} (DS) - {VCD}_{strato} (SAT) .$ Summing the Pandora error uncertainty and the error uncertainty on the stratospheric column in quadrature, this approach leads to an error uncertainty of about $\sim 2.75 \times 10^{15}$ $molec {cm}^{- 2}$ on the tropospheric column from direct sun data. It should be noted that this approach leads to retrieval of the total tropospheric column from the direct sun, while the tropospheric column from MAX-DOAS represents mainly the boundary layer.

4 Comparison method

For the comparison, GOME-2A and OMI data were extracted within a radius of 50 $km$ around the 36 stations listed in Tables 2 and 3, with only pixels having a cloud radiance fraction $< 50 %$ and an ${AMF}_{ratio} ({AMF}_{tropo} / {AMF}_{geom}) > 0.2$ (Boersma et al., 2018) being selected. In the case of OMI, pixels affected by the row anomaly were filtered out (Boersma et al., 2018). As the pointing direction and horizontal sensitivity length are not reported for all ground-based instruments, our baseline approach is to consider only pixels encompassing the station location. However, a sensitivity test has been performed at the Xianghe station (where both parameters are provided in the data files) by selecting all pixels crossing the MAX-DOAS line of sight. Comparison results were found to be close to those from the baseline case, with only 10 additional coincident days.

To reduce the differences in spatial resolution of the satellite measurements (GOME-2A: $40 km \times 80 km$ ; OMI: $13 km \times 24 km$ at best) compared to the ground-based sensitivity (horizontal length of the probed air mass up to $\sim 20 km$ ), the largest pixels from each instrument dataset were removed: only pixels with an across-track width smaller than 100 $km$ for GOME-2A and smaller than 40 $km$ for OMI were kept in the comparisons. Previous studies have investigated the use of stricter coincidence criteria as a way to overcome spatial resolution differences. For example, Irie et al. (2008a) showed differences of up to 25 % in satellite VCD between pixels located 5 to 50 $km$ away from the site, and only OMI pixels centered within ${0.1}^{\circ} \times {0.1}^{\circ}$ of the MAX-DOAS stations were considered in the validation. Other approaches have averaged MAX-DOAS VCDs made in several azimuth directions (Brinksma et al., 2008; Celarier et al., 2008; Ortega et al., 2015) or have excluded MAX-DOAS measurements with a relative uncertainty $\geq 10$ % (Vlemmix et al., 2010).

Ground-based (GB) MAX-DOAS data were interpolated to the satellite overpass time, and a verification of the presence of data within $\pm 1 h$ was performed in order to avoid large interpolation errors. Pandora direct sun measurements have a much higher acquisition rate (approximately 30 acquisitions per hour compared to typically one to four MAX-DOAS measurements) with sometimes strong ${NO}_{2}$ variations not perfectly removed with the data filtering, so Pandora measurements within 1 $h$ ( $\pm 30 \min$ ) of the satellite overpass time were averaged. On this basis, in addition to the daily comparisons at each station, corresponding monthly averages were also compared.

As an example, Fig. 2 shows the results of our analysis for the Xianghe MAX-DOAS site. Pollution episodes are well captured by both GOME-2A and OMI as well as seasonal variations characterized by high ${NO}_{2}$ VCDs in winter and low values in summer. Quantitatively, the comparison of the whole time series is good, with correlation coefficient $R$ values of 0.88 and 0.94 and linear regression slopes of about 0.79 and 0.93, for the monthly GOME-2A and OMI data respectively. VCDtropo differences (SAT–GB in $\times 10^{15}$ $molec {cm}^{- 2}$ ) and percent relative difference $(100 \cdot (SAT - GB)) / GB$ in %) were calculated for each site. For Xianghe the median bias is about $- 2 \times 10^{15}$ $molec {cm}^{- 2}$ ( $- 8$ %) and $0.7 \times 10^{15}$ $molec {cm}^{- 2}$ ( $- 4.4$ %) for GOME-2A and OMI data respectively. Values for each site are reported in Table S1 in the Supplement for GOME-2A and OMI, with daily and monthly statistics for correlation coefficient $R$ , slope $S$ and intercept $I$ of a linear regression and mean and median monthly absolute and relative biases. Depending on the length of the ground-based time series, the number of daily comparison points can vary significantly, from at least 3 months of data to several years of continuous measurements.

Figure 2

Comparison of monthly mean tropospheric ${NO}_{2}$ VCDs at the Xianghe station for (a) GOME-2A GDP 4.8 data and (b) OMI DOMINO v2.0 versus MAX-DOAS data, over the period March 2010 to July 2017. Correlation coefficients $R$ are given as an inset in the scatterplots on the right column. The variability (standard deviation of the monthly mean) is given as error bars for both datasets.

[Figure omitted. See PDF]

5 Results

5.1 Overview of the ground-based datasets

Figure 3 presents an overview of the tropospheric and stratospheric ${NO}_{2}$ columns measured at each station, as obtained from the satellite-to-ground-based coincidences. The tropospheric columns correspond to the ground-based data as selected in Sect. 4 (including, for the direct sun case, the subtraction of the satellite-estimated stratospheric content; see Sect. 3), while the stratospheric columns are the satellite estimations. As can be seen from the box-and-whisker plot, the tropospheric content varies strongly among the stations, the observed median columns ranging from $1 \times 10^{15}$ $molec {cm}^{- 2}$ in rural places (Hohenpeissenberg, Réunion, Cape Hedo, Mauna Loa, Izaña) to about 30 to $40 \times 10^{15}$ $molec {cm}^{- 2}$ in highly polluted sites (Beijing, Seoul, Beijing-CMA). As can also be seen, tropospheric columns selected at GOME-2A overpass times (i.e., in the morning) are usually larger than those selected at OMI overpass time (13:30 $\pm$ 0:90), which is explained by lower OH levels and somewhat higher ${NO}_{x}$ emissions, leading to slower ${NO}_{2}$ chemical loss mid-morning (09:30) compared to noon (13:30) (Boersma et al., 2008; Kim et al., 2009). Note that the median tropospheric column is negative at the mountaintop stations of Izaña and Mauna Loa. This is either caused by a slight underestimation of the Pandora total columns or a slight overestimation of the stratospheric columns derived from satellite. This discrepancy is under investigation and will be the subject of a future study.

Figure 3

${NO}_{2}$ columns at the various ground-based stations (MAX-DOAS in panels a, b and direct sun in panels c, d). (a) Box-and-whisker plot of the ground-based tropospheric ${NO}_{2}$ columns (obtained by subtracting the satellite VCDstrato in the case of direct sun data) and (b) box-and-whisker plot of the stratospheric ${NO}_{2}$ content derived from satellite instruments. OMI data in green; GOME-2A data in dark red. The box-and-whisker plots are defined as follows: crosses for the mean values, horizontal lines for the median, boxes for the 25 and 75 percentile and vertical lines for the 9 and 91 percentile. Stations are ordered by increasing values of the VCDtropo columns.

[Figure omitted. See PDF]

Due to different deployment strategies, the direct-sun-measuring instruments (especially Pandora instruments) were located closer to strong ${NO}_{2}$ emission sources than MAX-DOAS instruments that sample both polluted and background sites. The MAX-DOAS ensemble of stations measured ${NO}_{2}$ tropospheric columns in the 2 to $20 \times 10^{15}$ range (about 18 MAX-DOAS stations and 10 direct sun stations). Moreover, being able to also measure under partially cloudy conditions, MAX-DOAS sites tend to sample the full variability of the ${NO}_{2}$ field at measurement sites, while direct sun data preferentially sample clear-sky conditions. As a result, MAX-DOAS sites tend to display a larger variability, as can be judged from the larger boxes (25 % to 75 %) and lines (9 % to 91 %) in the box-and-whisker plots of Fig. 3a.

Figure 3b presents the stratospheric columns derived from the two satellites. Values typically range between $2 \times 10^{15}$ and $3.5 \times 10^{15}$ $molec {cm}^{- 2}$ . The difference of about 0.6 (up to 1) $\times 10^{15}$ $molec {cm}^{- 2}$ between the GOME-2A and OMI data is consistent with the known diurnal variation of the stratospheric ${NO}_{2}$ , which results from the $NO$ $/$ ${NO}_{2}$ equilibrium and the progressive photodissociation of $N_{2} O_{5}$ during the day (Dirksen et al., 2011; Belmonte Rivas et al., 2014; van Geffen et al., 2015). Minimum values of the stratospheric column are obtained over the equatorial sites (Nairobi, Bujumbura and Mauna Loa).

The validity of the tropospheric estimation approach applied to the direct sun data (see Sect. 3.2 and Eq. 2) was verified at stations where both MAX-DOAS and direct sun measurements are performed. This is the case for three sites: Beijing, Xianghe and Thessaloniki. Combining these three datasets, Fig. 4 displays a scatterplot of the tropospheric ${NO}_{2}$ columns measured by both techniques. Results are shown separately for GOME-2A and OMI overpass times. In both cases, a high level of correlation is obtained (linear correlation coefficient $> 0.95$ ). The corresponding linear regression slopes are $1.09 \pm 0.02$ and $1.06 \pm 0.01$ for OMI and GOME-2A overpasses respectively, with intercepts of $- 3.5 \times 10^{15}$ and $- 0.6 \times 10^{15}$ $molec {cm}^{- 2}$ . These results suggest that MAX-DOAS and direct sun data show a small relative bias of about 10 %–15 %. Part of this bias, which could change depending on pollution levels, may arise from the satellite-based stratospheric correction applied to direct sun data. However, it should be noted that MAX-DOAS and direct sun measurements are not synchronized, with typical differences in measuring time of about half an hour for these stations. The ${NO}_{2}$ variability (which can be large in polluted sites) therefore probably contributes to the observed scatter and apparent bias. Furthermore, MAX-DOAS and direct sun instruments observe different air masses, which might lead to differences in the presence of horizontally inhomogeneous air masses.

Figure 4

MAX-DOAS and direct sun tropospheric ${NO}_{2}$ columns in Thessaloniki, Xianghe and Beijing. At these sites, ground-based measurements are performed in both geometries.

[Figure omitted. See PDF]

Another approach to verify the consistency of the ground-based dataset is to investigate the coherence between measurements at sites that are geographically close to each other. For example, NASA-HQ and GSFC are very close to each other, but measurements were performed by different Pandora instruments and during different time periods. Their median VCDtropo differences for the overlapping days are about 4.4 and $7.8 \times 10^{14}$ $molec {cm}^{- 2}$ at the OMI and GOME-2A overpasses respectively, in line with the expected uncertainty/variability of these ground-based data. Beijing and Beijing-CMA sites are interesting to compare since both are located inside the city, at a mutual distance of about 6 $km$ . The first instrument has been measuring on the roof of the Institute of Atmospheric Physics (IAP) (Clémer et al., 2010), the second at the China Meteorological Administration (Ma et al., 2013). Both instruments have already been compared in Hendrick et al. (2014), showing good agreement (differences of about $- 2$ % in winter and 3 % to 4 % for the rest of the period). When comparing their columns for the satellite's colocations, they present differences of about 1.7 and $6 \times 10^{15}$ $molec {cm}^{- 2}$ at OMI and GOME-2A overpass times, respectively (12 % to 15 %). Another example is Chiba and Yokosuka. Both of these sites are situated in the urban area of Tokyo Bay but at about 53 $km$ distance from each other. Their median differences from OMI and GOME-2A are 5.7 and $14.2 \times 10^{15}$ $molec {cm}^{- 2}$ respectively (69 % to 82 %).

5.2 Comparison of ground-based and satellite datasets

The comparison methodology illustrated in Fig. 2 has been extended to the 23 MAX-DOAS and 16 direct sun stations gathered in this study. As expected, results show a clear dependence on the location of the comparison site. The best agreement is obtained in background/remote conditions, while comparisons are more challenging close to the sources, where the ${NO}_{2}$ field is more heterogeneous (Chen et al., 2009; Irie et al., 2012; Ma et al., 2013; Pinardi et al., 2014). To illustrate this point, the different stations have been qualitatively classified by the station PIs into urban, suburban and background sites (see Tables 2 and 3), based on their location with respect to known pollution sources. This classification is not based on ${NO}_{2}$ levels but reflects the influence of the surrounding areas. For example, Xianghe station is in a polluted background with high ${NO}_{2}$ levels (see Fig. 3), but it is located at a relatively large distance from surrounding urban areas and is thus classified as suburban.

Figure 5 presents monthly mean scatterplots of the GOME-2A GDP 4.8 data against ground-based measurements at the different stations. Different sites are plotted in different colors, and results are grouped separately for MAX-DOAS and direct sun data as well as for urban and background/suburban stations. As can be seen, satellite and ground-based data generally correlate well, with correlation coefficients ranging between 0.75 and 0.96 and linear regression slopes between 0.37 and 0.83. For more details on the statistical analysis of the regressions, see Table 4. It is clear that smaller slopes, larger biases and larger root mean square (rms) values are found at urban locations compared to background/suburban ones. Note also that smaller biases are obtained for OMI than for GOME-2A in all cases except for the case of the comparisons against direct sun data in background/suburban sites, where the differences among the two satellites are small (about $- 19.6 %$ and $- 21.3 %$ ).

Figure 5

Scatterplot of GOME-2A GDP 4.8 ${NO}_{2}$ tropospheric columns with respect to MAX-DOAS instruments (a, b) and direct sun instruments (c, d). Panels (a, c) display background and suburban stations, while urban stations are represented in (b, d). Linear regression values are given as an inset for each case (correlation coefficient $R$ , slope $S$ and intercept $I$ ), and the number of months for each station is given in brackets in the legend. Pixel selection: GOME-2A pixel size $< 100 km$ (i.e., removing backscans) over the stations.

[Figure omitted. See PDF]

Table 4

Statistics of the monthly median comparisons per station type for the satellite baseline (small pixel over station) versus ground-based comparisons. Linear regression slope $S$ and intercept $I$ are presented.

		$R$	$S$	$I$	Bias (SAT–GB)	Bias	rms
				( $\times 10^{15}$ $molec {cm}^{- 2}$ )			( $\times 10^{15}$ $molec {cm}^{- 2}$ )
MAX-DOAS comparisons
Suburban and remote	GOME-2A	0.92	0.8	$- 0.36$	$- 0.97$	$- 36$ %	4.33
Suburban and remote	OMI	0.81	0.61	1.37	$- 0.2$	$- 6.5$ %	5.68
Urban	GOME-2A	0.82	0.47	1.3	$- 0.46$	$- 42$ %	8.78
Urban	OMI	0.86	0.57	1.56	$- 2.8$	$- 29.7$ %	7.88
Direct sun tropospheric comparisons
Suburban and remote	GOME-2A	0.96	0.83	0.48	$- 1.18$	$- 19.6$ %	3.76
Suburban and remote	OMI	0.96	0.80	0.71	$- 1.32$	$- 21.3$ %	3.68
Urban	GOME-2A	0.75	0.37	3.75	$- 2.18$	$- 25.3$ %	10.2
Urban	OMI	0.87	0.70	1.42	$- 0.7$	$- 11.6$ %	5.72

The median relative biases (SAT–GB)/GB at each site are presented as a color-coded map in Fig. 6. Satellite data display a negative bias against ground-based reference data at all stations, except UHMT-Houston, which is a coastal site, highly heterogeneous in nature (Tzortziou et al., 2014; 2015; 2018; Loughner et al., 2014; Martins et al., 2016). Negative biases of about $- 80$ % are observed in Bujumbura and Nairobi, which can be related to the small ${NO}_{2}$ signal and the localized nature of the sources at these sites, combined with a complex orography (Gielen et al., 2017; Compernolle et al., 2020). Systematic uncertainties in the estimation of the stratospheric column in satellite datasets could also contribute to the observed underestimation, considering the overall small tropospheric ${NO}_{2}$ signals at these locations. For example, Valks et al. (2011) have shown that small-scale variations visible in the IFS-MOZART stratospheric ${NO}_{2}$ field could not be captured by the GOME-2A stratosphere–troposphere separation algorithm, due to limitations of the spatial filtering approach. In particular this might be the case at the Izaña and Mauna Loa stations (see Fig. 3a), where the satellite stratospheric column is found to exceed the total column ${NO}_{2}$ derived from ground-based direct sun measurements. Finally, issues related to the use of inadequate ancillary datasets might also affect the accuracy of the satellite ${NO}_{2}$ columns. This can be due to the coarse spatial resolution of models used as a priori information (from 1.875 to 3 $^{\circ}$ here; see Table 1) or their temporal sampling (monthly values from 1997 or daily profiles; see Table 1), leading to unrealistic representation of the sources and errors on the AMF calculation of up to 50 % (Heckel et al., 2011; Lin et al., 2014; Kuhlman et al., 2015; Laughner et al., 2016, 2019; Judd et al., 2019). Also Liu et al. (2020) showed that known uncertainties in albedo climatologies result in ${NO}_{2}$ column uncertainties of 3 %–6 %, while errors in model input are responsible for up to 20 % of error on the retrieved ${NO}_{2}$ columns.

Figure 6

Daily median relative bias at each station for OMI DOMINO v2 and GOME-2A GDP tropospheric ${NO}_{2}$ columns. MAX-DOAS stations are represented with circles and direct sun stations with squares.

[Figure omitted. See PDF]

Looking at the details of the comparison results at each station (Fig. 6 and values in Table S1 in the Supplement), we find that GOME-2A and OMI present a similar behavior at a significant number of stations. Biases, however, tend to be slightly larger for GOME-2A. For example, in the megacity of Beijing, the median monthly mean bias is $- 32$ % for OMI and $- 42$ % for GOME-2A when considering direct sun cases, $- 24$ % and $- 45$ % for the Beijing MAX-DOAS case and $- 33$ % and $- 49$ % for the Beijing-CMA MAX-DOAS case. In Xianghe, which is a suburban site, the biases are smaller ( $- 4$ % and $- 8$ % for MAX-DOAS), as expected. Table S1 provides a complete overview of the monthly bias results obtained when comparing OMI and GOME-2A to MAX-DOAS and direct sun instruments. Aside from the stations showing coherent validation results for OMI and GOME-2A (about 9 out of 16 direct sun sites and 8 out of 23 MAX-DOAS sites with differences in the satellite-to-ground validation results bias of less than 15 %), others are characterized by much larger differences, especially in remote sites such as OHP, Réunion, Cape Hedo, Fukue, Tsukuba and Bujumbura. A few mountaintop or high-altitude sites present very large relative biases, such as Nairobi (about $- 80$ %), Mauna Loa (about $- 60$ %) and Izaña ( $- 200$ % to $- 210$ %). At Réunion and Bujumbura, only GOME-2A results display large biases ( $- 76$ % compared to 5 % for Réunion, and $- 84$ % compared to $- 46$ % for Bujumbura). Significant differences between ground-based MAX-DOAS and both OMI QA4ECV and OMI NASA were also reported by Compernolle et al. (2020) in OHP, Bujumbura, Nairobi and Mainz.

However, for some of these stations, these results only rely on a very small subset of comparison points (5 $d$ for OMI comparisons at Mauna Loa, 14 $d$ for Thessaloniki direct sun, 3 $d$ for Nairobi, 11 $d$ for Réunion, 12 $d$ for Hohenpeissenberg), and in the next section we test the impact of relaxing the comparison criteria, to select the closest pixel per day, within the maximum radius of 50 $km$ .

5.3 Impact of the satellite pixel selection

As to be expected, for a large number of stations, selecting pixels that do not contain the stations increases the comparison statistics but also changes the comparison results. This is especially the case for OMI. The change in coincidence selection is presented in Table S1 for each station. The following conclusions can be drawn for OMI.

-
Direct sun measurements: for 9 sites out of 16 there is a significant (more than 5 %) difference between results obtained using all the pixels and only those intersecting the stations. For six of them, the median bias is strongly increased: Seoul (from $- 4$ % to $- 29$ %), Boulder (from $- 36$ % to $- 54$ %), GSFC (from 6.2 % to $- 8.5$ %), Harvard (from $- 12$ % to $- 29$ %), Four Corners (from $- 7$ % to $- 17$ %) and Mauna Loa (from $- 60$ % to $- 120$ %). At three sites, it is reduced: Izaña (from $- 210$ % to 190 %), FMI (from 90 % to $- 31$ %) and UHMT (43 % to 15 %).
-
MAX-DOAS measurements: for 15 sites out of 23 there is a significant (more than 5 %) difference between results obtained using all the pixels and only those intersecting the stations. For 10 of them, the median bias is larger: Athens (from $- 38$ % to $- 48$ %), Bremen (from $- 8$ % to $- 36$ %), Gwangju (from $- 34$ % to $- 44$ %), Kasuga (from $- 44$ % to $- 52$ %), Réunion (from 5 % to 14 %), Uccle (from $- 16$ % to $- 28$ %), Beijing (from $- 24$ % to $- 39$ %), Thessaloniki (from $- 30$ % to $- 44$ %) and OHP (from $- 12$ % to $- 19$ %). For five of the sites, the bias is improved: Hohenpeissenberg (from 17 % to $- 1.3$ %), Tsukuba (from $- 6$ % to 3 %), Bujumbura (from $- 46$ % to $- 31$ %) and Fukue (18 % to $- 6.8$ %).

At most stations, the stricter colocation criterion results in smaller biases (by up to

\sim 20 %

). In order to better understand the impact of changing the pixel selection criteria, additional tests were performed for two megacities characterized by extremely high

{NO}_{2}

levels (see Fig. 3).

Figure 7 illustrates, for Beijing, Beijing-CMA, Xianghe and Seoul, the impact of making different choices on the OMI pixel size and location. For the most strict selection criterion (OMI pixels smaller than 40 $km$ and located above the stations), we see a significant smaller bias and spread of the comparison in Seoul for direct sun data and only a slight difference in the median bias for the Beijing/Beijing-CMA data. For Xianghe, the impact appears to be moderate or even negligible, as expected due to the suburban nature of this site. Differences in the results for the two Beijing sites are to be considered in light of the different measurement times (Table 1) and ${NO}_{2}$ levels (Fig. 3): measurements in Beijing (median ${NO}_{2}$ of about $20 \times 10^{15}$ $molec {cm}^{- 2}$ ) were performed in 2008–2009 during the Olympic Games, while measurements at the CMA building (median of $35 \times 10^{15} molec {cm}^{- 2}$ ) covered the period from 2009 to 2011. For Seoul, where measurements were performed in 2012–2015 (median ${NO}_{2}$ of $35 \times 10^{15} molec {cm}^{- 2}$ ), the metropolitan area extends over more than 11700 ${km}^{2}$ . In this case, as can be seen in Fig. S23 in the Supplement, the ${NO}_{2}$ signal is inhomogeneously spread over the city, and the instrument is not centered at the maximum of the satellite ${NO}_{2}$ observations. As a result, the selection of pixels in strict overpass with the site has a larger impact than for Beijing, where the MAX-DOAS instrument is located in the center of the city (Fig. 7). This is in line with the findings of Duncan et al. (2016). Analyzing OMI data over the period from 2005 to 2014, they found a complex spatial distribution of the ${NO}_{2}$ trends characterized by a decrease in the Seoul metropolitan area and an increase outside of the city center. The heterogeneity of changing emissions leads to a high dependence of the trend calculation across the city (change from about $- 30$ % to $+ 10$ %). For the Beijing case, Duncan et al. (2016) also showed a reduction of the tropospheric ${NO}_{2}$ (by about $- 10.3$ % from 2005 to 2014), with a minimum in 2008 at the time of the Olympic Games.

Figure 7

Impact of the OMI pixel size (pixels smaller than 100 and 40 $km$ in grey and black respectively) and with filtering on pixels only above the station (blue) on the differences' deviation between satellite and ground-based data at a few stations: Xianghe, Beijing, Beijing-CMA and Seoul. The number of comparison points is indicated on top with the corresponding colors. The box-and-whisker plots are defined as follows: crosses for the mean values, horizontal lines for the median, boxes for the 25 and 75 percentile and vertical lines for the 9 and 91 percentile.

[Figure omitted. See PDF]

Figure 8 summarizes the change in biases for the station ensemble, for the three pixel selection cases presented for OMI. As can be seen, restricting the comparison to small pixel sizes (from 100 to 40 $km$ ) improves the median bias, and it reduces the comparison spread. Further focusing on pixels in strict overpass with the stations, the spread is also reduced, but the median bias not so much, at the expense of a large number of comparison days.

Figure 8

Box-and-whisker plot of the daily OMI DOMINO v2.0 biases for all the stations and for different possibilities of pixel size selection (pixels smaller than 100 $km$ in grey, smaller than 40 $km$ in black and with filtering on pixels only above the station in blue). First row: ensemble of MAX-DOAS stations; second row: ensemble of direct sun stations. The box-and-whisker plots are defined as in Fig. 7. The number of comparison points for each case is shown in the corresponding color.

[Figure omitted. See PDF]

For GOME-2A (not shown), both of these effects are much smaller, as the pixel side size is always about 80 $km$ , and as such, when the pixel center is within 50 $km$ radius, usually part of the pixel covers the station.

When considering the results as a whole, the most prominent feature is the systematic underestimation of ground-based data by both satellite datasets for most of the sites. This underestimation is mostly prominent at urban sites close to the sources, but it is also found at background/suburban sites and cannot be fully explained by the satellite uncertainties (see Sect. 2). The differences observed between OMI and GOME-2A can be related to instrumental characteristics (e.g., differences in pixel size) but also to details of the applied retrieval methods (see Table 1 and Sect. 2). Several studies have discussed in detail the impact of algorithmic differences on the ${NO}_{2}$ column uncertainty, which can reach 42 %, mainly due to tropospheric AMF uncertainties (Lorente et al., 2017). The underestimation of the ${NO}_{2}$ satellite products identified here at a large number of stations confirms what was obtained in previous validation exercises using fewer sites and different satellite products (Celarier et al., 2008; Brinksma et al., 2008; Vlemmix et al., 2010; Irie et al., 2008a, 2012; Lin et al., 2014; Halla et al., 2011; Shaiganfar et al., 2011; Ma et al., 2013; Kanaya et al., 2014; Wang et al., 2017b; Mendolia et al., 2013; Tzortziou et al., 2014; Lamsal et al., 2014; Drosoglou et al., 2017; Herman et al., 2019; Judd et al., 2019; Compernolle et al., 2020). These studies generally reported small negative or positive biases over rural (unpolluted) measurement sites and stronger (systematic) negative biases over urban polluted sites.

One way to understand these results is to consider the impact of the spatial resolution of the satellite measurements. For the case of rural sites, coincident satellite pixels can include areas with higher ${NO}_{2}$ columns, leading to positive biases in the comparisons. In contrast at urban locations characterized by strong ${NO}_{2}$ sources, coincident pixels generally tend to include surrounding (suburban) areas. This effect is especially significant for satellite instruments measuring at coarse spatial resolution, such as GOME-2A. It can be attenuated in validation studies making use of long time periods and many stations; however large localized ${NO}_{2}$ concentrations will always tend to be underestimated. This is particularly true for satellite instruments characterized by horizontal resolution much coarser than the size of typical urban agglomerations (see Table 1). Note that the effect can be somewhat mitigated in the case of satellite retrievals using a priori profiles specified at high temporal and spatial resolution (Huijnen et al., 2010; Russell et al., 2011; Heckel et al., 2011; Lin et al., 2014; McLinden et al., 2014; Kuhlmann et al., 2015; Laughner et al., 2019; Goldberg et al., 2017; 2019). In the next section, we present an attempt to quantify the smearing effect around urban sites and use it to extend the validation pixel selection method, in order to increase the comparison statistic.

6 Horizontal dilution effects

In order to investigate the horizontal variability of the ${NO}_{2}$ field at the 36 different stations, 1 full year (2005) of the OMI ${NO}_{2}$ QA4ECV dataset v1.1 (Boersma et al., 2018) was extracted to map the average ${NO}_{2}$ column distribution at a grid of $0.025^{\circ} \times 0.025^{\circ}$ in latitude–longitude. Such highly resolved gridded maps were obtained using a realistic representation of the OMI point spread function allowing the native OMI pixels to be subsampled (Sihler et al., 2017). Only the smallest OMI pixels (rows 11 to 49) were retained for this analysis. Corresponding high-resolution grids were used to quantify the systematic change in tropospheric ${NO}_{2}$ between the position of the satellite pixels and the location of the stations, what we call hereafter the “dilution effect”. The approach used here is an extension of a similar method introduced by Chen et al. (2009) and Ma et al. (2013) based on high-resolution city night light maps used as a proxy for ${NO}_{2}$ sources. Judd et al. (2019) also accurately quantified this effect in the New York area using airborne ${NO}_{2}$ mapping data from the GeoTASO instrument. In our approach, the variation of the tropospheric ${NO}_{2}$ VCD is sampled in concentric circles of different radii around each of the stations. Figure 9 illustrates the method for the Beijing (urban, Fig. 9a) and Xianghe (suburban, Fig. 9c) sites, which both present strongly inhomogeneous ${NO}_{2}$ fields. Figure 9b and d show the ${NO}_{2}$ VCD variation in concentric circles around the stations. In Beijing, the ground-based instrument is located close to the urban ${NO}_{2}$ hotspot, so that the ${NO}_{2}$ level decreases rapidly outwards. In contrast, a different behavior is found at the Xianghe station, which is located about 60 $km$ to the east of the city center of Beijing. In this case, due to the influence of the surrounding emission sources, the mean ${NO}_{2}$ column tends to slightly increase when moving away from the site in the direction of Beijing. For background sites, one expects the ${NO}_{2}$ content to remain roughly constant around the station value. Horizontal variability effects have been documented in previous studies dealing with ozone and water vapor (Lambert et al., 2013; Verhoelst et al., 2015), as well as with tropospheric ${NO}_{2}$ (Irie et al., 2012; Duncan et al., 2016; Kim et al., 2016; Boersma et al., 2018), mostly to illustrate the impact of collocation mismatch errors on validation results. In our study, we propose a correction method applied to satellite data, which aims at reducing the impact of the smearing effect on comparisons.

Figure 9

Dilution effect illustration for a typical urban (Beijing, a, b) and suburban (Xianghe, c, d) case. Panels (a, c) represent the 2005 yearly mean tropospheric ${NO}_{2}$ gridded from OMI QA4ECV data at the resolution of 0.025 $^{\circ}$ latitude $\times$ 0.025 $^{\circ}$ longitude. The black dot indicates the station location, the two circles denote 50 and 100 $km$ radii around the station and the red box represents the outer extent of any $80 km \times 40 km$ GOME-2A pixels whose centers are within the 50 $km$ radius. Panels (b, d) display the mean (black) and median (red) ${NO}_{2}$ values at increasing colocation radii (expressed in kilometers), with the variability (1 standard deviation) given as an error bar around the mean.

[Figure omitted. See PDF]

6.1 Dilution correction method

Similarly to the studies of Chen et al. (2009) and Ma et al. (2013), a correction factor is calculated to quantify the change in ${NO}_{2}$ between the ground-based site and the satellite pixel location. In our approach, the dilution factor ( $F_{dil}$ ) is obtained from the OMI gridded files by taking the ratio between the average (mean or median) ${NO}_{2}$ VCD at increasing distances from the site and the VCD value at the site. A second-order polynomial is then fitted to these ratio values as illustrated in Fig. 9 (panels b and d). Accordingly, $F_{dil}$ is calculated using the following equation, where $R$ represents the distance from the site:

3 $F_{dil} (R) = {NO}_{2}_VCD (R) / {NO}_{2}_VCD (0) .$ In practice, $F_{dil}$ is calculated as the median values of the gridded ${NO}_{2}$ field for values of $R$ from 0 to 50 $km$ . For sites showing a negative slope in the dilution factor (i.e., a clear dilution effect; see Figs. S3 and S6 to S30 in Supplement), a dilution correction (DC) is applied to the satellite data according to 4 $VCDsat_DC = VCDsat / F_{dil} (R) .$ This correction is applied to individual satellite measurements according to their respective distances. Typically, it is applied to large urban sites, stations isolated on small islands such as Réunion Island (Fig. S18 in the Supplement), Izaña (Fig. S15 in the Supplement) and Mauna Loa (Fig. S27 in the Supplement), stations close to a large power plant such as Four Corners (Fig. S11 in the Supplement) and generally speaking sites characterized by a ${NO}_{2}$ hotspot surrounded by a clean area. The stations where a dilution correction was applied are (from north to south) Helsinki FMI, Bremen, De Bilt, Uccle, Mainz, Harvard, Thessaloniki, Boulder, Beijing, Beijing-CMA, NASA-HQ (headquarters), GSFC, Athens, Seoul, Yokosuka, Langley, Four Corners (New Mexico), Chiba, Busan, Gwangju, Kasuga, UHMT, Izaña (IZO), Mauna Loa and Réunion Island (Le Port station). This ensemble is referred to as UIPP (urban, island and power plant) in the rest of the paper.

6.2 Impact of the dilution correction

The improvement brought by the dilution correction is illustrated in Fig. 10, where the slopes of the linear regressions from daily scatterplots are presented for each station separately with and without dilution correction. In order to limit the impact of outliers (especially the large columns that strongly affect the regression analysis), daily comparison points are filtered for values larger than the 75th percentile of the ground-based values of each station. This selection excludes large local values that cannot be captured by satellite measurements and allows for a more robust statistical regression analysis. In each panel, the case denoted “all” corresponds to a combined analysis including the data from all stations together. This is different than averaging the stations' slopes, as the different sites have a varying number of points. After application of the dilution correction, regression slopes improve (and come closer to unity) for all cases except De Bilt. However, for some sites, there seems to be an overcorrection effect (Athens/GOME-2A, UHMT/GOME-2A, Beijing (both sites)/OMI and Réunion/OMI), while a negative slope is obtained at a few other sites (e.g., Mauna Loa/GOME-2A and Réunion/GOME-2A). As already discussed in Sect. 5.1, for direct sun stations this could be related to issues with the determination of stratospheric columns in the satellite algorithm. UHMT is a peculiar site, where several studies performed during the DISCOVER-AQ 2013 Texas campaign (Nowlan et al., 2018; Choi et al., 2020) suggested that those Pandora ${NO}_{2}$ measurements tend to be too low. Finally, some sites (e.g., Nairobi, Bujumbura, Thessaloniki, Izaña) display very small slopes, probably due to the fact that these sites are characterized by very local sources or by nonsymmetric ${NO}_{2}$ distributions. This is clearly the case for isolated islands where the ${NO}_{2}$ can be locally trapped due to orography (see Figs. S19, S22, S24 in the Supplement).

Figure 10

Bar plot of the daily regression slopes at each station for the original (black bars) and the dilution-corrected data (red bar, for the UIPP stations). In order to reduce the weight of large columns on the regression line and to remove local effects, data are filtered to keep only points smaller than the 75 percentile. (a) GOME-2A GDP vs. MAX-DOAS stations, (b) OMI DOMINO v2.0 vs. MAX-DOAS stations, (c) GOME-2A GDP vs. direct sun stations and (d) OMI DOMINO v2.0 vs. direct sun stations.

[Figure omitted. See PDF]

An alternative dilution correction approach taking into account the geographical extent of the satellite pixel and its localization in the ${NO}_{2}$ field has been tested. In order to estimate an uncertainty on our correction method, we applied this modified scheme to two extreme urban cases (Beijing and UHMT) and two moderate cases (Xianghe and Uccle). Differences amounting to about half the value of the current dilution correction are obtained.

Figure 11 displays monthly scatterplots of GOME-2A and ground-based data for all the UIPP stations, i.e., those at which a dilution correction was applied. Data points corresponding to values larger than the 75 percentile are represented as grey points. The two upper plots show results without correction for MAX-DOAS (left) and direct sun (right) datasets, while corrected data are represented similarly in the lower plots. Again, the impact of the dilution correction is clearly apparent. The regression slope increases from 0.52 to 0.76 for MAX-DOAS and from 0.67 to 1.1 for direct sun data. The impact of excluding the largest columns from the regression analysis can be judged by comparing the grey and black lines, respectively obtained without and with filtering. As can be seen, direct sun data are more affected by this filtering (slope increase from 0.38 to 0.67) than MAX-DOAS ones (slope increase from 0.49 to 0.52). This is likely related to the fact that, as already mentioned, direct sun instruments (especially Pandora instruments) tend to be located closer to strong ${NO}_{2}$ emission sources than MAX-DOAS instruments. Other potential reasons are (1) the higher uncertainty in determining the true ${NO}_{2}$ column amount in the reference spectrum and (2) the more spatially localized direct sun measurements, especially at high sun. Moreover, the Pandora DOAS analysis is performed with the ${NO}_{2}$ absorption cross section at a temperature corresponding to the effective temperature of 254 $K$ , while MAX-DOAS is typically analyzed for a temperature of 298 $K$ . Spinei et al. (2014) showed that at polluted sites during hot summer months this could result in 5 %–10 % of underestimation in ${NO}_{2}$ total column derived from the direct sun data compared to the retrieval results at the true effective temperature.

Figure 11

Scatterplot of monthly mean GOME-2A GDP 4.8 ${NO}_{2}$ columns versus UIPP ground-based stations (MAX-DOAS instruments in (a, c) and direct sun instruments in (b, d)). Panels (a, b) present the original comparisons, and panels (c, d) those after applying the dilution correction. Calculations of the monthly mean values are performed after removal of the daily ground-based points larger than the 75 percentile of each station dataset. The monthly means without the filtering are presented in grey to illustrate the impact, and the number of remaining months for each station is given in brackets in the legend. Linear regression values are shown on each plot.

[Figure omitted. See PDF]

Table 5 lists the statistical parameters from regression analyses performed with and without the dilution correction for all the UIPP stations and the different satellite products. Generally speaking, validation results obtained using both MAX-DOAS and direct sun systems appear to be consistent, although direct sun observations tend to agree slightly better with the satellite data. In the case of direct sun data, however, we note that the dilution correction tends to overcorrect satellite measurements (see also Fig. 11), also resulting in slightly larger rms values for the dilution-corrected cases. It is also interesting to note in Table 5 that the intercepts are always positive, which could point to a systematic additive bias, possibly coming from an underestimation of the stratospheric (slant) columns. A bias of about $- 0.2 \times 10^{15}$ $molec {cm}^{- 2}$ has been reported by Compernoelle et al. (2020) when comparing the OMI QA4ECV assimilated stratospheric columns (based on an approach similar to the one used in the OMI DOMINO algorithm) to ground-based zenith-sky data. This bias was reduced to about $- 0.01 \times 10^{15}$ $molec {cm}^{- 2}$ when using the STREAM (Beirle et al., 2016) approach. Investigation of the impact of the smoother STREAM stratosphere on the tropospheric validation results is out of the scope of this study but would be interesting as the small stratospheric errors can be amplified by the AMFs.

Table 5

Statistics of the monthly median comparisons of ground-based with satellite data for UIPP ensembles, before and after the 75 percentile filtering and the dilution correction are applied.

		$R$	$S$	$I$	Bias (SAT–GB)	Bias	rms
				( $\times 10^{15}$ $molec {cm}^{- 2}$ )			( $\times 10^{15}$ $molec {cm}^{- 2}$ )
MAX-DOAS comparisons
All original	GOME-2A	0.83	0.48	0.9	$- 4.77$	$- 44.5$ %	8.63
All original	OMI	0.85	0.56	1.02	$- 3.3$	$- 36.8$ %	7.98
Original filtered	GOME-2A	0.81	0.52	1.16	$- 2.8$	$- 37.3$ %	5.7
Original filtered	OMI	0.8	0.65	0.97	$- 1.63$	$- 26$ %	4.57
All with dilution correction:	GOME-2A	0.84	0.69	0.4	$- 3$	$- 28.5$ %	6.54
All with dilution correction:	OMI	0.85	0.83	0.26	$- 1.45$	$- 17.3$ %	6.56
Filtered with dilution correction:	GOME-2A	0.83	0.76	0.94	$- 1.37$	$- 18.4$ %	4.38
Filtered with dilution correction:	OMI	0.83	0.99	0.5	0.08	1.8 %	4.65
Direct sun tropospheric comparisons
All original	GOME-2A	0.79	0.38	2.9	$- 1.63$	$- 29.4$ %	9.25
All original	OMI	0.74	0.44	2.65	$- 1.11$	$- 28.3$ %	8.66
Original filtered	GOME-2A	0.89	0.67	1.13	$- 0.53$	$- 22$ %	3.59
Original filtered	OMI	0.82	0.67	1.45	$- 0.009$	$- 16.4$ %	4.23
All with dilution correction:	GOME-2A	0.80	0.63	3.62	0.21	$- 5.7$ %	7.54
All with dilution correction:	OMI	0.74	0.72	3.22	0.73	2.36 %	8.43
Filtered with dilution correction:	GOME-2A	0.91	1.11	0.78	1.18	11.1 %	4.05
Filtered with dilution correction:	OMI	0.83	1.11	1.45	1.37	12.8 %	6.1

Considering all the stations together, Fig. 12 presents an overview of the differences between satellite and ground-based datasets, for the original comparisons (in black) and after dilution correction (in red). We make the distinction between two different approaches for the selection of the coincident pixels: closest cloud-free (cloud radiance fraction $< 50 %$ ) pixel and mean value of all cloud-free pixels within a radius of 50 $km$ . Results are also given separately for MAX-DOAS sites (upper plot) and direct sun sites (lower plot).

Figure 12

Box-and-whisker plot of the daily biases for all the stations with (red) and without (black) dilution correction (see Sect. 6.1). First row: ensemble of MAX-DOAS stations; second row: ensemble of direct sun stations. For each row, several cases are shown: closest pixel and mean value within the 50 $km$ radius for OMI DOMINO v2.0 and GOME-2A GDP 4.8. The box-and-whisker plots are defined as in Fig. 7.

[Figure omitted. See PDF]

As can be seen, the overall agreement between satellite and ground-based datasets is better for OMI comparisons, and, after dilution correction, it is slightly better for direct sun than for MAX-DOAS sites. Again, this is likely related to the fact that direct sun instruments (of Pandora type) tend to be located closer to strong ${NO}_{2}$ emission sources. Moreover, as also discussed previously, MAX-DOAS sites report measurements under a larger variability of conditions (both clear-sky and cloudy), leading to an increased spread of the comparisons. Generally speaking the dilution correction pushes biases closer to zero and often reduces the spread of the differences. The best results are obtained with OMI, when comparing direct sun tropospheric columns to the closest pixel of the satellite. In this case, the median bias of $- 1.16 \times 10^{15}$ $molec {cm}^{- 2}$ obtained is reduced to $- 0.23 \times 10^{15}$ $molec {cm}^{- 2}$ after application of the dilution correction. A similar improvement is found for the MAX-DOAS comparisons, from $- 0.95$ to $- 0.47 \times 10^{15}$ $molec {cm}^{- 2}$ . We find that the selection of the daily closest pixel leads to smaller biases and spreads and a better agreement between median and mean values for both OMI and GOME-2A comparisons. Therefore, in the rest of the study, comparison results are exclusively based on coincidences determined using daily closest pixels.

Several sites submitted data for time periods longer than 1 year (see Tables 2 and 3 for details), allowing the seasonal dependence of the comparisons to be investigated. In Fig. 13, seasonally sorted bias values of GOME-2A and OMI against MAX-DOAS measurements are presented for six selected stations (Uccle, OHP, Beijing, Xianghe, Bujumbura and La Réunion). A dilution correction was applied to satellite datasets at three of these sites (La Réunion, Uccle and Beijing). Although comparison results are roughly consistent for all seasons, smaller biases seem to be observed in summer time at several stations of the Northern Hemisphere. This might be related to the shorter lifetime of ${NO}_{2}$ in the warm season and the associated reduced variability of its concentration. As already discussed in Sect. 5, for Bujumbura and Réunion Island, one observes larger negative biases for GOME-2A than for OMI, despite the dilution correction applied in both sites. Note that a large underestimation of QA4ECV OMI ${NO}_{2}$ VCDs was also reported by Compernolle et al. (2020) in Bujumbura. Our validation results do not point to major seasonal effects; however it is general good practice to base validation studies on complete annual cycles in order to properly sample all observational conditions.

Figure 13

Bias (in percent) between daily tropospheric ${NO}_{2}$ columns from satellites, (a) GOME-2A and (b) OMI, and a selection of BIRA-IASB MAX-DOAS stations, for the different seasons. A dilution correction is applied to the satellite data when relevant. The box-and-whisker plots are defined as in Fig. 7.

[Figure omitted. See PDF]

Although the dilution correction improves the agreement between the ground-based and satellite measurements, significant negative biases persist at some of the validation sites (see Fig. 10). This could be related to satellite retrieval issues but also to shortcomings in our correction approach, which relies on average ${NO}_{2}$ fields derived using 1 year (2005) of OMI data. These average fields are not necessarily representative of the actual day-to-day variability at all sites. This certainly contributes to the scatter of the comparisons but should have relatively little systematic effect on regression slopes. Seasonal behavior differences, not taken into account here, could also play a role. Moreover the OMI QA4ECV dataset (Boersma et al., 2018), which has been selected as a source for estimating the correction factors, might have its own limitations. Trends in the last decades in ${NO}_{2}$ values worldwide (Duncan et al., 2016; Georgoulias et al., 2019) can be a limiting factor for some of the stations. Using OMI for the correction also implies that the afternoon ${NO}_{2}$ is representative of the morning GOME-2A overpass, which is not entirely true. Another issue is the limited spatial resolution of OMI data and of its a priori profiles' assumption. High-resolution models (Drosoglou et al., 2017) or airborne imaging DOAS measurements (Judd et al., 2019) could provide a better source of information to correct the ${NO}_{2}$ distributions around the stations, but such data are currently not available at the global scale.

Finally, ground-based instruments are assumed to provide point source measurements, while in reality the horizontal sensitivity area of MAX-DOAS measurements can be as large as several tens of kilometers (Irie et al., 2011). The provision of this information for all ground-based measurements would thus be very valuable to further improve the comparison method. Note that in urban areas, the representativeness of MAX-DOAS observations for comparison with satellite data could be improved by making use of measurements in different azimuth directions (Ortega et al., 2015; Gratsea et al., 2016; Schreier et al., 2019; Dimitropoulou et al., 2020).

7 Overall validation results

Figures 14 and 15 present an overview of the absolute deviations and relative differences between OMI and GOME-2A tropospheric ${NO}_{2}$ column measurements and the reference ground-based MAX-DOAS and direct sun measurements considered in our study. For each sensor, deviations obtained without dilution correction are presented in panel (a), while biases and relative differences after application of the dilution correction are given in panels (b) and (c). For panels (a) and (b), the total median instrumental errors (satellite and ground-based errors summed in quadrature) are also given as grey bars. When comparing the deviation in (a) and (b), the improvement by the dilution correction is clear. One can also see that results obtained using MAX-DOAS and direct sun stations are consistent within the comparison uncertainties. Note that for a few urban sites (e.g., UHMT, Seoul), the dilution correction seems to overcorrect the satellite ${NO}_{2}$ columns, especially for OMI data. This is less clear for GOME-2A, indicating that the correction approach might be slightly too aggressive for the OMI case. It can also be seen that except for a few cases, both satellite data products behave similarly at the different stations. Once corrected for the dilution effect, satellite measurements agree with ground-based data to within 25 % (black dotted lines). The blue lines represent the median bias of satellite measurements against all station data, when including the dilution correction and for ground-based ${VCD}_{tropo} > 2 \times 10^{15}$ $molec {cm}^{- 2}$ . The latter filtering is applied to remove outliers, leading to unphysical mean percent values. Resulting median residual biases are $- 23.5 %$ for GOME-2A and $- 18 %$ for OMI. For the sake of completeness, the same analysis was also performed on QA4ECV v1.1 OMI and GOME-2A datasets, using the same selection criteria. Corresponding figures can be found in the Supplement (Figs. S4 and S5 in the Supplement). Similar results are found, although the QA4ECV products tend to display slightly larger residual bias values, both for the original comparisons and after dilution correction.

Figure 14

Box-and-whisker plot of the daily OMI TEMIS/DOMINO v2.0 biases for each station (a) for the original comparisons and (b, c) when correcting for the dilution effect, in absolute and relative values. MAX-DOAS stations are presented in black; direct sun stations in dark red. The stations are ordered by increasing values of the ground-based VCDtropo, and corresponding values are given on the upper horizontal axis. The box-and-whisker plots are defined as in Fig. 7. In (a, b), grey bars are the $\pm$ comparison error, calculated adding in quadrature the satellite and ground-based VCDtropo errors.

[Figure omitted. See PDF]

Figure 15

Box-and-whisker plot of the daily GOME-2A GDP 4.8 biases for each station (a) for the original data and (b, c) when correcting for the dilution effect, in absolute and relative values. MAX-DOAS stations are presented in black; direct sun stations in dark red. The stations are ordered by increasing values of the ground-based VCDtropo for the satellite overpasses coincidences, and corresponding values are given on the upper horizontal axis. The box-and-whisker plots are defined as in Fig. 7. In panels (a, b), grey bars are the $\pm$ comparison error, calculated by adding in quadrature the satellite and ground-based VCDtropo errors.

[Figure omitted. See PDF]

Figure 16 presents the overall GOME-2A and OMI biases for the different GDP, DOMINO and QA4ECV data products, for satellite pixels in strict coincidence with the stations. In the SAT–GB panel, grey bars present the estimated error on the median bias for each comparison case, estimated as 5 $Err = 2 \cdot MAD / \sqrt{n},$ where $n$ is the number of comparisons of each case (which can be different), and MAD is the median absolute deviation (see Huber, 1981), a robust indicator: 6 $\begin{aligned} MAD = & k \cdot median (abs (SATi - GBi) \\ - median (SATi - GBi)) \end{aligned},$ where $k = 1.4826$ , for a correspondence of MAD with the $1 σ$ SD in case of normal distribution without outliers. We note that the errors on the median values are significantly smaller (around $2 \times 10^{14}$ $molec {cm}^{- 2}$ ) than the median values themselves (a few $1 \times 10^{15}$ $molec {cm}^{- 2}$ ), indicating that the derived residual biases are significant.

Figure 16

Box-and-whisker plot of the daily satellite biases for all stations together, in absolute and relative values. The box-and-whisker plots are defined as in Fig. 7. Red is used for the dilution-corrected data, while black is used for the previously presented products (OMI DOMINO and GOME-2A GDP), and grey is used for the QA4ECV products.

[Figure omitted. See PDF]

Table 6 summarizes the median biases for all the cases. As already stated, the dilution correction improves the validation results for both sensors, by about 10 % to 13 % in total over the station ensemble, with an overall uncertainty due to the method estimated at about 5 %. The impact of relaxing the comparison criteria from only pixels over the stations to the daily closest pixels selection is to increase the bias by 4 % to 6 % for OMI, but it has a negligible effect on GOME-2A (about 2 %), probably due to the large size of the GOME-2A pixels ( $40 km \times 80 km$ ). When considering the best comparison conditions including dilution correction (last column of Table 6), we come to the conclusion that satellite tropospheric ${NO}_{2}$ measurements tend to underestimate ground-based reference data by the following:

-
23 % for GOME-2A GDP4.8
-
39 % for GOME-2A QA4ECV
-
18 % for OMI DOMINO
-
27 % for OMI QA4ECV.

It should be noted that in addition to this relative bias, the previously found positive intercepts and slopes smaller than 1 (see Table 5) could point to a twofold effect, involving a multiplicative error source (e.g., the AMF) and an additive error source (e.g., the stratosphere–troposphere separation). This question should be further investigated in future studies using more extended validation data, in particular of the stratospheric

{NO}_{2}

column (see, e.g., Compernolle et al., 2020).

Table 6

Daily median biases for all the stations together for the baseline (pixels above the stations) and when relaxing the comparison criteria for the original and dilution-corrected comparison (in $molec {cm}^{- 2}$ ). Values are reported after filtering out GBi values smaller than $2 \times 10^{15} molec {cm}^{3}$ .

	Baseline over stations	Closest pixel	DC closest pixel
OMI DOMINO	$- 1.7 \times 10^{15}$ $[- 24 %]$	$- 2 \times 10^{15}$ $[- 30 %]$	$- 1.2 \times 10^{15}$ $[- 18 %]$
OMI QA4ECV	$- 2.2 \times 10^{15}$ $[- 34.4 %]$	$- 2.5 \times 10^{15}$ $[- 38 %]$	$- 1.8 \times 10^{15}$ $[- 27 %]$
GOME-2A GDP	$- 2.8 \times 10^{15}$ $[- 34.2 %]$	$- 2.9 \times 10^{15}$ $[- 36 %]$	$- 2 \times 10^{15}$ $[- 23.5 %]$
GOME-2A QA4ECV	$- 3.7 \times 10^{15}$ $[- 45.6 %]$	$- 3.7 \times 10^{15}$ $[- 48 %]$	$- 2.9 \times 10^{15}$ $[- 39 %]$

8 Conclusions

Tropospheric ${NO}_{2}$ column data from 39 ground-based remote-sensing instruments worldwide were used to validate results from GOME-2A GDP 4.8 and QA4ECV v1.1 and OMI DOMINO v2 and QA4ECV v1.1 data products. Although the ground-based retrievals are not yet fully harmonized at network level, the ground-based datasets are treated coherently for the different stations, and the study illustrates the potential capacity of MAX-DOAS and the direct sun network for tropospheric ${NO}_{2}$ validation. The interest of such a network resides in the large number of stations sampling different pollution levels and scenarios, corresponding to remote, suburban and urban conditions. Typically, suburban polluted stations (e.g., Xianghe) provide the best conditions for the validation of satellite ${NO}_{2}$ , owing to their good representativeness of the size of the OMI or GOME-2A pixel spatial extent. Validation at more remote stations can be challenging due to usually low levels of tropospheric ${NO}_{2}$ , leading to difficulties in the stratosphere–troposphere separation step in the satellite retrieval. Other challenging cases are cities and islands surrounded by a pristine atmosphere, such as Izaña, Réunion Island, Nairobi or Bujumbura, leading to large biases (up to $\sim 80$ %) due to smearing of the local tropospheric ${NO}_{2}$ emissions content in otherwise clean surroundings.

The baseline comparison keeping only satellite pixels covering the stations presents the smaller bias and spread at urban locations and the comparison spread at suburban sites for OMI data. Relaxing the collocation criteria increases the statistics but at the expense of larger biases and spread. Comparisons at urban sites or close to strong ${NO}_{x}$ sources may suffer from smoothing difference errors due to the horizontal dilution of the measured ${NO}_{2}$ field. Therefore, a quantitative correction for the dilution effect has been developed based on the spatial distribution of tropospheric ${NO}_{2}$ columns probed by OMI and averaged over 1 year. This dilution correction generally improves the comparison, reducing biases due to the spatial mismatch between ground-based and satellite observations. Generally OMI DOMINO v2 data agree better with ground-based data than GOME-2A GDP 4.8, especially for comparisons with MAX-DOAS data. The dilution correction improves the station-per-station comparisons with a few exceptions, generally at remote sites with local emissions surrounded by clean areas.

A large reduction of the bias is obtained when applying the dilution correction. In terms of validation results, MAX-DOAS and direct sun measurements are found to be highly consistent, and therefore they have been used as an ensemble to assess the accuracy of GOME-2A and OMI data. Results based on this ensemble indicate that, even after correction for the horizontal dilution effect, satellite tropospheric ${NO}_{2}$ columns are systematically biased low in comparison to ground-based measurements by 23 % to 39 % for GOME-2A and 18 % to 27 % for OMI, depending on the selected satellite product. A summary of the validation results is given in Table 6.

The dilution correction developed here is parameterized according to the distance from the station and is based on 1 year of OMI ${NO}_{2}$ measurements (2005). This approach has several identified limitations, such as assumptions made on the radial nature of the ${NO}_{2}$ distribution around the sites and the overall applicability of the ${NO}_{2}$ field derived in 2005. Another limitation is the different intra-pixel dilution expected for the OMI and GOME-2A measurements. It has been tested on a few extreme cases by taking into account the pixels' corner positions, showing improvement in the comparisons and elimination of the overestimation. Despite its simplicity and shortcomings, our dilution correction was shown to significantly improve validation results, and we anticipate that future developments will lead to further improvements. For example, possibilities exist to use estimates of the horizontal extent of MAX-DOAS measurements to improve the colocation with satellite data. MAX-DOAS instruments can also be operated in multiple azimuthal scan mode, which could be used to further refine the colocation with satellite pixels (Brinksma et al., 2008; Gratsea et al., 2016; Ortega et al., 2015; Schreier et al., 2019; Dimitropoulou et al., 2020). Finally, imaging MAX-DOAS systems such as the IMPACT instrument (Peters et al., 2019), which provides fast sampling of the full (360 $^{\circ}$ ) azimuthal range, may lead to significant improvements in tropospheric ${NO}_{2}$ validation close to source regions.

To further improve validation studies, information on the vertical distribution of ${NO}_{2}$ and aerosols is also needed to test the impact of a priori assumptions in satellite data retrieval. To some extent, this can be provided by MAX-DOAS instruments, making use of vertical profiling techniques for the inversion of tropospheric profiles of ${NO}_{2}$ and aerosols.

Finally, improving and further extending existing networks are essential requirements for future operational air quality satellite validation (Veihelmann et al., 2019). In this context, important steps include the following:

-
the further development of the PGN network of Pandora instruments, to better cover source regions in all continents and in the measurement areas of all current and future satellites;
-
the inclusion of MAX-DOAS instruments in the Network for the Detection of Atmospheric Composition Change (NDACC; De Mazière et al, 2018), based on ongoing efforts to harmonize retrieval methods and develop facilities for central data processing;
-
the systematic adoption of harmonized uncertainty characterization and reporting and of harmonized data reporting formats, another crucial point for data usage.

On this basis, it is anticipated that significant progress will be achieved in the near future towards the development of harmonized and quality-controlled global networks of UV-VIS MAX-DOAS and direct sun instruments. The development of such networks is an essential element for the validation and cross-mission consistency of the atmospheric composition satellite constellation bridging low-earth (LEO) and geostationary (GEO) orbits, in particular the ESA/EUMETSAT Copernicus Sentinel-4 (GEO) and -5 (LEO) series (planned for launch in from 2023 to 2036), the NOAA/NASA LEO Suomi-NPP/JPSS OMPS series (started in 2011, with JPSS launches planned to 2031), the CNSA LEO GaoFen-5 Environment Monitoring Instrument (2018) and the geostationary missions GEMS (2020) and TEMPO (2022) developed by the United States and South Korea and the United States, respectively.

Code and data availability

The datasets generated and analyzed in the present work are available from the corresponding author on request, and data per station can be requested from the individual PIs.

The supplement related to this article is available online at: https://doi.org/10.5194/amt-13-6141-2020-supplement.

Author contributions

GP and MVR planned this study. GP performed the validation and the associated investigations and wrote the manuscript. MVR and FH contributed to the scientific discussions and to the manuscript writing. NT participated in the OMI gridded maps' creation. JG keeps the GOME-2 GDP station overpass database up-to-date. All other co-authors provided ground-based data for the station(s) they are responsible for or support for the satellite data or the validation method. All co-authors were involved in the discussion of the results.

Competing interests

The authors declare that they have no conflict of interest.

Acknowledgements

EUMETSAT and the AC SAF are acknowledged for the production of GOME-2A GDP 4.8 data. KNMI is acknowledged for the production of OMI DOMINO v2.0 data, freely available from http://www.temis.nl (last access: 29 October 2020). QA4ECV data were obtained as part of the EC FP7 project Quality Assurance for Essential Climate Variables (QA4ECV; FP-SPACE-2013-1 project no. 607405). The Pandora data used in this work were obtained partly through the Pandonia Global Network (PGN) and are available publicly.

Financial support

This work has been supported by EUMETSAT through the AC SAF Continuous Development and Operations Phase (CDOP-3) and by the Belgian Federal Science Policy Office (BELSPO) via the ProDEx B-ACSAF contribution to the AC-SAF. Work done by Hitoshi Irie was supported by the Environment Research and Technology Development Fund (fund no. 2-1901) of the Environmental Restoration and Conservation Agency of Japan, JSPS KAKENHI (grant nos. JP19H04235 and JP17K00529), JAXA 2nd Research Announcement on the Earth Observations (grant no. 19RT000351).

Review statement

This paper was edited by Karin Kreher and reviewed by two anonymous referees.

Word count: 15685

Show less

© 2020. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

Multi-axis differential optical absorption spectroscopy (MAX-DOAS) and direct sun ${NO}_{2}$ vertical column network data are used to investigate the accuracy of tropospheric ${NO}_{2}$ column measurements of the GOME-2 instrument on the MetOp-A satellite platform and the OMI instrument on Aura. The study is based on 23 MAX-DOAS and 16 direct sun instruments at stations distributed worldwide. A method to quantify and correct for horizontal dilution effects in heterogeneous ${NO}_{2}$ field conditions is proposed. After systematic application of this correction to urban sites, satellite measurements are found to present smaller biases compared to ground-based reference data in almost all cases. We investigate the seasonal dependence of the validation results as well as the impact of using different approaches to select satellite ground pixels in coincidence with ground-based data. In optimal comparison conditions (satellite pixels containing the station) the median bias between satellite tropospheric ${NO}_{2}$ column measurements and the ensemble of MAX-DOAS and direct sun measurements is found to be significant and equal to $- 34$ % for GOME-2A and $- 24$ % for OMI. These biases are further reduced to $- 24$ % and $- 18$ % respectively, after application of the dilution correction. Comparisons with the QA4ECV satellite product for both GOME-2A and OMI are also performed, showing less scatter but also a slightly larger median tropospheric ${NO}_{2}$ column bias with respect to the ensemble of MAX-DOAS and direct sun measurements.

Details

Title

Validation of tropospheric NO2 column measurements of GOME-2A and OMI using MAX-DOAS and direct sun network observations

Author

Pinardi, Gaia¹

; Michel Van Roozendael¹; Hendrick, François¹; Theys, Nicolas¹; Abuhassan, Nader²; Bais, Alkiviadis³

; Boersma, Folkert⁴

; Cede, Alexander⁵; Chong, Jihyo⁶; Donner, Sebastian⁷

; Drosoglou, Theano³; Dzhola, Anatoly⁸; Eskes, Henk⁹

; Frieß, Udo¹⁰

; Granville, José¹; Herman, Jay R²

; Holla, Robert¹¹

; Hovila, Jari¹²; Irie, Hitoshi¹³; Kanaya, Yugo¹⁴; Karagkiozidis, Dimitris³; Kouremeti, Natalia¹⁵; Jean-Christopher, Lambert¹

; Ma, Jianzhong¹⁶

; Peters, Enno¹⁷

; Piters, Ankie⁹; Postylyakov, Oleg⁸

; Richter, Andreas¹⁷

; Remmers, Julia⁷; Takashima, Hisahiro¹⁸; Tiefengraber, Martin¹⁹; Valks, Pieter²⁰; Vlemmix, Tim⁹; Wagner, Thomas⁷; Wittrock, Folkard¹⁷

¹ Royal Belgian Institute for Space Aeronomy, BIRA-IASB, Brussels, Belgium
² NASA/Goddard Space Flight Center, GSFC, Greenbelt, MD, USA; University of Maryland, Joint Center for Earth Systems Technology, Baltimore, MD, USA
³ Laboratory of Atmospheric Physics, Aristotle University of Thessaloniki, AUTH, Thessaloniki, Greece
⁴ Royal Netherlands Meteorological Institute, KNMI, De Bilt, the Netherlands; Meteorology and Air Quality Group, Wageningen University, Wageningen, the Netherlands
⁵ NASA/Goddard Space Flight Center, GSFC, Greenbelt, MD, USA; LuftBlick, Innsbruck, Austria
⁶ Gwangju Institute of Science and Technology GIST, Gwangju, South Korea
⁷ Max Planck Institute for Chemistry, Mainz, Germany
⁸ A.M. Obukhov Institute of Atmospheric Physics, Russian Academy of Sciences, IAP/RAS, Moscow, Russia
⁹ Royal Netherlands Meteorological Institute, KNMI, De Bilt, the Netherlands
¹⁰ Institut für Umweltphysik, Universität Heidelberg, Heidelberg, Germany
¹¹ German Weather Service, DWD, Hohenpeissenberg, Germany
¹² Finnish Meteorological Institute, FMI, Helsinki, Finland
¹³ Center for Environmental Remote Sensing, Chiba University, Chiba, Japan
¹⁴ Research Institute for Global Change, JAMSTEC, Yokohama, Japan
¹⁵ Laboratory of Atmospheric Physics, Aristotle University of Thessaloniki, AUTH, Thessaloniki, Greece; Physikalisch-Meteorologisches Observatorium Davos, World Radiation Center (PMOD/WRC), Davos Dorf, Switzerland
¹⁶ Chinese Academy of Meteorological Sciences, Beijing, China
¹⁷ Institut für Umweltphysik, Universität Bremen, Bremen, Germany
¹⁸ Research Institute for Global Change, JAMSTEC, Yokohama, Japan; Faculty of Science, Fukuoka University, Fukuoka, Japan
¹⁹ LuftBlick, Innsbruck, Austria; Department of Atmospheric and Cryospheric Sciences, University of Innsbruck, Innsbruck, Austria
²⁰ Deutsches Zentrum für Luft- und Raumfahrt (DLR), Institut für Methodik der Fernerkundung (IMF), Oberpfaffenhofen, Germany

Pages

6141-6174

Publication year

2020

Publication date

2020

Publisher

Copernicus GmbH

ISSN

18671381

e-ISSN

18678548

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.5194/amt-13-6141-2020

ProQuest document ID

2461577302

Validation of tropospheric NO2 column measurements of GOME-2A and OMI using MAX-DOAS and direct sun network observations

Jump to:

Full text

Abstract

Details

Suggested sources