Introduction
Atmospheric methane () contributes significantly to the Earth's radiative forcing budget , making it the second most important anthropogenic greenhouse gas after carbon dioxide (). The major sources of atmospheric methane include wetland emission, rice production, enteric fermentation (cattle), termites, biomass burning, fossil fuel production, and waste . There remains, however, a large degree of uncertainty on the magnitude of these individual sources .
The lifetime of in the atmosphere is mainly controlled by its reaction with the hydroxyl free radical (OH), resulting in an atmospheric lifetime of approximately 9 . Given its long atmospheric lifetime, there is a need for long-term global measurements to fully understand how the atmospheric distribution of is evolving with time. Indeed, recent unexpected variability in the atmospheric growth rate of methane has emphasised gaps in our current understanding .
In order to begin to understand the spatio-temporal distribution of atmospheric methane, regular global satellite observations of can be coupled with highly precise but geographically sparse surface concentration data. Through the combination of both data sources, the large uncertainties related to the upscaling of surface concentration data can be minimised whilst also obtaining information in remote regions where surface measurements are not available.
Various studies have demonstrated the utility of such space-borne measurements in determining the regional surface fluxes of methane using data from the SCIAMACHY and Greenhouse gases Observing SATellite (GOSAT) instruments.
The SCIAMACHY instrument operated onboard ENVISAT and provided a 9- record (2003–2012) of global methane total column observations . The continuation of this time series of space-based observations was ensured by the launch of the first dedicated greenhouse gas measuring satellite, the Japanese GOSAT, in 2009 . GOSAT provides global coverage with a 3-day repeat cycle and was designed with the intention of characterising continental-scale sources and sinks.
In a previous work we presented the first year of our global short-wave infrared (SWIR) measurements of the dry-air column-averaged mole fraction of () from the GOSAT mission using the “proxy” retrieval approach. This data product has subsequently been developed and validated as part of the ESA Climate Change Initiative Greenhouse Gas project and we now report an assessment of the full 5- data set for version 5.0 of the University of Leicester GOSAT Proxy data product.
This work is motivated by the desire to better understand the uncertainty characteristics of the Proxy data for use within flux inversion systems, especially relating to uncertainties introduced by the model .
In Sect. we describe the retrieval approach, including details of the updates since the original version of the University of Leicester GOSAT Proxy data . In Sect. we compare both the Proxy and the ratio against the ground-based validation data. In Sect. we assess the model component of the Proxy for the first time, with Sect. then discussing the associated uncertainty of the final Proxy product and its utility in constraining surface fluxes within an inversion framework. Finally, we conclude the paper in Sect. and provide recommendations for data users.
University of Leicester GOSAT Proxy retrieval updates
The University of Leicester GOSAT Proxy retrieval utilises the Orbiting Carbon Observatory (OCO) “full physics” retrieval algorithm, developed for the original NASA OCO mission to retrieve (dry-air column-averaged mole fraction of ) from a simultaneous fit of SWIR and bands and has subsequently been modified to operate on GOSAT spectral data.
Full details of the OCO retrieval algorithm can be found in . In short, the retrieval algorithm utilises an iterative retrieval scheme based on Bayesian optimal estimation to estimate a set of atmospheric, surface, and instrument parameters from the measured spectral radiances, referred to as the state vector. The state vector of our retrieval consists of 20-level profiles for and volume mixing ratios (vmr), profile scaling factors for vmr, and temperature, surface albedo, and spectral dispersion.
Rather than perform the “full physics” retrieval as typically used for , an alternative approach is possible for , the so-called “proxy” method. First used for the retrieval of from SCIAMACHY , this approach uses the fact that there exists and spectral signatures located close together at around 1.6 and hence the majority of atmospheric scattering and instrument effects will be similar between the two bands. The ratio of the retrieved should cancel modifications to the length of the light path that are experienced due to scattering , with the effectively acting as a “proxy” for the unknown light-path enhancements. As is known to vary much less than , the final product can be obtained by multiplying this ratio by a model value, typically taken from a global chemistry transport model (Eq. ). The “proxy” retrieval approach has various advantages over the full physics approach . Because there is no reliance on an explicit a priori knowledge of the aerosol distribution, the proxy approach is more robust in the presence of aerosols and also far less sensitive to instrumental issues or inconsistent radiometric calibration between the spectral bands than is the case for the full physics approach. Additionally, as moderate scattering from aerosols will be cancelled out and still result in an accurate retrieval of , the number of successful soundings for the proxy approach is typically much higher than for the full physics approach which requires far stricter post-filtering. This leads not only to more soundings in general but also to more soundings over regions where very little full physics data may be available, such as in the tropics.
The main disadvantage with the proxy approach is that it is reliant on an accurate, unbiased model data set to convert the ratio back into an quantity; otherwise errors relating to the model may be folded into the final result. Here we present assessments of the different uncertainties to determine the importance of this aspect of the Proxy data.
We process the latest versions of the GOSAT Level 1B files (version 161.160) acquired directly from the NIES Large Volume Data Server and apply the recommended radiometric calibration and radiometric degradation correction as per .
For the spectroscopic inputs we use v4.2.0 of the OCO line lists with taken from the Total Carbon Column Observing Network (TCCON) line lists (version “20120409”). The a priori pressure, temperature, and water vapour is taken from the ECMWF ERA-Interim data . For the a priori we use the MACC-II inversion (v13r1) and for the we use the MACC-II inversion (v10-S1NOAA, using 2012 data for 2013) but here we adjust the stratospheric methane using a specialised full chemistry run (run ID 563) of the TOMCAT stratospheric chemistry model from the University of Leeds . This TOMCAT model run has been validated against ACE-FTS observations and was found to provide a more accurate representation of the stratosphere.
The spectral noise is estimated from the standard deviation of the out-of-band signal. Spectra over ocean or with a signal-to-noise ratio (SNR) below 50 are removed. Cloud-contaminated scenes are removed by the comparison of a clear-sky surface pressure retrieval from the A-band to the ECMWF surface pressure for the relevant measurement time and location. A scene is determined to be cloudy when the retrieved surface pressure differs by more than 30 from the estimated ECMWF surface pressure. This relatively loose threshold is allowed as the proxy retrieval approach remains relatively robust in the presence of near-surface clouds. The average difference between our retrieved surface pressure and ECMWF after filtering for cloud is approximately 3 with a standard deviation of below 10 , with the offset from 0 being attributed to spectroscopic uncertainties in the cross-sections. The Proxy retrieval is performed for all scenes that are deemed to be sufficiently cloud free.
After filtering for signal-to-noise, cloud, and data quality we are left with 1 032 760 retrievals over land between April 2009 and December 2013. Figure shows global maps of the Proxy for each season and compares it to the MACC-II model data. Both model and observation show the variability in time and space, in particular with the large emissions of methane from wetland and rice cultivation over India and S.E. Asia.
Seasonal global maps of the University of Leicester GOSAT Proxy (top) and the MACC-II (bottom) model data (v10-S1NOAA). Both model and observation show the variability in time and space, in particular with the large emissions of methane from wetland and rice cultivation over India and S.E. Asia. Note that GOSAT changed their pointing pattern in August 2010 from five across-track points to three across-track points, resulting in a change in spatial coverage.
[Figure omitted. See PDF]
Validation of the Proxy and ratio
This section presents the validation of the University of Leicester GOSAT Proxy v5.0 data through comparison to observations from the ground-based TCCON. In addition, for the first time the ratio itself, the core component of the Proxy data, is validated against the corresponding TCCON data.
TCCON is a global network of ground-based high-resolution Fourier transform spectrometers recording direct solar spectra in the near-infrared spectral region . The TCCON data are calibrated to World Meteorological Organization (WMO) standards by calibration against aircraft measurements . Although it should be noted that this aircraft calibration does not measure the whole column, the TCCON data are the standard against which current satellite observations of greenhouse gases are validated .
To date, all previous validation of satellite greenhouse gas observations against TCCON has used TCCON data that were affected by instrumental biases relating to a laser sampling error which resulted in an error of approximately 0.26 % (1 ) . Although the corresponding error was not quantified, it is expected that it would be of similar magnitude (i.e. 1 part in 400). The latest, recently released, version of the TCCON data (GGG2014) incorporates a correction for the laser sampling errors and any remaining bias is expected to be small.
GGG2014 TCCON data and the Proxy plotted as time series for each TCCON site. The mean GOSATTCCON difference, the standard deviation of the GOSATTCCON difference, the correlation coefficient, and the number of soundings are all provided for each site.
[Figure omitted. See PDF]
Figure shows the GGG2014 TCCON data and the Proxy plotted as time series for each TCCON site. The mean GOSATTCCON difference, the standard deviation of the GOSATTCCON difference, the correlation coefficient, and the number of soundings are all provided for each site.
Figure (top) shows the correlation between the GGG2014 TCCON data and the Proxy values within 5 of each TCCON site and a temporal coincidence of 2 h. It should also be noted that for all TCCON comparisons, the difference inherent in the data due to using different a priori has been compensated for (as discussed in , by replacing the a priori used in the GOSAT retrievals with the TCCON a priori after the retrieval has been performed) which typically increases the GOSAT data by an average of between 0 and 5 with the larger effect seen at more northernly TCCON stations. We use all TCCON sites where version GGG2014 has been processed at the time of writing that contain data during the GOSAT time period (2009–2014). This results in 11 TCCON stations ranging from Sodankylä, Finland, at 67.4 N to Lauder, New Zealand, at 45.0 S. The correlation between the GOSAT and TCCON data is reasonable/good across all sites, ranging from 0.54 at Karlsruhe to 0.79 at Lauder with an overall correlation coefficient of 0.87 between 22 619 points. The overall bias is found to be 4.8 with an overall single measurement precision of 13.4 (ranging from 8.3 at Darwin to 14.9 at Garmisch). The station-to-station bias, which is an indication of the relative accuracy, is calculated as the standard deviation of the individual site biases and is found to be just 4.2 .
In addition to the validation of the Proxy data, we also present for the first time the validation of the ratio. This ratio is the quantity directly retrieved from the satellite measurement, is independent of any model , and has recently itself been used directly within a flux inversion study . The correlation coefficient across all stations is found to be 0.88 (ranging from 0.6 at Wollongong to 0.88 at Sodankylä) with a mean bias of 0.014 and a single-sounding precision of 0.033 (ranging from 0.20 at Darwin to 0.037 at Garmisch). The statistics for the ratio are therefore comparable to those of the Proxy itself, suggesting that the majority of the variation is from the satellite retrieval itself and not introduced by the model . The next section investigates this aspect in more detail.
Correlation plot of the Proxy (top) and the ratio (bottom) data against TCCON ground-based FTS data at 11 TCCON sites. The overall bias, standard deviation (single-sounding precision), correlation coefficient, and total number of soundings are provided. Note that the Lauder TCCON station upgraded the instrument from a Bruker 120 to a Bruker 125 in February 2010 and these two data sets are displayed separately.
[Figure omitted. See PDF]
Assessing the model ensemble component
In Sect. the final Proxy and the component were both validated against the TCCON data. In this section we validate the remaining component of the proxy product from Eq. (), namely the model .
As discussed in Sect. , this update to the
University of Leicester GOSAT Proxy data uses an ensemble
of model data to act as the model
component. We utilise the from three state-of-the-art
global transport models which all assimilate surface in situ
measurements; GEOS-Chem
Correlation plot of the model data for GEOS-Chem, MACC-II, CarbonTracker, and the ensemble median against TCCON ground-based FTS data at 11 TCCON sites. The overall bias, standard deviation (single-measurement precision), correlation coefficient, and total number of soundings are provided separately.
[Figure omitted. See PDF]
The main concern with using modelled data for the proxy method is that the additional uncertainty added to the final proxy data product is difficult to determine. Where the model data are constrained by surface data there can be a high degree of confidence that the model data are close to representing the true value of ; however, it is away from such regions where there is a possibility of adding additional biases into the Proxy data. The TCCON stations are mostly in regions that are also well constrained by surface in situ measurements and hence the model data should be well constrained, at least at the surface level, and it is therefore expected to reasonably reproduce the TCCON column data. Figure confirms that this is the case. As the model is used as a component in the proxy retrievals, the models are treated as “pseudo-measurements” and validated in the same way as the satellite data in order to maintain consistency with the satellite validation. The model data sampled at each GOSAT measurement point within 2 of each TCCON station are found to agree well with the TCCON data, with the correlation coefficients ranging from 0.94 (GEOS-Chem) to 0.97 (MACC-II and CarbonTracker). Similarly the precision and bias to TCCON are both found to be small (ranging from 0.97 to 1.3 and 0.07 to 0.27 respectively). The relative accuracies (the standard deviation of the individual site biases) are similar at around 0.5 ppm, with CarbonTracker and GEOS-Chem performing slightly better than MACC-II. Another metric to assess the models is how often they provide the median value of the ensemble. CarbonTracker (41 %) and MACC-II (36 %) tend to provide the median value more often than GEOS-Chem (22 %) but this can vary per site with the contribution from MACC-II as low as 27 % at Darwin (and CarbonTracker at 60 %) and conversely as high as 44 % at Wollongong (with CarbonTracker only 21 %). This provides further indication that no one model can be determined to be the “best”.
For a more detailed analysis of the performance of the different models please see Table in Appendix A. In short, none of the models are found to consistently be superior over the other models. GEOS-Chem typically has the highest scatter against TCCON but also has the smallest bias at 5 out of 12 of the sites. MACC-II has the smallest bias at seven sites but the highest bias at four of the sites. CarbonTracker has the highest bias at seven of the sites but also has the smallest scatter at eight of the sites. Whilst the absolute bias in the calculated median is typically not quite as small as the best of the individual models, the scatter in the median is better than (or the same as) the best scatter from any of the individual models at every site except Lauder_120 (where the time series is the shortest) and even there it is only worse than the best model by less than 0.1 .
The above has demonstrated that it is not a simple decision to determine which model most accurately represents the true atmosphere, even in locations where all of the models have been constrained by (often the same) surface measurements and high-quality validation data are available. In more remote regions where we neither have validation data nor surface measurements to constrain the models, this inconsistency between the models becomes more pronounced. It is this uncertainty in model in regions away from the available validation data that we attempt to address through the use of the model ensemble. Each of the three models are sampled at every GOSAT time and location and convolved with the scene-specific GOSAT averaging kernels. The median value of the three model values is used as the model in calculating the final Proxy . However, we also define the uncertainty on this median as the maximum of the absolute differences of each individual model to the median value.
We have already demonstrated that the models all well reproduce the validation data at TCCON sites without any one model identified as being better than the others from our comparisons. Where the models all agree well with each other away from the validation sites, the assumption is that the models are accurately representing the true atmosphere. Where the models disagree with each other, we do not know which model is correct in the absence of further validation data and in some cases the discrepancy between models can be very large (i.e. ). In such cases where no validation is possible, the best estimate of the uncertainty in the model data is obtained by examining the difference of the model data around the median value. Figure shows global maps of this estimated model uncertainty for each season. There are clear spatial/temporal patterns in the distribution of this model uncertainty. During March–May (boreal “spring”), there is a large uncertainty ( ) over India and the African regions typically associated with biomass burning. There is also a moderate level of uncertainty ( ) over Europe, South America, and for the latter years over North America and Australia. For the summer months (June–August) it is the Eurasian region, extending from the Ural mountains eastwards through Siberia and northern China, where the model uncertainty is largest at over 2 . This is to be expected as in the Northern Hemisphere it is the period of greatest photosynthetic activity and the model sensitivity to the underlying mechanisms is likely to be largest. During boreal autumn (September–November), the uncertainty in the Northern Hemisphere is vastly reduced again, with India being the major region of uncertainty along with South America and regions of biomass burning in Africa. Winter is similar to autumn, with all three models in very good agreement with each other in the Northern Hemisphere, with only S.E. Asia showing a moderate level of uncertainty. In the Southern Hemisphere, again South America and southern Africa show moderate uncertainty which appears to be linked to emissions from biomass burning.
This section has shown that the estimated uncertainty of the model can vary greatly in time and space. When considering the implication of this uncertainty on flux inversions of the Proxy data, the relative importance of the different uncertainties must be considered. The following section investigates the distribution of the model uncertainty and judges its relative importance against the a posteriori error from the retrieval itself. Finally, both of these uncertainties are assessed against the difference to modelled already constrained by surface observations to determine the utility of the satellite data despite the presence of these uncertainties.
Seasonal maps of the model difference, defined as the maximum absolute difference of the three-model ensemble from the median. All individual soundings have been averaged into grid boxes over each season. The largest uncertainties occur in regions where the variability is expected to be highest and the models are unconstrained by surface measurements.
[Figure omitted. See PDF]
Assessing the relative uncertainties
In order to assess the importance of the uncertainty of the model , we bin the three model fields into grid boxes over 8-day time steps and calculate the maximum difference of the three-model ensemble from the median value to use as an estimate of the uncertainty in the model values. We convert this uncertainty in model into an uncertainty in by multiplying each point by its respective retrieved amount. We also calculate the average a posteriori error for the same data. Unlike the more systematic model uncertainty, the a posteriori error should be close to random and hence reduce approximately with the square root of the number of soundings being averaged. If the error does not reduce as much, the model component would then contribute even less to the total, leading to this assumption being a “worst case” scenario for the effect of the model uncertainty. These grid boxes are then themselves averaged over the Transcom regions as defined in Fig. .
The Transcom regions over which the gridded data are then averaged in Fig. .
[Figure omitted. See PDF]
In Fig. , the red line shows the mean of the Proxy random (a posteriori) error from each box averaged over each Transcom region with the green line representing the estimated uncertainty related to the model . The majority of regions exhibit a similar trend over time. The a posteriori error peaks in the winter months when the SNR of the measurement is at its lowest and is at a minimum during the summer months when the SNR is at a maximum. This seasonal effect is more pronounced at higher latitudes which experience a greater degree of variability of sunlight throughout the year. Conversely, the model uncertainty follows biospheric activity with the uncertainty largest during the summer months when the variability is at a maximum and reduces to a minimum in the winter months when biospheric activity is lower. This leads to the situation where the a posteriori error dominates the model uncertainty in winter months but during summer months the model uncertainty can be comparable to, or even exceed, the a posteriori error. Taking the North America Temperate region as an example, during winter the a posteriori error can reach up to 8 with the error from the model significantly lower with values less than 2 . In contrast, during the summer months, the a posteriori error reduces to around 5 but the error for the model increases to 5 , meaning that both become significant components of the overall uncertainty.
We have shown that the uncertainty related to the model can, particularly in the Northern Hemisphere during summer months, be of comparable magnitude to the a posteriori retrieval error. However, that in itself does not preclude the data from adding useful information to a flux inversion.
The MACC-II model (v10-S1NOAA) data have assimilated NOAA surface measurements at background sites and hence are well constrained in the remote atmosphere . Here we calculate the difference between the MACC model field and the GOSAT Proxy data for each GOSAT measurement (referred to from here as ). We then aggregate these differences in the same way as the model uncertainties. Note that the MACC model data are currently only available until the end of 2012. As some inversion systems will perform a simple (e.g. latitudinal) bias correction, the calculated retrieval a posteriori and model uncertainties can potentially be much lower than the value but still not provide information to the inversion. For this reason, it is also important to consider both the mean () and the standard deviation () of the . To determine whether the GOSAT data are capable of providing information to the inversion, we compare the a posteriori and model uncertainties to the and values as shown in Fig. , with the seasonal averages for all of these values presented in Table .
Time series for each Transcom region showing the a posteriori retrieval error (red), the estimated uncertainty from the model (green), and the mean (navy) and standard deviation (purple) of the difference between the GOSAT and MACC-II . The a posteriori error is assumed to be a random error and hence reduces with the square root of the number of measurements whilst the model uncertainty is expected to be a systematic error and hence does not reduce.
[Figure omitted. See PDF]
It should be noted here that the absolute values are not necessarily quantitatively comparable when taking into account how an inversion system will use the two different quantities. The a posteriori error of the retrieved is an indication of the weighting that the inversion will give to an observation over the a priori, with a smaller value indicating that the inversion will “trust” the observation more. The is an indication of how much the inversion needs to adjust the fluxes in order to match the observation. However, if the estimated uncertainties are significantly less than the and values it is expected that the observations should provide value to the inversion. It should also be noted that this bias term () may also reflect systematic biases in the model due to, for example, errors in the vertical model profile whilst the sigma term () may also relate to subgrid-scale variations which are unresolved at the model resolution.
For the North American Boreal region, both the and values are very similar in terms of phase and magnitude to the a posteriori uncertainty with the ranging from an average of 9.6 in summer to 14.5 in winter compared to the a posteriori uncertainty that ranges from 6.6 in summer to 10.8 in winter. This suggests that regardless of the contribution to the uncertainty from the model, it would be difficult for the satellite data to inform the inversion any further than the in situ data already do. However, this is not the case for the North American Temperate region where the (7.4–11.6 ) and (8.4–10.0 ) are far larger than the total uncertainty (6.0–6.9 ) for much of the year. Both South American regions exhibit more complicated behaviour with far less of an apparent seasonality in the . Instead, for most years is much higher than the uncertainties (which themselves do not exhibit much seasonality in these regions). However, the year 2010 seems to be an anomalous year where the data are much more in agreement and in this year the difference is of comparable magnitude to the uncertainties with values between 4 and 8 . The does exhibit more seasonality than the and is again considerably higher than the estimated uncertainties (7–20 vs. 5–6 ). In combination, this suggests that the GOSAT observations over South America should add considerable information to the inversion.
For Northern Africa, both the a posteriori error and the uncertainty related to the model are small due to the high SNR over the Sahara and the low variability respectively (with seasonal average values ranging from 3.3 to 3.6 for the a posteriori error and from 2.8 to 4.6 for the model error). In contrast, the (7.2–12.3 ) and (4.9–8.4 ) values over this region are relatively large with a high degree of temporal variability, suggesting that the satellite data should add considerable value in constraining the inversion over this region. One complication is that GOSAT operates in a “medium gain” mode over the desert and hence may exhibit different instrumental biases over such regions but, due to the proxy method, any such differences in instrumental biases that relate to light-path modification should be minimised. Southern Africa shows similar behaviour with the total uncertainty being low (seasonal averages of 5.1–7.3 ) compared to the much larger (12.6–19.8 ) and (5.7–10.7 ) values, again indicating that considerable value is present in the satellite data.
The Eurasian Boreal region behaves similarly to the North American Boreal region. The and is of similar phase and magnitude to the retrieval a posteriori error, suggesting little information will be added to any inversion over this region beyond what is available from the in situ measurements. In contrast, the values over the Eurasian Temperate region show a large variability with the differences in winter months much larger than the total uncertainty (9.9 vs. 5.8 ), while in summer months the magnitudes become much more similar (5.5 vs. 7.3 ). Interestingly, the values appear to be of similar magnitude (5–20 ) but directly out of phase with the values. Even during summer months when the a posteriori (4.3 ) and model (5.8 ) uncertainties are comparable to the (5.5 ), the high variability in the values, as indicated by values of up to 20 (and a summertime mean value of 15.0 ), suggests that the observations are capable of providing useful information to the inversion.
The Tropical Asian region, which encompasses parts of India, China, and Indonesia, typically has low values for both the a posteriori (5.6–6.9 ) and model (4.4–6.0 ) uncertainties, with neither exhibiting much seasonal variability. The and values however are much more variable (8.9–11.7 and 9.0–16.8 ) and generally much higher than the uncertainties, suggesting that useful information from the satellite data is present.
The European Transcom region has uncertainties in the satellite data (seasonal averages of 7.6–10.0 ) that are of comparable magnitude to the values (8.1–10.7 ), especially when considering the combination of the a posteriori and model uncertainties. However, the standard deviation of the values is highly variable (8.7–13.2 ) which suggests that there is scope for the observational data to aid in constraining the European fluxes.
Finally, the Australian Transcom region shows very small uncertainties in the satellite data. The uncertainty associated with the model is comparable to the a posteriori error during the Australian spring months but even in those circumstances, the values are far larger (11.4 vs. 4.7 ), demonstrating that the satellite data are capable of providing some information to the inversion over Australia, although this may be limited in its ability to provide specific information on Australian sources as the values over this region are similar to the estimated uncertainties with seasonal averages of 4.5–5.2 compared to the total uncertainty values of 4.1–5.0 .
Summary and conclusions
We present details of the update to the University of Leicester GOSAT Proxy v5.0 data set with 5 of GOSAT data now processed. The data are validated against the latest ground-based TCCON data and found to agree well with on average a small bias of 4.8 ( %), a single-sounding precision of 13.4 ( %), and a relative accuracy of 4.2 . For the first time the ratio component of the proxy retrieval is validated and also found to agree well with TCCON with a bias of 0.014 ( %) and a single-sounding precision of 0.033 ( %).
A major unknown uncertainty in previous Proxy products was the uncertainty associated with the model . In this work we validate three separate state-of-the-art chemistry transport models against the TCCON data and find that although the models can differ greatly ( ) away from the TCCON stations, at the validation locations it is difficult to distinguish which model performs better from our comparisons. We therefore decide to use the median of the three models to act as the model in the calculation of the Proxy and use the maximum difference to the median as a measure of the uncertainty in the model . This model uncertainty is found to vary greatly in time and space but is typically largest over regions associated with biomass burning such as central Africa and in particular over the Eurasian regions during summer months where large uptake in leads to large differences between the models.
In order to assess the relative importance of these uncertainties, we compare this model uncertainty to the a posteriori retrieval error over the different Transcom regions and find typically that where there is seasonality in the uncertainties, it is typically directly out of phase between the two, resulting in the model uncertainty becoming significant during summer months where the a posteriori error is at its lowest. This relates to the fact that more sunlight leads to a reduction in the a posteriori uncertainty (by virtue of providing a greater signal in the SWIR) and at the same time is associated with an increase in photosynthesis and, hence, more potential for differences in the model .
We assess the significance of these uncertainties on any flux inversion by comparing the mean and standard deviation of the GOSATMACC differences ( and ) to the estimated uncertainties. We find that for the majority of regions the mean and standard deviation of the values are much larger than the estimated uncertainties, even taking into account the uncertainty related to the model . Our findings show that useful information will be provided to the inversions for the majority of regions, with the exceptions being the boreal regions (North American Boreal and Eurasian Boreal) where the uncertainty is of a similar magnitude to the and values. It is important to note that the MACC data are already constrained by NOAA background sites.
One final consideration for users of the Proxy data who are performing atmospheric inversions is that, should they have their own model which they believe is consistent with their model, it may be beneficial to only take the GOSAT ratio and apply their own model (with appropriate averaging kernels) in order to minimise transport model errors between the different models. Alternatively the ratio can also be inverted directly as shown in and .
Data sets
The GOSAT Proxy data used in this publication are freely
available from
Table showing the comparison statistics between each model (sampled as per the GOSAT measurements) within 2 of each TCCON site against the TCCON validation data. The difference (modelTCCON), the standard deviation of the difference, and the correlation coefficient are all provided as is the total number of measurements for each site, , and the percentage “share” of the median for each model, %. For each of the three models, GEOS-Chem, MACC-II, and CarbonTracker, the best (bold) and worst (italic) value for each metric is highlighted. For the ensemble median data, all values which are better than the best individual model value are highlighted in bold-italic. The lower panel provides overall statistics across all sites. These include the relative accuracy (the standard deviation of the individual site biases), the overall precision (the standard deviation of the GOSATTCCON differences), and the overall share that each model contributes to the median ensemble.
Coincident criteria | GEOS-Chem | MACC-II | CarbonTracker | Ensemble median | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
TCCON Site | Diff (ppm) | SD (ppm) | % | Diff (ppm) | SD (ppm) | % | Diff (ppm) | SD (ppm) | % | Diff (ppm) | SD (ppm) | |||||
Sodankylä | 584 | 1.1 | 1.1 | 0.97 | 20 | 0.9 | 0.9 | 0.98 | 37 | 1.2 | 0.9 | 0.99 | 42 | 1.1 | 0.8 | 0.99 |
Bialystok | 1429 | 0.6 | 1.5 | 0.95 | 25 | 0.4 | 1.1 | 0.97 | 33 | 0.6 | 1.0 | 0.98 | 44 | 0.6 | 1.0 | 0.97 |
Karlsruhe | 1569 | 0.2 | 1.4 | 0.92 | 22 | 0.6 | 1.1 | 0.95 | 33 | 0.4 | 1.1 | 0.95 | 45 | 0.4 | 1.1 | 0.95 |
Orleans | 1650 | 0.3 | 1.2 | 0.95 | 22 | 0.3 | 0.9 | 0.98 | 33 | 0.4 | 0.9 | 0.97 | 46 | 0.3 | 0.8 | 0.98 |
Garmisch | 1527 | 0.8 | 1.3 | 0.93 | 22 | 0.6 | 1.3 | 0.94 | 34 | 0.8 | 1.2 | 0.95 | 43 | 0.7 | 1.1 | 0.95 |
Park Falls | 2434 | 0.4 | 1.1 | 0.97 | 23 | 0.1 | 1.0 | 0.98 | 38 | 0.5 | 1.0 | 0.98 | 40 | 0.3 | 0.9 | 0.98 |
Lamont | 7464 | 0.2 | 1.6 | 0.92 | 20 | 0.1 | 0.9 | 0.98 | 39 | 0.0 | 0.9 | 0.98 | 41 | 0.1 | 0.9 | 0.98 |
Saga | 379 | 0.6 | 1.1 | 0.93 | 27 | 1.0 | 0.9 | 0.95 | 33 | 0.3 | 0.9 | 0.95 | 40 | 0.6 | 0.9 | 0.96 |
Darwin | 2491 | 0.0 | 0.8 | 0.97 | 12 | 0.5 | 0.7 | 0.97 | 27 | 0.4 | 0.6 | 0.98 | 60 | 0.3 | 0.6 | 0.98 |
Wollongong | 2601 | 0.1 | 0.8 | 0.96 | 36 | 0.1 | 0.8 | 0.96 | 44 | 0.2 | 0.9 | 0.95 | 21 | 0.0 | 0.8 | 0.96 |
Lauder_120 | 124 | 0.1 | 0.9 | 0.82 | 27 | 0.3 | 0.7 | 0.86 | 44 | 0.2 | 0.8 | 0.84 | 29 | 0.2 | 0.8 | 0.84 |
Lauder_125 | 368 | 0.3 | 0.4 | 0.99 | 30 | 0.2 | 0.3 | 0.99 | 40 | 0.4 | 0.4 | 0.99 | 30 | 0.3 | 0.3 | 0.99 |
Statistics for all sites | ||||||||||||||||
Relative accuracy (ppm) | 0.48 | 0.53 | 0.47 | 0.48 | ||||||||||||
Overall precision (ppm) | 1.3 | 0.97 | 0.96 | 0.92 | ||||||||||||
Total median share (%) | 22 | 36 | 41 |
Table showing the seasonal averages of the data plotted in Fig. for each of the Transcom regions. The retrieved a posteriori error, the uncertainty related to the model , their combined total, and the mean and standard deviation of the GOSATMACC difference are all provided for each season and for each Transcom region.
Region | Season | A posteriori | Model | Total | Region | Season | A posteriori | Model | Total | |||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
(ppb) | (ppb) | (ppb) | (ppb) | (ppb) | (ppb) | (ppb) | (ppb) | (ppb) | (ppb) | |||||
North America | spring | 7.4 | 2.2 | 7.8 | 6.3 | 13.0 | Eurasian | spring | 7.1 | 3.2 | 8.0 | 4.5 | 13.1 | |
boreal | summer | 6.6 | 4.9 | 8.3 | 2.9 | 9.6 | boreal | summer | 6.3 | 7.0 | 9.5 | 4.4 | 9.9 | |
autumn | 10.0 | 2.8 | 10.5 | 6.2 | 11.4 | autumn | 9.0 | 3.1 | 9.6 | 5.4 | 11.1 | |||
winter | 10.8 | 2.3 | 11.1 | 11.5 | 14.5 | winter | 9.9 | 2.5 | 10.3 | 5.7 | 13.9 | |||
North America | spring | 5.1 | 3.0 | 6.0 | 11.2 | 9.2 | Eurasian | spring | 4.4 | 3.5 | 5.7 | 11.7 | 9.7 | |
temperate | summer | 4.6 | 4.7 | 6.6 | 7.4 | 8.6 | temperate | summer | 4.3 | 5.8 | 7.3 | 5.5 | 15.0 | |
autumn | 5.3 | 3.1 | 6.3 | 7.7 | 8.4 | autumn | 4.4 | 4.1 | 6.2 | 6.5 | 10.2 | |||
winter | 6.5 | 2.4 | 6.9 | 11.6 | 10.0 | winter | 5.2 | 2.6 | 5.8 | 9.9 | 9.6 | |||
South America | spring | 6.4 | 4.3 | 7.8 | 8.6 | 11.2 | Tropical Asia | spring | 6.0 | 6.0 | 8.5 | 8.9 | 12.2 | |
tropical | summer | 5.3 | 4.3 | 6.9 | 10.8 | 8.6 | summer | 6.9 | 5.0 | 8.7 | 10.9 | 16.1 | ||
autumn | 5.8 | 5.2 | 7.9 | 10.7 | 12.6 | autumn | 6.2 | 6.0 | 8.7 | 11.7 | 16.8 | |||
winter | 6.5 | 4.1 | 7.8 | 7.4 | 15.5 | winter | 5.6 | 4.4 | 7.2 | 10.9 | 9.0 | |||
South America | spring | 4.7 | 3.7 | 6.1 | 6.0 | 10.7 | Australia | spring | 3.9 | 2.5 | 4.7 | 11.4 | 4.7 | |
temperate | summer | 4.3 | 3.4 | 5.5 | 9.1 | 7.4 | summer | 3.7 | 1.8 | 4.1 | 11.7 | 4.5 | ||
autumn | 4.2 | 3.7 | 5.7 | 9.1 | 9.7 | autumn | 3.7 | 2.1 | 4.3 | 13.9 | 5.2 | |||
winter | 4.7 | 3.8 | 6.1 | 7.2 | 13.5 | winter | 4.2 | 2.5 | 5.0 | 15.0 | 5.2 | |||
Northern Africa | spring | 3.6 | 3.6 | 5.2 | 8.8 | 7.3 | Europe | spring | 6.7 | 3.5 | 7.6 | 9.2 | 12.1 | |
summer | 3.6 | 4.6 | 5.9 | 7.2 | 7.4 | summer | 5.9 | 5.3 | 8.0 | 10.7 | 8.7 | |||
autumn | 3.4 | 3.5 | 4.9 | 12.3 | 8.4 | autumn | 7.5 | 3.0 | 8.2 | 8.1 | 8.9 | |||
winter | 3.3 | 2.8 | 4.3 | 8.0 | 4.9 | winter | 9.4 | 3.1 | 10.0 | 9.3 | 13.2 | |||
Southern Africa | spring | 4.7 | 4.6 | 6.7 | 15.5 | 9.9 | ||||||||
summer | 3.7 | 3.4 | 5.1 | 12.6 | 5.7 | |||||||||
autumn | 4.8 | 3.7 | 6.1 | 13.5 | 9.8 | |||||||||
winter | 5.4 | 4.8 | 7.3 | 19.8 | 10.7 |
The TCCON and data used in this publication
are publicly available from
Acknowledgements
We thank Japanese Aerospace Exploration Agency, National Institute for Environmental Studies, and the Ministry of Environment for the GOSAT data and their continuous support as part of the Joint Research Agreement. R. J. Parker is funded through an ESA Living Planet Fellowship. The work at the University of Leicester and University of Edinburgh was supported by funding through the UK National Centre for Earth Observation (NCEO), the Natural Environment Research Council (NERC), and the ESA Greenhouse Gas Climate Change Initiative (GHG-CCI). Part of this work was funded by the NERC Amazonian Carbon Observatory project (NE/J016284/1) and the NERC GAUGE project (NE/K002465/1). This research used the ALICE High Performance Computing Facility at the University of Leicester.
The authors would like to thank Paul Wennberg as the TCCON PI for provision of TCCON data. The European TCCON groups acknowledge financial support by the EU project InGOS. The RAMCES team at LSCE (Gif-sur-Yvette, France) is acknowledged for maintenance and logistical work for the Orléans TCCON site. The University of Bremen acknowledges support from the Senate of Bremen, the EU-project ICOS-INWIRE, and operational funding from the National Institute for Environmental Studies (NIES, Japan). A part of the work at JAXA was supported by the Environment Research and Technology Development Fund (A-1102) of the Ministry of the Environment, Japan. A part of this work has been supported by the European Commission Seventh Framework Programme (FP7/2007–2013) projects MACC under grant agreement 218793 and MACC-II under grant agreement 283576.Edited by: I. Aben
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2015. This work is published under http://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
We present 5
The uncertainty relating to the model
We compare this uncertainty to the a posteriori retrieval error (which is assumed to reduce with sqrt(
We assess the significance of these model and retrieval uncertainties on flux inversion by comparing the GOSAT
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details







1 Earth Observation Science, Department of Physics and Astronomy, University of Leicester, Leicester, UK; National Centre for Earth Observation, Department of Physics and Astronomy, University of Leicester, Leicester, UK
2 Earth Observation Science, Department of Physics and Astronomy, University of Leicester, Leicester, UK
3 School of GeoSciences, University of Edinburgh, Edinburgh, UK; National Centre for Earth Observation, School of GeoSciences, University of Edinburgh, Edinburgh, UK
4 European Commission Joint Research Centre, Institute for Environment and Sustainability, Ispra, Italy
5 Lab. des Sciences du Climat et de l'Environnement, CNRS, Gif-sur-Yvette, France
6 Institute of Environmental Physics, University of Bremen, Bremen, Germany
7 Institute of Environmental Physics, University of Bremen, Bremen, Germany; School of Chemistry, University of Wollongong, Wollongong, Australia
8 Karlsruhe Institut für Technologie, Karlsruhe, Germany
9 Japan Aerospace Exploration Agency (JAXA), Tsukuba, Japan
10 Finnish Meteorological Institute, Arctic Research, Sodankylä, Finland
11 School of Chemistry, University of Wollongong, Wollongong, Australia