Bayesian atmospheric correction over land:

Full text

Turn on search term navigation

1 Introduction

Land surface monitoring at optical wavelengths from medium resolution Earth observation (EO) requires an accurate and consistent description of the bottom-of-atmosphere (BOA) spectral bidirectional reflectance function (BRF) that is made readily available to users. This is acknowledged in efforts to develop consensus on “analysis-ready data” (ARD) and the value of such data for monitoring global change, as emphasised elsewhere . Community needs for “CEOS Analysis Ready Data for Land” (CARD4L) are stated as “threshold” (minimum) and “target” (desirable) requirements, with the former reflecting current practice and what is achievable with existing approaches and the latter being an agreed position for the scientific and user communities to move towards. No uncertainty threshold or target values for surface reflectance are given in the CEOS ARD specification, but in this paper, we use specifications adopted by , given in Table , as threshold values for aerosol optical thickness (AOT), total column of water vapour (TCWV), and BOA BRF $r$ specification.

Table 1

Threshold uncertainty specifications for aerosol optical thickness (AOT), total column of water vapour (TCWV), and BOA BRF ( $r$ ) used in this paper.

Quantity	Threshold uncertainty	Reference
AOT	$0.05 + 0.15 AOT$
TCWV	$0.2 + 0.1 TCWV$
$r$	$0.005 + 0.05 r$

An important capability highlighted in target requirements is that per-pixel uncertainty estimates should be supplied , but this is lacking in the CEOS fully assessed USGS Landsat Collection 2 ARD product and the ESA Sentinel-2 Level-2A . Such information is vital for traceability and rigorous scientific analysis with ARD products . Additionally, the ACIX intercomparisons of and surface reflectance comparison studies illustrate that further work is needed to ensure accuracy and consistency of such data, which is critical for combining data with different spatial, spectral, temporal, and radiometric characteristics and for achieving more comprehensive and/or frequent monitoring than with any single set . This requires a focus on both accuracy and inter-operability, which we suggest is not being adequately realised in current approaches.

In this paper, we describe the approach used for the UK NERC NCEO BOA BRF (https://catalogue.ceda.ac.uk/uuid/ad7de4e3b3b34cc0adca86c68e94d3a1, last access: 21 October 2022) product from medium resolution S2/MSI and L8/OLI sensors . It is sensor agnostic over the optical domain in the sense that it does not rely on particular optical waveband sets, so is called Sensor Invariant Atmospheric Correction (SIAC). SIAC aims to be CARD4L-compliant at threshold requirements and to move towards target requirements by providing per-pixel uncertainty values. This is enabled by applying a Bayesian framework to the estimation of atmospheric parameters from medium resolution multispectral observations and other constraints. The resulting parameters are used to derive an estimation of BOA BRF. Mean estimates derived from SIAC are validated against the criteria in Table through a global comparison of derived AOT and TCWV estimates with in situ AERONET measurements, comparisons of retrieved surface reflectance with in situ Radiometric Calibration Network (RadCalNet) measurements, and interoperability comparisons of surface reflectance between S2 and L8. The uncertainty in the retrievals is also assessed, which complements the further validation of SIAC and inter-comparison with other processors in the ACIX-II experiment .

Table 2

Main symbols used in the paper. $C_{*}$ represents the covariance matrix part of the PDF for parameter $*$ .

Symbols	Meaning
$G_{m}$	Medium resolution grid of target sensor Level 1C data
$G_{c}$	Coarse resolution grid of atmospheric state variables
$d$	Relative day index, the day of a sample data point relative to the target day $(- 16 \leq d \leq 16)$
$f_{iso} (d), f_{vol} (d), f_{geo} (d)$	MDC43 BRDF model parameters for relative day $d$ on $G_{c}$
$k_{vol} (Ω_{v}, Ω_{s}), k_{geo} (Ω_{v}, Ω_{s})$	MDC43 BRDF model kernels for angles $Ω_{v}$ and $Ω_{s}$ on $G_{c}$
$Λ_{m}$	Set of native sensor wavebands of medium resolution sensor
$Λ_{c}$	Set of sensor wavebands of coarse resolution BRF
$D$	First order spatial difference matrix, defined on $G_{c}$
$R \sim N (r, C_{r})$	A posteriori PDF of BOA spectral BRF defined over $Λ_{m}$ on $G_{m}$
$R_{b} \sim N (r_{b}, C_{b})$	Apriori (background) PDF of BOA spectral BRF on $G_{c}$ over waveband set $Λ_{m}$ , unless $Λ_{c}$ stated explicitly. Can also be specified as function of relative day $d$ as $R_{b} (d)$
$X \sim N (x, C_{x})$	PDF of atmospheric state variables defined on $G_{c}$
$X_{b} \sim N (x_{b}, C_{x_{b}})$	A priori PDF of atmospheric state variables defined on
$Y \sim N (y, C_{y})$	PDF of observations over $Λ_{n}$ defined on $G_{m}$ at $(Ω_{s}, Ω_{v})$
$Y_{c} \sim N (y_{c}, C_{y_{c}})$	PDF of observations over $Λ_{m}$ defined on $G_{c}$ at $(Ω_{s}, Ω_{v})$
$X_{a_{c}} \sim N (x_{a_{c}}, C_{x_{a_{c}}})$	Augmented state vector containing $X$ , BOA BRF estimate $R_{b}$ , and ancillary variables (ozone and altitude) defined on $G_{c}$
$X_{a_{m}} \sim N (x_{a_{m}}, C_{x_{a_{m}}})$	Augmented state vector containing $X$ , TOA BRF $Y$ and ancillary variables (Ozone and altitude), defined on $G_{m}$
$\hat{Y} \sim N (\hat{y}, C_{\hat{y}})$	PDF of modelled observations over $Λ_{m}$ defined on $G_{c}$ at $(Ω_{s}, Ω_{v})$
$J_{obs}$	Observational negative log likelihood on $G_{c}$
$J_{prior}$	A priori negative log likelihood on $G_{c}$
$H (X_{a_{c}})$	Observation operator $H$ that defines TOA spectral reflectance as a function of augmented state vector $X_{ac}$
$γ$	Smoothness parameter used in differential constraint
${\overline{\overline{t}}}^{↓}$	Total (direct and diffuse) downwelling atmospheric transmittance, including modulation by gaseous absorption
${\overline{\overline{t}}}^{↑}$	Total (direct and diffuse) upwelling atmospheric transmittance
${\overline{\overline{r}}}^{↓}$	Spherical albedo of the atmosphere
$r^{↑}$	“Atmospheric intrinsic” or “path” reflectance, i.e. the upwelling reflectance of the molecule and aerosol layer in direction $Ω_{v}$ assuming a totally absorbing lower boundary, due to illumination from direction $Ω_{s}$ , modulated by gaseous absorption.

2 Atmospheric correction scheme in SIAC

2.1 Statement of the problem

We wish to estimate the probability distribution function (PDF) of BOA spectral BRF $R$ with illumination and viewing vectors $Ω_{s}, Ω_{v}$ , respectively, on a grid $G_{m}$ of medium-resolution pixels over a set of wavebands $Λ_{m}$ (see Table for symbol definitions). Under these conditions, this is driven by medium-resolution observations $Y$ from S2/MSI and L8/OLI sensors at 10–60 m resolution, in addition to other constraints. In SIAC, as in most other approaches, we first seek an estimate of atmospheric state $X$ over the target scene. We assume multivariate Gaussian PDFs throughout and ignore non-linear impacts on transformed distributions. Our approach targets the maximum a posteriori (MAP) estimate, given an a priori estimate of $X$ and $X_{b}$ and the observations $Y$ mapped to a grid $G_{c}$ of coarse resolution pixels (nominal 500 m resolution). We then apply $X$ at the original medium spatial resolution to the estimation of $R$ on $G_{m}$ . The heritage of the approach is the various works on Bayesian inference and optimal estimation applied to mapping atmospheric parameters from EO for other sensors , as this gives the framework for combining multiple sources of information and estimating per-pixel uncertainty. The MAP estimate of $X$ over $G_{c}$ is found by maximising the likelihood $P (X | Y)$ :

1 $P (X | Y) \propto P (Y | X) P (X) = exp⁡ [- J],$ with $J = J_{obs} + J_{prior}$ . The mean MAP estimate of $X$ is achieved by minimising the negative logarithm of $P (X | Y)$ in Eq. (), i.e. the “cost function” $J$ with respect to $X$ . $X$ in the current version of SIAC contains AOT at 550 nm and total TCWV in g cm $^{- 2}$ .

Figure 1

Schematic diagram of the SIAC processing chain.

[Figure omitted. See PDF]

2.2 Overview

Our approach uses a priori constraints in the form of a coarse resolution (500 m) spectral BRDF dataset from MODIS to provide sample land surface reflectance estimates as well as a very coarse resolution (40 km) estimate of atmospheric composition from the CAMS near-real-time (https://apps.ecmwf.int/datasets/data/macc-nrealtime/levtype=sfc/, last access: 21 October 2022) global assimilation and forecasting system , with an associated spatial correlation constraint for the atmospheric parameters. These are combined with observational data to solve an inverse problem to estimate atmospheric state at coarse resolution for the time and locations of the observations. We then use this to map from TOA to BOA reflectance (with associated uncertainty) at the native TOA data (S2 or L8) resolution. The method has the following steps, also summarised as a flowchart in Fig. . For the target observational dataset (S2 and L8, here) with given imaging location, geometry, and spectral bands, SIAC comprises two major steps:

1.
Atmospheric parameters estimation
- a.
  Simulation of TOA reflectance $Y_{c}$ at 500 m resolution from observations $Y$ , scaled with a calibrated MODIS effective point spread function (ePSF) model (Sect. );
- b.
  Simulation of BOA reflectance $R_{b}$ at 500 m from MODIS MCD43A1 product, mapped to target sensor spectral bands for sample pixels (Sect. );
- c.
  Development of atmospheric composition prior estimates of AOT and TCWV in $X_{b}$ from CAMS data, with a spatial correlation model on $X_{b}$ (Sect. );
- d.
  MAP estimate of the atmospheric parameters $X$ given $Y_{c}$ , $R_{b}$ , and $X_{b}$ (Sect. ).
2.
Atmospheric correction
- a.
  Application of $X$ to correct observed TOA reflectance $Y$ to a posteriori estimate of BOA BRF $R$ (Sect. ).

In this paper, we present the theoretical underpinnings of the method and major results, relegating details of the implementation and additional results to the appendices.

2.3 Observational constraint

We can express the observational negative log likelihood as

2 $J_{obs} = \frac{1}{2} (\hat{y} - y_{c})^{⊤} C_{\hat{y}}^{- 1} (\hat{y} - y_{c}) .$ Here, $^{⊤}$ is the matrix transpose operator and $^{- 1}$ the matrix inverse operator. Calculation of $J_{obs}$ as a function of variables in $X$ requires confronting a set of observations $y_{c}$ with modelled estimates $\hat{y}$ relative to the uncertainty in these, expressed as $C_{\hat{y}}$ here. We derive these terms below.

2.3.1 TOA observations

The main data controlling the estimation of $X$ (and so $R$ ) are medium-resolution observations $Y$ of TOA spectral reflectance from S2 or L8 on a level 1C grid $G_{m}$ used to form the observational constraint above over wavebands $Λ_{m}$ (Table ). A spectral mapping between MODIS and S2 and L8 is applied (see Appendix ) to correct the difference between their relative spectral response functions (hence the “sensor invariant” nature of SIAC). The output of the spectral mapping provides an estimate of reflectance from 400 to 2 400 nm every 1 nm. This provides a surface reflectance estimation for S2 B09, a band strongly sensitive to TCWV, even though MCD43 does not include this spectral region. Uncertainty from the spectral mapping is explicitly treated in the SIAC framework.

Table 3

MODIS, S2, and L8 bands used in SIAC for the atmospheric parameters retrieval.

MODIS		S2		L8
Band no.	Wavelength (nm)	Band no.	Wavelength (nm)	Band no.	Wavelength (nm)
3	459–479	2	457–523	2	452–512
4	545–565	3	542–578	3	533–590
1	620–670	4	650–680	4	636–673
2	841–876	8A	855–875	5	851–879
		9	931–958
6	1628–1652	11	1565–1655	6	1566–1651
7	2105–2155	12	2100–2280	7	2107–2294

Figure 2

From the top to bottom are the MODIS, L8, and S2 relative spectral response functions for each band, and the background is the atmospheric transmittance processed by 6S with US62 atmosphere profile and continental aerosol model with an AOT value of 0.2 at 550 nm.

[Figure omitted. See PDF]

We need TOA observational constraints $Y_{c}$ to drive Eq. (). The atmospheric state $X$ is defined at coarse resolution over grid $G_{c}$ , so the observational likelihood term must also be defined at the same scale. This involves mapping valid observations from $Y$ on grid $G_{m}$ to coarse resolution equivalents $Y_{c}$ on grid $G_{c}$ . This is achieved using the effective point spread function (ePSF) of the MODIS product following the approach of , as described in Appendix . The output through the ePSF modelling provides us with the TOA observations on grid $G_{c}$ , i.e. MODIS grid. We ignore uncertainty associated with this aggregated reflectance in the estimation of $X$ via Eq. (), assuming it is small compared to the atmospheric model uncertainty (below).

2.3.2 Modelling TOA reflectance

We need an estimate of TOA reflectance $\hat{Y}$ given atmospheric state $X$ to calculate $J_{obs}$ in Eq. (), which is provided by a radiative transfer (RT) model. In this paper, we follow other current approaches to this for medium resolution data by assuming the surface is Lambertian and that each pixel can be treated as independent. Under these assumptions, $\hat{y}$ is expressed by the “simple-form” relationship described for the 6S RT model by :

3 $\hat{y} = H (X_{a_{c}}) = r^{↑} + {\overline{\overline{t}}}^{↓} {\overline{\overline{t}}}^{↑} \frac{r_{b}}{1 - {\overline{\overline{r}}}^{↓} r_{b}} = \frac{r_{b} (p_{b} p_{c} - 1) - p_{b}}{r_{b} p_{a} p_{c} - p_{a}},$ where $p_{a} = 1 / {\overline{\overline{t}}}^{↓} {\overline{\overline{t}}}^{↑}, p_{b} = r^{↑} / {\overline{\overline{t}}}^{↓} {\overline{\overline{t}}}^{↑}$ and $p_{c} = {\overline{\overline{r}}}^{↓}$ , and $X_{a_{c}}$ is an augmented state vector, defined in Table , on grid $G_{c}$ . The terms $p_{i}, i \in a, b, c$ are lumped parameters for each waveband derived from 6 Sv . Clearly, $p_{a} \geq 1$ and depends on pathlengths in the atmosphere. Outside of the strong absorption bands (L8 band 6, S2 bands B09, B11), it has a general pattern of decreasing with increasing wavelength, as illustrated in Fig. . The path reflectance normalised by transmittance, $p_{b}$ , and spherical albedo $p_{c}$ show a similar spectral trend and become small outside of visible wavelengths. $p_{c}$ impacts multiple scattering between the land surface and aerosol layer in the atmosphere and is manifested as a slight curve in the relationship between $r_{b}$ and $y$ . Under the Lambertian assumption currently implemented in SIAC, these terms fully define the mapping from BOA BRF $r_{b}$ to TOA BRF $\hat{y}$ as well as the inverse (estimating $r$ from $y$ ). Within SIAC, they are calculated over a wide range of conditions using 6SV2.1. Running the model atmospheric model many times is computationally costly and is often approximated by using, e.g. look-up tables. Here, we provide fast surrogate approximations to the full atmospheric model – these are called emulators . These approximations are based on fully connected artificial neural networks (ANNs) and provide an estimate of the $p_{i}$ terms as a function of the model inputs $X_{a_{c}}$ . Additionally, the Jacobian of the atmospheric model (needed for efficient gradient descent minimisation and for uncertainty propagation) is also approximated by the emulator, making use of backpropagation techniques . In the current version of SIAC, we assume that atmospheric profiles used in 6S are from the US62 and that the aerosol type is “continental” . This choice of aerosol type may cause errors when conditions strongly depart from it, such as situations dominated by urban, maritime, or biomass burning conditions. See and for analysis of the impacts of aerosol types.

Direct calculation of TOA reflectance $\hat{y}$ needs an estimate of $r_{b}$ for comparisons with observations $y$ in Eq. (). Many algorithms take a spectral approach to the problem, assuming that the ratio in TOA reflectance between visible and SWIR bands over dark dense vegetation (DDV) targets is constant . Most operational algorithms for S2 and L8 are based on this, including LEDAPS for Landsat 4–7 , Sen2Cor for S2 , MAJA for S2 , etc. However, this constraint can be of limited value if suitable DDV targets cannot be found well dispersed in the scene. Other relevant approaches applied to coarse resolution data find alternative methods to estimate surface reflectance: the Deep Blue method uses a coarse resolution seasonal global reflectance database for blue and red wavelengths over bright surfaces to extend the range of conditions that can be used; MAIAC develops an expectation from a time series of observations; and use a set of spectral basis functions that need to be solved for each observational constraint sample. A variation of that is the use of a wider set of spectral basis functions used in processing hyperspectral data by .

In SIAC, we avoid the sampling limitations of DDV and take advantage of these other ideas of providing a dynamic and globally applicable expectation of surface reflectance. We use the MODIS MCD43A1 BRDF/albedo (collection 6) product to achieve this and derive an a priori model of surface reflectance for all target wavebands for the viewing and illumination angles $Ω_{v}$ , $Ω_{s}$ , respectively, of $Y_{c}$ . For relative day index $d$ , this gives $r_{b} (d)$ at MODIS wavebands $Λ_{c}$ , via the Ross-Thick Li-Sparse Reciprocal (RTLSR) linear kernel models and the values of the model parameters for relative day $d$ : 4 $\begin{aligned} r_{b} (d, Λ_{c}) & = f_{iso} (d) + f_{vol} (d) k_{vol} (Ω_{v}, Ω_{s}) \\ + f_{geo} (d) k_{geo} (Ω_{v}, Ω_{s}) . \end{aligned}$ Samples of this around the day of the observation $d$ are used to provide a gap-filled, uncertainty-quantified estimate of $r_{b} (Λ_{c})$ , detailed in Appendix . This is mapped to the target (S2/MSI or L8/OLI) waveband set $Λ_{m}$ as $r_{b} (Λ_{m})$ , as given in Appendix . The framework can tolerate incomplete coverage of $\hat{Y}$ , so we filter for plausible constraints from the MODIS data, as described in Appendix , to avoid gross errors from inappropriate values of interpolated MODIS surface reflectance.

2.4 A priori constraint on atmospheric state

We can express the negative log of the prior pdf (up to a proportional constant) as

5 $J_{prior} = \frac{1}{2} (x - x_{b})^{T} C_{x_{b}}^{- 1} (x - x_{b}) .$ This gives a constraint based on a background (a priori) estimate of atmospheric state, $X_{b}$ . In SIAC, the prior mean comes from the European Centre for Medium-Range Weather Forecasts (ECMWF) CAMS near -real-time (https://apps.ecmwf.int/datasets/data/macc-nrealtime/levtype=sfc/, last access: 21 October 2022) services for estimates of atmospheric composition parameters AOT at 550 nm, total column water vapour, and total column of ozone in $X_{b}$ and $X_{a_{c}}$ . These are at a coarse spatial resolution on a 40 km grid, but we need $X_{b}$ on a 500 m grid to match with $r_{b}$ , so the data are interpolated to 500 m resolution.

The role of $C_{x_{b}}$ is to encode the prior expectation of variance and spatial correlation of the AOT and TCWV fields. AOT and TCWV variances are reported in the CAMS global validation report (see Sect. for a detailed derivation).

We expect the AOT and TCWV fields to have long correlation lengths , which result in non-zero off-diagonal elements in $C_{x_{b}}$ . This spatial correlation structure is defined using a Markov process covariance after . This has two free parameters: the variance $σ_{x_{b}}^{2}$ and relative length scale. Since the covariance appears in the constraint Eq. () in inverse form, we use a fast approximation; this is derived from and implemented as a first-order spatial difference constraint defined in matrix $D$ . 6 $C_{x_{b}}^{- 1} = \frac{1}{σ_{x_{b}}^{2} k^{2}} (I + γ^{2} D^{T} D)$ Here, $I$ is the identity matrix, $k$ a normalising scale factor given in Eq. (), and $γ$ an implicit function of the relative length scale that controls the degree of spatial smoothness. Numerical values used for uncertainty in SIAC are given in Appendix .

The TOA reflectance observation, modelled TOA reflectance, and prior information on the atmospheric parameters are processed to $G_{c}$ through the procedures described above. Thus, the D matrix is also defined at $G_{c}$ , i.e. 500 m to provide spatial constraint for the atmospheric parameters. For a variable length of $n$ , the matrix $D$ is given as 7 ${[\begin{array}{cccccc} - 1 & 1 & 0 & 0 & \dots & 0 \\ 0 & - 1 & 1 & 0 & \dots & 0 \\ 0 & 0 & - 1 & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋱ & ⋮ \\ 0 & 0 & 0 & \dots & - 1 & 1 \end{array}]}_{(n - 1) \times n} .$ Having this prior inverse covariance matrix allows a flow of information from regions in the scene that are well constrained by observations to areas that are poorly constrained or missing.

2.5

MAP estimate of $X$

We obtain the MAP estimate of $x$ by minimising $J$ in Eq. () with respect to $X$ . This is done simultaneously for all samples in the grid $G_{c}$ of $X$ using the efficient L-BFGS-B gradient descent algorithm . The approach and details follow , using the derivatives of $J$ with respect to $X$ . The cost function and its partial derivatives exploit the ability of the emulators to provide accurate approximations to the atmospheric RT model and its partial derivatives . Multi-grid methods (following, e.g. ) are used to iteratively provide spatially refined solutions. This greatly improves convergence rates in the optimisation over the large dimensional state vector of $X$ . The uncertainty in $X$ , $C_{x}$ is calculated as in Appendix .

2.6 Atmospheric correction

We assume a Lambertian surface in the atmospheric correction process. The relative errors caused by this Lambertian assumption on the surface reflectance is 3 %–12 % in the visible bands and 0.7 %–5.0 % in the near-infrared bands. Its effect on the NDVI analysis is around 1 % and less than 1 % for albedo , which is within the 5 % accuracy requirement for albedo indicated by the Global Climate Observing System (GCOS) . This Lambertian assumption is also widely used to produce the surface reflectance for MODIS , Landsat , and S2 , where the Landsat and S2 surface reflectance products have been accepted as the CEOS-assessed ARD products .

The mapping from $Y$ to $R$ given $X$ at medium (10–60 m) spatial resolution on the grid $G_{m}$ is achieved by rearranging the terms in Eq. () to give $r$ :

8 $r = \frac{p_{a} y - p_{b}}{1 + p_{c} (p_{a} y - p_{b})} .$ We calculate the $p_{a, b, c}$ with the mean atmospheric parameters $x$ and the auxiliary data (Ozone and elevation) at MODIS spatial grid $G_{c}$ . A linear interpolation is then used to re-sample the $p_{a, b, c}$ to the target sensor grid $G_{m}$ , which is then used to derive the mean surface reflectance $r$ with Eq. (). The simple linear interpolation method used to resample the $p_{a, b, c}$ to sub-MODIS scale is justified, as atmospheric parameters are known to exhibit much larger correlation lengths (100s of km) . The TOA uncertainty is taken to be 5 % for both S2 and L8 and is independent for each waveband. This is the threshold uncertainty value for S2 TOA reflectance. The calculation of per pixel uncertainty in $r$ uses partial derivatives of $r$ with respect to atmospheric parameters $x$ and TOA reflectance $y$ , as shown in Appendix . The per-pixel reflectance uncertainty derived in this way and propagated from uncertainty in the atmospheric parameters and the measurements is an important feature of SIAC.

2.7 Summary of SIAC approach

In SIAC, the atmospheric composition at 500 m is inferred by combining three sources of constraints:

an a priori constraint on land surface reflectance (at 500 m) derived from the MODIS MCD43 product
an a priori constraint on atmospheric composition (AOT and TCWV) derived from CAMS near-real-time predictions
an expectation of spatial smoothness (correlation) in atmospheric composition parameters at the 500 m scale.

The spatial and spectral mismatch between the original S2 or L8 products and MODIS are dealt with by modelling the MODIS ePSF (Appendix ) and using spectral mapping based on a hyperspectral data library (Appendix ), respectively. These constraints form the observational cost function

J_{obs}

(Eq. in Sect. ) and prior cost function

J_{prior}

(Eq. in Sect. ). By minimising the combined cost value (

J_{prior} + J_{obs}

), the uncertainty-quantified estimates of AOT and TCWV at the 500 m resolution are obtained. These estimates are then interpolated to the S2 or L8 spatial resolutions to parameterise Eq. () for the atmospheric correction.

Table 4

Datasets used in SIAC.

Dataset	Usage	Reference	Notes
S2	TOA reflectance
L8	TOA reflectance
ASTER Global DEM	Per-pixel elevation		Horizontal resolution of 75 m covering 83 $^{\circ}$ north (N) and 83 $^{\circ}$ south (S) latitudes
ESA global water mask	Water mask
MCD43A1	Surface reflectance expectation
CAMS	Prior for AOT andTCWV
Spectral libraries	Spectral mapping from MODIS to target sensor	, , ,	USGS V7, ASTER, KLUM, ICRAF-ISRIC
AERONET	Validation of retrieved AOT and TCWV	,	Data from 2017–2019
RadCalNet	Validation of surfacereflectance		Data from 2017–2019

Figure 3

Globally distributed AERONET sites (dot markers) used in this study for the validation of retrieved atmospheric parameters.

[Figure omitted. See PDF]

3 Materials and method

3.1 Study region and datasets

We validate using SIAC-derived atmospheric composition to estimate surface reflectance over globally representative sites for the years 2017–2019. We use S2 and L8 granules over more than 400 AERONET sites, seen in Fig. , as well as granules encompassing three RadCalNet sites (Railroad Valley Playa, La Crau, and Gobabeb sites). This gives more than 3 000 S2 tiles and more than 2 500 L8 tiles in the evaluation. The datasets used in this study, with comments on their use, are listed in Table .

3.2 AERONET

The AERONET (AErosol RObotic NETwork) (https://aeronet.gsfc.nasa.gov/, last access: 21 October 2022; ) programme is a federation of ground-based remote sensing aerosol networks and provides globally distributed estimates of AOT, inversion products, and precipitable water. It has long been used as ground truth aerosol measurements and is used for the validation of various satellite inversions aerosol products. Atmospheric measurements from AERONET instruments were interpolated in time to get estimates corresponding to each satellite overpass. AOT at 550 nm was estimated using AERONET spectral log-transformed data with a second order polynomial between 400 and 860 nm following . The measurement uncertainty of AOT from AERONET is taken to be 0.01 , and that for TCWV is 0.15 % .

3.3 RadCalNet data

The Working Group on Calibration and Validation (WGCV) of the Committee on Earth Observation Satellites (CEOS) has been providing ground surface reflectance data through the Radiometric Calibration Network portal RadCalNet (http://www.radcalnet.org, last access: 21 October 2022; ) since 2018, with measurements taken as earlier as 2013 in the US Railroad Playa Valley site. RadCalNet provides nadir-view, top-of-atmosphere reflectance at 30 min intervals from 09:00 a.m. to 03:00 p.m. local standard time at 10 nm intervals from 400 to 2 500 nm, which is calculated from ground nadir-view reflectance measurements and atmospheric measurements such as surface pressure, columnar water vapour, columnar ozone, aerosol optical depth, and the Angstrom coefficient. TOA reflectance is simulated by propagating the measured surface reflectance through the atmosphere using the MODTRAN radiative transfer model, parameterised by measured local atmospheric composition measurements.

Here, we compare the SIAC-corrected data with measurements from three RadCalNet sites: the ESA/CNES site in Gobabeb (Namibia), the CNES site in La Crau (France), and the University of Arizona's site at Railroad Playa Valley (Nevada, United States), as these three sites measure over the entire solar reflective spectrum. Railroad Playa Valley is a high-desert playa surrounded by mountains to the east and west; La Crau has a thin, pebbly soil with sparse vegetation cover; and Gobabeb is over gravel plains. The area of interest (AOI) of the radiometric measurements for the sites is taken to be $30 m \times 30 m$ for Gobabeb and La Crau but $1 km \times 1 km$ the Railroad Valley Playa. The appropriate (S2 or L8) sensor spectral response functions are applied to the RadCalNet hyperspectral measurements that are closest in time to the S2 and L8 acquisitions to derive RadCalNet estimates of BOA reflectance in S2 and L8 wavebands. Gross mismatches due to cloud or other artefacts (such as saturation of pixel value, cloud shadow, modelling error from the RadCalNet surface reflectance to TOA reflectance, etc.) are filtered by comparing RadCalNet TOA reflectance (provided by RadCalNet) estimates with S2 and L8 TOA reflectances; 5 % is used as the target uncertainty of the S2 and L8 TOA reflectance, and the RadCalNet TOA uncertainty is around 2 %–5 % for non-absorption bands , which lead us to choose 10 % as the threshold to filter out bad samples. If the S2 and L8 TOA data fall outside a tolerance of 10 % of the RadCalNet TOA reflectances, we remove the sample from the comparison. This ends up with 273 S2 scenes and 72 L8 scenes over the RadCalNet sites, with 100 S2 scenes and 36 L8 scenes over Gobabeb, 93 S2 scenes and 19 L8 scenes over La Crau, and 80 S2 scenes and 17 L8 scenes over Railroad Valley Playa.

3.4 Sentinel 2 and Landsat 8

Sentinel 2A (S2A) and Sentinel 2B(S2B) were launched on 23 June 2015 and 7 March 2017, respectively. A single satellite revisits the Equator every 10 d, while a constellation of two satellites achieves an equatorial revisit time of 5 d, decreasing to 2–3 d at mid-latitudes. Each S2 has a 10, 20, and 60 m spatial resolution Multi-Spectral Instrument (MSI), with 13 spectral bands ranging from 443 to 2 190 nm. Identical to S2A and S2B, Sentinel 2C (S2C) is expected to be launched at the beginning of 2024, in which case S2A will be retired .

The Landsat project has provided the longest temporal record of moderate resolution, multi-spectral data over the Earth's surface. Landsat 8 was launched on 11 February 2013, having a global revisit time of 16 d, with 8 d offset to Landsat 7 for 8 d repeated coverage. Two push-broom sensors – the Operational Land Imager (OLI) and the Thermal Infrared Sensor (TIRS) – are mounted on the platform to provide multi-spectral and thermal observations of the Earth's surface at 30 and 100 m resolution, respectively. OLI has nine spectral bands, among which band 8 is panchromatic and has a spatial resolution of 15 m. At the time of writing, Landsat 9 had recently been launched (27 September 2021), but operational data are just coming online .

Both products provide projected and calibrated TOA reflectance datasets. Sentinel 2 products were obtained from the Copernicus Open Access Hub (https://scihub.copernicus.eu/, last access: 21 October 2022) and the L8 products from the USGS EarthExplorer (https://earthexplorer.usgs.gov, last access: 21 October 2022). The spectral characteristics of S2/MSI and L8/OLI, along with the MODIS land wavebands used in SIAC, are shown in Fig. and Table .

We process all near-simultaneous (maximum 1 h apart) scenes and tiles from S2 and L8 over the years 2017 to 2019 over the AERONET sites illustrated in Fig. . This gives 2635 S2 and 1922 L8 scenes and 3472 point samples.

3.5 Validation approach and metrics

We want to evaluate how well SIAC estimates mean BOA BRF and associated uncertainty over S2 and L8 wavebands. We can validate mean reflectance against measurements for some conditions using RadCalNet data. However, we can also gain confidence in the results by validating interim products (atmospheric parameters), testing uncertainty via the discrepancy principle , and examining patterns in uncertainty behaviour. Since we estimate surface reflectance from both S2 and L8 sensors and since these have some very similar wavebands, it is also worthwhile to look at the consistency of results between the sensors for samples over the same conditions.

We define residuals between values estimated from SIAC and measurements as follows: $\begin{matrix} 9 & Δ_{x_{atmo}} = x_{atm} - x_{aeronet}, \\ 10 & Δ_{RadCalNet} = r - r_{RadCalNet}, \end{matrix}$ for the residual $Δ_{x_{atmo}}$ for atmospheric parameters calculated from SIAC ( $x_{atm}$ ) and AERONET ( $x_{aeronet}$ ) and $Δ_{RadCalNet}$ for that between SIAC estimated reflectance and RadCalNet measurements $r_{RadCalNet}$ .

We define standardised residuals as follows: $\begin{matrix} 11 & ϵ_{x_{atm}} = \frac{Δ_{x_{atmo}}}{\sqrt{σ_{x_{atm}}^{2} + σ_{x_{aero}}^{2}}}, \\ 12 & ϵ_{r} = \frac{Δ_{RadCalNet}}{\sqrt{σ_{r}^{2} + σ_{r_{RadCalNet}}^{2}}}, \end{matrix}$ where $σ_{x_{atm}}$ and $σ_{x_{aero}}$ are the uncertainty in the SIAC retrievals of atmospheric parameters and aeronet measurements, respectively (see Sect. ). $σ_{r}$ and $σ_{r_{RadCalNet}}$ are the uncertainty in the SIAC retrievals of surface reflectance and that of the RadCalNet measurements, respectively. Assuming Gaussian distributions, we would expect the mean of $ϵ_{x_{atm}}$ or $ϵ_{r}$ to be 0 and the standard deviation to be 1 over a large number of samples. We follow in calculating accuracy ( $A$ ) and uncertainty ( $U$ ) metrics against AERONET and RadCalNet observations through $\begin{matrix} 13 & A = \frac{1}{n} \sum_{i = 1}^{i = n} Δ, \\ 14 & U^{2} = \frac{1}{n} \sum_{i = 1}^{i = n} Δ^{2}, \end{matrix}$ where $n$ is the total number of samples in a comparison, and $Δ$ is $Δ_{x_{atmo}}$ or $Δ_{RadCalNet}$ , as appropriate. The related measure, precision ( $P$ ), is given by $P^{2} = (\frac{n - 1}{n}) U^{2} - A^{2}$ . We recognise $A$ as a measure of bias and $U$ as the root mean squared error (RMSE). We assess SIAC against the threshold requirements in Table . For SIAC results to be within specification, we would expect 68 % to fall at or below the threshold value (assuming a Gaussian distribution) where samples or distributions are concerned. Where we calculate $U$ , we would expect it to lie on or below the threshold value.

4 Results

4.1 Validation of mean atmospheric composition over AERONET

We compare the 3 472 S2 and L8 samples over AERONET sites with independent in situ measurements of AOT and TCWV. Examples of the retrieved scene atmospheric parameters are given in Figs. – in Appendix . Since we use an a priori constraint on atmospheric state $X_{b}$ in estimating $X$ , we also assess $X_{b}$ against the AERONET measurements to see what improvement the medium resolution observations offers in this context. Comparisons of CAMS and SIAC AOT with AERONET measurements are shown in Fig. , with more detail for $A$ , $P$ , and $U$ (along with the number of samples used in each bin) given in Fig. . The threshold uncertainty (Table ) is shown on the plots.

Over all AOT values (Fig. ) for the a priori CAMS data, more than 73 % of samples are already within the threshold specification, which is slightly better than the 68 % expected. This increases to 77 % for S2 processing but is slightly reduced to 72 % for L8. The correlation coefficient is reasonably high for CAMS (0.58) but is dramatically improved by the data assimilation to 0.86 or better for S2 and L8. The regression for all cases is similar, with a slope that is slightly below unity (0.86–0.90) and a small intercept (0.03–0.05). The root mean squared error (RMSE, equivalent to the metric $U$ over all samples) is moderately large at 0.169 for CAMS but is reduced to 0.071–0.076 by the assimilation. The $A$ , $P$ , and $U$ plot shows that bias ( $A$ ) is low and that uncertainty and precision are close to the expected error for low values of AOT for both sensors, with S2 mostly slightly better than L8. The results are more variable and sometimes out of specification for higher values of AOT, but the sample size is small for those cases.

Figure 4

AOT validation against AERONET measurements from CAMS (a), SIAC S2 (b), and SIAC L8 (c) over 3 466 matches, where the vertical lines of each point are the uncertainty of solved AOT values, and the horizontal error bars are AERONET measurement uncertainty of 0.01. On three panels, the inset plot shows the region marked by the red square in more detail, with $0 \leq AOT \leq 0.3$ . The threshold uncertainty is shown as black dashed lines in the figure.

[Figure omitted. See PDF]

Figure 5

The accuracy ( $A$ ) (plus marker), precision ( $P$ ) (square marker), and uncertainty ( $U$ ) (triangle marker) validation of AOT against AERONET measurements. Threshold uncertainty is shown as black lines in the figure.

[Figure omitted. See PDF]

Figure 6

TCWV (g cm $^{- 2}$ ) validation against AERONET measurements from CAMS (a), SIAC S2 (b), and SIAC L8 (c) over 3 466 matches, where the vertical lines of each point are the uncertainty of solved TCWV values, and the horizontal error bars are 15 % of AERONET TCWV values. Threshold uncertainty is shown as dashed lines in the figure.

[Figure omitted. See PDF]

Figure 7

The accuracy ( $A$ ) (plus marker), precision ( $P$ ) (square marker), and uncertainty ( $U$ ) (triangle marker) validation of TCWV against AERONET measurements. Threshold uncertainty is shown as black lines in the figure.

[Figure omitted. See PDF]

Comparisons of CAMS and SIAC TCWV with AERONET measurements are shown in Figs. and . Over all values of TCVW, 86 % of the CAMS data are within specification. This is essentially the same after assimilation of L8 data but increases to 91 % for S2. The regressions for CAMS and L8 TCWV against AERONET have a slope close to unity and a low magnitude of intercept. The slope of the relationship is slightly poorer at 0.91 for S2. The $A$ , $P$ , and $U$ plot shows that S2 results are within specification for the vast majority of values of TCWV, with poorer $A$ and $U$ only for the highest value and P slightly out of specification for the lowest value. For L8, the results are very similar to those from CAMS alone. In this case, all values are within expected error, other than the precision and uncertainty for low TCWV. In summary, the CAMS and L8 a priori data are very similar (very low impact of the observations), and the results are good for all but the lowest values of TCWV. The assimilation of S2 data (mainly from band B09) improves these lower values but seems to cause poorer result for the highest value of TCWV, though this may be because of the small sample number.

4.2 Consistency check for S2 and L8

The S2 and L8 scenes over AERONET sites described above were atmospherically corrected to surface reflectance using SIAC. Since the overpass times between sensors are within 1 h, we expect the surface reflectance in overlapping spectral regions from both sensors to be highly correlated and can use this to test consistency between sensors. Differences in spatial coverage, acquisition geometry, spectral sampling, and other sensor characteristics may impact this, but we will assume them to be small.

Pixels within a $2400 \times 2400$ m $^{2}$ area around the AERONET sites in the S2 and L8 scenes presented above were considered for comparison. To account for geolocation errors and differences in spatial resolution, the L8 data were reprojected to the S2 reference system, and all data were spatially averaged to 60 m resolution. A filtering for cloud, shadow, and any large changes between scene acquisitions was then applied. Rather than relying on the cloud or shadow masks for this, we use compatibility in TOA reflectance to select candidate pixels. According to studies and operational validation reports , the agreement of nearby spectral bands (Table ) should be better than 5 %. Since we allow a larger temporal gap between S2 and L8 than some of these studies (1 h), we use a filter on a threshold of $10 % + 0.01$ between S2 and L8 TOA reflectance. This leaves around $3 \times 10^{6}$ pixels for comparison.

Figure 8

2D histogram of surface reflectance after the atmospheric correction from 3 466 S2 and L8 near simultaneous observations; each subplots shows the results for the closest S2 and L8 bands. The colour bar is shown using a logarithmic scale. The error bars are 3 $σ$ of the difference between the S2 and L8 corrected reflectance, which is computed at an interval of 0.05 from 0–1.

[Figure omitted. See PDF]

The results are shown in Fig. as a set of two-dimensional histograms. The reflectances are highly correlated (coefficient of determination $r > 0.98$ for all bands and $r > 0.99$ for bands in the NIR and SWIR regions), with a small RMSE ( $RMSE < 0.012$ ). The bias is very small (less than 0.0016 for all bands), and the slope is between 0.96 (blue band) and 1 (NIR band). The error bars of S2 and L8 bands are slightly larger than the 10 % $+ 0.01$ used to filter the TOA reflectances. This slight increase in the difference between S2 and L8 surface reflectance comes from the increase in uncertainty during the atmospheric correction process, but this is any case small.

4.3 Validation of uncertainty in atmospheric parameters

We need to verify that uncertainty values $σ_{x}$ calculated by SIAC are useful in characterising actual uncertainty. We approach this using the “discrepancy analysis” method suggested by to check if the error distributions in the AERONET comparisons described above follow an expected distribution. We calculate standardised residuals $ϵ_{x_{atm}}$ following Eq. () for AOT and TCWV for each sample. We know that different configurations, data, and algorithmic effects mean that some retrievals will be more accurate than others, so here, we test our ability to identify this by weighting the departure of SIAC estimates from AERONET measurements. If we have grossly overestimated uncertainty relative to actual discrepancy, then the standard deviation of the standardised residuals will be much less than one and vice-versa if we underestimate. A limitation of the assessment is that it can only be calculated over the AERONET sites where an independent measurement is available. But still, if the results mainly follow the expected statistics, it shows that the magnitude of uncertainties calculated in SIAC are plausible.

Part of the a priori constraint in SIAC is the imposition of a degree of smoothness on the atmospheric parameters through the parameter $γ$ in the inverse covariance function in Eq. (). We use $γ$ values of 5 for S2 and L8 AOT, 5 for S2 TCWV, and 0.1 for L8 TCWV in SIAC. Cross-validation studies suggest that there is a wide range of suitable values for $γ$ (Appendix for additional details on this choice and its implications). The scaling term $k$ in Eq. () should mean that the magnitude of uncertainty is not greatly affected by $γ$ . But since many applications of this type of constraint do not explicitly apply such a normalisation, we test the impact of that here and examine the distribution of $ϵ_{x_{atm}}$ over a range of $γ$ values in Figs. and .

Figure 9

Standardised residual $ϵ_{x_{atm}}$ distributions for S2 AOT (a) and L8 AOT (b) with $γ$ of [0.001, 0.1, 1, 5, 10, 100, 1000]. The red histogram distributions show standardised error distributions (Error distribution) from the data and the blue ones the estimated Gaussian distribution from the distributions (Fitted Gaussian).

[Figure omitted. See PDF]

Figure 10

Standardised residual $ϵ_{x_{atm}}$ distributions for S2 TCWV (a) and L8 TCWV (b) with $γ$ of [0.001, 0.1, 1, 5, 10, 100, 1000]. The red histogram distributions show standardised error distributions (Error distribution) from the data and the blue ones the estimated Gaussian distribution from the distributions (Fitted Gaussian).

[Figure omitted. See PDF]

The results show that, for $γ < 10$ , the standardised residual distributions of AOT for S2 and L8 remain broadly similar, i.e. the normalisation using $k$ seems to be effective. For S2, there is a small positive bias in AOT that decreases with increasing $γ$ , but the standard deviation $σ$ is around 1.0. For L8 AOT, there is a small positive bias in all cases (larger than for S2), but $σ$ is close to 1. Above $γ = 10$ for both, $σ$ increases and becomes unrealistic for very high $γ$ , so the normalisation is ineffective for extremely high values, possibly relating to boundary condition effects on Eq. () as an approximation to the intended inverse Markov process covariance function.

The distributions of $ϵ_{x_{atm}}$ for S2 TCWV retrieval show a slight overestimation in the TCWV uncertainty ( $σ$ smaller than 1) but almost no bias for $γ < 10$ . The L8 retrieval for TCWV is mainly controlled by the prior information, and the normalised error is close to 1, but with a broader distribution compared to S2 TCWV, as no water absorption band is available from L8 measurements. Again, the behaviour of these distributions for TCWV broadly confirms our choice of $γ$ for S2, but it seems that a higher value for L8 might be tolerated, and a compromise value of 5 might reasonably be used for $γ$ for all terms.

4.4 Validation of mean surface reflectance

We validate mean SIAC reflectance by comparison with ground measurements over RadCalNet sites. We compare mean S2 and L8 BOA reflectances from SIAC averaged over defined RadCalNet AOI boundaries with the RadCalNet estimates of BOA reflectance in Figs. , , and . Since TOA reflectance estimates are provided for the RadCalNet sites using observed surface reflectance and atmospheric parameters calculated with the 6S model, we also compare measured TOA reflectance for S2 and L8 with these. This provides context to interpret both the spectral signatures and any biases or other issues in the data. If there are mismatches between the TOA datasets (e.g. from sensor calibration), since one is essentially a direct (S2 or L8) measurement and the other developed only with measurements from the RadCalNet sites, we would not expect to do better than that using SIAC, where we have to estimate the atmospheric parameters.

Figure 11

Comparison between the S2 (a) and L8 (b) TOA reflectance and RadCalNet simulated nadir-view TOA reflectance (top row), and the surface reflectance after correction against RadCalNet nadir-view surface reflectance (bottom row) at Gobabeb. The blue lines on the left are different spectra measurements from RadCalNet, and the red dot with blue error bars (1 standard deviation) are the TOA or surface and TOA reflectance with uncertainty. The EE is defined as $\pm (0.05 TOA(BOA) + 0.005)$ , denoted as black dashed lines in the scatter plots. The regression line is draw as a red line and the 1-to-1 reference line is drawn as a thick black dashed line in the middle.

[Figure omitted. See PDF]

Figure 12

Same as Fig. but for La Crau site.

[Figure omitted. See PDF]

Figure 13

Same as Fig. but for Railroad Valley Playa site.

[Figure omitted. See PDF]

Figure 14

The ratio of SIAC-corrected S2 (a)and L8 (b) surface reflectance to RadCalNet surface reflectance over three sites. The black dashed line in the middle is the reference line of ratio 1, indicating the same values of SIAC-corrected S2 (a) and L8 (b) surface reflectance and RadCalNet surface reflectance. Values above the reference line indicate positive bias (SIAC overestimates compared to RadCalNet) and below it indicate negative bias; 5 % and 10 % biases are shown as dashed lines. Boxplot colours are the same as colours in Fig.

[Figure omitted. See PDF]

The agreement between the SIAC-retrieved surface reflectance and the reference measurements is very strong for all sites, with RMSEs values for the BOA products of around 3 %–5 % of RadCalNet ground measurement reflectance over all wavebands. The correlation coefficient $r$ is very high ( $> 0.94$ ) for all cases and is seen to increase slightly between TOA and BOA reflectance. The proportion of samples within the specification for TOA reflectance and SIAC-corrected surface reflectance are very similar. For BOA, there is 98 % for S2 and 95 %, respectively, within the specification for L8 over Gobabeb site, 88 % for S2 and 87 % for L8 over La Crau site, and 77 % for S2 and 86 % for L8 over Railroad Playa Valley site, so the results overall are well within the specification. Most samples outside of this can be attributed to the TOA reflectance being outside the RadCalNet TOA expectation limits. The patterns in the scatterplot of the small apparent biases in BOA reflectance are mirrored in the TOA analysis, suggesting that these arise from factors extraneous to the atmospheric correction. Interestingly, the results obtained using only the a priori CAMS data (included in Appendix ) show almost the same performance compared to RadCalNet as mean SIAC reflectance retrievals. The fact that sometimes the BOA statistics are better than the TOA is probably due to the better dynamic range of the surface reflectance signal compared to the TOA one.

The best performance is found over Gobabeb, but the results are only slightly poorer for La Crau, which has more variation in the pattern of spectral reflectance. The broader spread of results for the BOA analysis for Railroad Playa Valley is mimicked in the TOA data. A per-band analysis of the ratio of SIAC BOA reflectance to measured ground data over each RadCalNet site is given in Fig. . For Gobabeb, most of the time, the SIAC surface reflectance is within 5 % of RadCalNet ground measurements for all S2 and L8 bands, excluding the water absorption and Deep Blue bands (not shown). A similar situation is observed for the other two sites – although the distributions overrun the 0.95 mark at times, they are always within 10 %. The interquartile range (IQR) is within the 5 % limit in all cases other than L8 B2, where it goes slightly above.

4.5 Validation of uncertainty in surface reflectance

Although the assessment of $σ_{x}$ presented above is useful in understanding SIAC performance, we are ultimately more interested in knowing whether BOA reflectance uncertainty values $σ_{r}$ calculated by SIAC are useful in characterising surface reflectance uncertainty. We can do this with the same approach as above, calculating the standardised residual $ϵ_{r}$ from Eq. () between SIAC reflectance averaged over the RadCalNet AOIs and the RadCalNet measured reflectance convolved with the appropriate S2 or L8 bands. The expectations for this from the discrepancy principle are the same as for $ϵ_{x_{atm}}$ , as described above. Figure shows the distributions of $ϵ_{r}$ for the main surface reflectance wavebands.

Figure 15

SIAC surface reflectance uncertainty validation. The red histogram distributions show standardised error distributions (Error distribution) from the data and the blue ones the estimated Gaussian distribution from the distributions (Fitted Gaussian).

[Figure omitted. See PDF]

The histograms are quite noisy, particularly for L8, suggesting that the results might be impacted by a low sample number. For S2, the mean is very close to 0 for around half of the bands but can show a positive bias of up to 0.54 (B11). The standard deviation $σ$ is less than 1 for all cases, being as low as 0.73 for B11, indicating that we are likely overestimating the surface reflectance uncertainty by a factor of between 1.04 (B12) and 1.37 (B11) for S2. The L8 analysis shows broadly similar results, with a positive bias in the mean of similar magnitude. However, the values of $σ$ are generally lower, ranging from 0.55 (B06) to 0.75 (B07), suggesting an overestimation of uncertainty by a factor of between 1.33 and 1.8. In summary, our tests show that we have a conservative estimate of uncertainty. The overestimation of the variance in surface reflectance is more marked for L8 than S2.

4.6 Surface reflectance uncertainty behaviour

The work of showed that there are patterns to be expected in plots of uncertainty as a function of surface reflectance. The TOA uncertainty is assumed to be $σ_{y} = 0.05 y$ . Combining the ideas from and an examination of the equations from this paper, it is possible to gain further insights into the factors controlling the uncertainty in surface reflectance and to confirm that SIAC estimates of uncertainty follow the patterns as expected.

For low AOT and longer wavelengths, the sensitivity to uncertainty in AOT is low, and the term $Δ_{y} σ_{y}$ will mostly dominate. For these conditions, $p_{b}$ and $p_{c}$ will be very small (see Fig. ), so $Δ_{y} \approx p_{a}$ from Eq. (), so $y \approx r / p_{a}$ , and $Δ_{y} σ_{y} \approx 0.05 r$ . For shorter wavelengths, sensitivity to uncertainty in AOT increases. So, even for low AOT, the uncertainty is expected to be more than $0.05 r$ . For higher AOT and TCWV, there is significantly higher sensitivity to uncertainty in atmospheric parameters. In this case, uncertainty should have behaviour similar to Fig. 1 in , with a critical value of $r$ , for which $Δ_{H_{i}^{- 1}}$ in Eq. () is 0. For values of $r$ less than or greater than the critical value, the contribution from uncertainty in atmospheric parameters increases, resulting in a “V”-shaped behaviour if $σ_{r}$ is plotted as a function of $r$ . Figure shows scatterplots of typical behaviour of this. These are plotted for full S2 scenes for an example of a low AOT case (mean 0.15, ranging from 0.02 to 0.35) and high AOT case (mean 1.1, ranging from 0.9 to 1.25). Results are shown for each waveband, with the colour corresponding to those used in Fig. . The turning point reflectance for each band is indicated by a dashed line in that colour.

Figure 16

SIAC S2 uncertainty for different bands for low AOT (a) and high AOT (b). The baseline 5 % is noted as a dashed black line, and the minimum of the uncertainty of each band is noted with coloured dashed lines. Colours are used to indicate the bands shown in Fig. .

[Figure omitted. See PDF]

The turning point feature mainly arises from the AOT component of uncertainty in equation Eq. (). We have seen that uncertainty in AOT is expected to be higher for higher AOT (Fig. ); so for lower AOT, this component will be of lower magnitude, and the uncertainty will be dominated by TOA reflectance uncertainty. For higher AOT, the AOT uncertainty becomes more significant, especially for visible wavebands, and this feature becomes a more dominant part of the uncertainty behaviour, as we would expect.

The black dashed line in the subplots shows the lower boundary of 5 % uncertainty that would be expected for a TOA uncertainty of 5 %. All values appear on or above this line, providing some confidence in the calculations within SIAC. For low AOT, the longer the wavelength, the closer the behaviour directly mimics the TOA relative uncertainty. This arises from the decreasing magnitude of $p_{b}$ and $p_{c}$ with wavelength, as seen in Fig. . As these terms become negligible, the TOA and BOA reflectances become more proportionately related, and so with TOA reflectance uncertainty (low AOT), the proportionate TOA uncertainty more closely maps to proportionate BOA uncertainty.

5 Discussion and conclusions

5.1 Contributions of SIAC

Current approaches to atmospheric correction of S2 and L8 data over land use readily available and well-tested atmospheric RT codes, such as 6S, considered adequate for the task at hand . Since BRDF effects are not well sampled, they use the “simple form” of radiative interaction in Eqs. () and (), allowed by assuming the surface Lambertian. For the most part, they proceed by applying some sort of mask for clouds or other extraneous features, estimating atmospheric state $X$ based in part on observational constraints, and applying $X$ to estimate surface reflectance $r$ via Eq. (). As we have noted, they do not currently estimate per-pixel uncertainty. Differences in performance between approaches seen in exercises such as the ACIX intercomparisons of then come as a result of how effective the masks are and how they estimate $X$ . We do not attempt to explore the first of these issues in this paper. Issues in the latter can come down to the validity of assumptions that are made but can also be very dependent on getting a suitable spatial distribution of observational constraints.

SIAC follows these same steps and broad assumptions, but a major feature of the approach is its use of the reliable external operational data streams available nowadays in support of its estimations. It has other novel features, including accounting for PSF impacts in the scaling from L8 and S2 to MODIS. It is further distinguished by applying a Bayesian framework that is able to weigh up these contributions according to their uncertainty and directly estimate the resultant uncertainty in $X$ , from there providing a per-pixel estimate of uncertainty in $r$ . It is a data assimilation approach.

Rather than the more limited extent available for DDV approaches, SIAC uses a direct expectation of surface reflectance from an external coarse resolution source (MCD43), meaning that the keystone observational constraints can supply information to the solution over a wide range of conditions. This means that the solution is, to some extent, reliant on the accuracy of these MODIS reflectance predictions, so we go to some lengths to filter out samples that may not be reliable predictors. In this first implementation of SIAC, we ignore snow and water pixels as well as some other conditions (Appendix ), which may be over-cautious. The estimate of $X$ in SIAC is based on multiple constraint, so it is not in any case entirely dependent on the presence or quality of the observational constraints. The entire process (this includes state estimation, surface reflectance estimation, cloud masking, and uncertainty quantification) of a single S2 or L8 scene on a standard workstation with a 3.1 GHz i7 CPU and 16 Gb RAM takes between 20–30 min on a single process.

5.2 Accuracy assessment

SIAC aims to be CARD4L compliant and to meet threshold requirements for uncertainty. Validation results show that the method gives accurate (within threshold specifications) retrievals of uncertainty-quantified land surface reflectance, both for S2 and L8, for the most part. Moreover, the surface reflectances for the two sensors are compatible, an important step in using these sensors together for land monitoring applications.

A data assimilation system relies on having well-quantified uncertainties on the constraints used. This is vital in a relative sense for balancing contributions from different sources but also in an absolute sense for quantifying uncertainty. Unfortunately, none of the constraints we use have a per-pixel estimate of uncertainty to drive the analyses, so instead (Appendix ) we provide reasonable estimates, with references to justify the choice of uncertainty parameters and to assess the performance of the uncertainty estimates as part of the validation. In the absence of much observational constraint in a scene or area of a scene, our solution is based mainly on CAMS, so it should still provide a good, though less well spatialised, result. We see this to be borne out in the validation results for AOT and TCWV in Figs. and : a small additional percentage of AERONET samples come within specification in SIAC compared with the CAMS data, and the regressions equations remain very similar. What SIAC achieves is a large increase in the correlation coefficient for AOT and a reduction in RMSE, suggesting that this localisation is being achieved. When these results are analysed as a function of AOT, the bias $A$ is seen to be very low for AOT up to 0.5 (all comparisons that have more than 11 samples), with $U$ and $P$ lying almost exactly on specification. It is difficult to draw conclusions for AOT values beyond this due to the small sample size. There is some evidence that AOT uncertainty is slightly higher for L8 than for S2. For TCWV, the RMSE for S2 improves over that of CAMS, though it remains the same for L8. For S2, for all comparisons with more than 12 samples, $P$ and $U$ are well within specification, but for L8, this is not the case for TCWV values of less than around 1 g cm $^{- 2}$ .

The validation of atmospheric parameter uncertainty in Sect. suggests that the uncertainties calculated in SIAC are plausible for the selected values of $γ$ according to the discrepancy principle . Additionally, we have confirmed that the uncertainty estimation from SIAC is not strongly affected by the choice of $γ$ .

The results of comparison between SIAC BOA reflectance and RadCalNet measurements (Sect. ) suggest that the atmospheric correction is comparable to the atmospheric modelling done with the RadCalNet data driven by observed atmospheric parameters. The number of samples within specification is much higher than expected at the threshold assumed. For most land surface bands and most sites, relative errors are within 5 %.

Most of the time, at least over the conditions represented by RadCalNet, we can expect SIAC to estimate surface reflectance $r$ within the threshold specification. Further, the surface reflectances from S2 and L8 are seen to be consistent: the processing has not created artificial biases between the surface reflectance from the two sensors, a feature seen in several papers on the comparison of S2 and L8 surface reflectance . With the assumed 5 % TOA uncertainty, SIAC overestimates surface reflectance uncertainty, as shown in Sect. . This suggests that we might reasonably relax the assumption of 5 % uncertainty in TOA reflectance for S2 and L8 towards a 3 % value (other than B12) that would be consistent with . The result for L8 is similar to that for S2 but with estimated maximum TOA uncertainty of around 2.5 % from this analysis. The calibration of TOA uncertainty is detailed in Appendix .

Surface reflectances produced by SIAC are of high accuracy and are consistent for S2 and L8, as shown in Sect. and . But we also find that atmospheric correction results derived solely from CAMS atmospheric information over the RadCalNet sites demonstrated similar average behaviour (Figs. , , and in Appendix ). The main impact of the assimilation of observational data is a reduction in uncertainty of 10 % for S2 visible to near-infrared bands and 5 % for S2 SWIR bands with the assumed TOA uncertainty of 3 %–5 %. This is because the TOA uncertainty dominates the total uncertainty budget in the surface reflectance when the AOT is low (mean AOT is around 0.08 over RadCalNet sites). However, we should bear in mind that these sites are not necessarily representative of the challenging environments that might be encountered in global processing. The RadCalNet sites are located in places with quite homogeneous landscapes and mostly low aerosol loading , which is probably the main reason why there seems to be little improvement in mean reflectance after assimilation. This suggests that RadCalNet measurements should be used as a minimum test of any atmospheric correction method rather than a solid evaluation of the accuracy of atmospheric correction, and we ideally need more measurements covering different landscapes with variations of atmospheric states.

5.3 Future developments

In this paper, the atmospheric composition is set by a model (6S in this case) and by a choice of aerosol optical properties (continental aerosol model). The use of emulators of the RT model makes it easy to change the RT model entirely in the code or to use a different configuration of the model used. We can also extend the scheme to retrieve independent aerosol species concentrations by both modifying the RT model (and thus extending the number of parameters that go in the inference) and by using data on species distribution available from CAMS and extending the prior to cover these. A similar approach has been implemented in the MAJA processor , which uses the CAMS aerosol species data to define the aerosol types for the atmospheric correction and has found improved atmospheric correction results over deserts. This approach may well be valuable in areas of high dust aerosol loading or in situations where biomass burning results in an important contribution to aerosol concentrations.

SIAC relies on the surface reflectance constraint from the MODIS MCD43 product, but the current MODIS satellites are approaching the end of their mission . VIIRS is designed to produce a continuity data record to MODIS, so it may be possible to use the VIIRS VNP43 product as an alternative surface constraint, but this has not been tested. VIIRS land products additionally include bands within the Deep Blue spectral region (M1, M2), which are more sensitive to the aerosol variation. This information may be used to constrain the S2 and L8 Deep Blue bands to retrieve AOT over bright surfaces .

Appendix A

Estimation of $R_{b} (Λ_{c})$

We need an estimate of BOA BRF at $Λ_{m}$ , which we derive from an estimate of BOA reflectance $R_{b} (Λ_{c})$ from the MDC43 products. But the observation on the target day may be missing or of low quality. We seek to replace this with a gap-filled estimate based on the product QA flags. Gap filling is achieved with a robust smoother , having a smoothing factor $s$ of 0.5 (low smoothness). The inverse of bi-square weight $W$ for the target date, given by , is used to scale relative interpolation uncertainty. Then $σ_{r_{b}}^{2}$ for each of the six bands in Table is given by: A1 $σ_{r_{b, i}}^{2} = \frac{{0.015}^{2}}{W} \frac{\sum_{i = 1}^{i = 6} Λ_{i}^{- 1.6}}{Λ_{i}^{- 1.6}},$ where the base-level uncertainty of 0.015 is the broadband uncertainty in QA $= 0$ retrievals assessed by ; the spectral weighting follows and is applied to give a conservative estimate of uncertainty in the prediction of $R_{b} (Λ_{c})$ and to weight lower wavelengths more strongly. Here, $σ_{r_{b}, i}^{2}$ is the value of $σ_{r_{b}}^{2}$ for band index $i$ , with centre wavelength $Λ_{i}$ .

Appendix B Uncertainty considerations

A data assimilation system combines evidence from different streams by weighting them by their inverse uncertainties. In SIAC, the statistics of the uncertainties are assumed to be zero-mean Gaussian and thus only characterised by an associated covariance matrix. We review here the sources and values of these uncertainties. The observational and a priori constraints for the estimation of $X$ contain inverse covariance matrices $C_{\hat{y}}^{- 1}$ and $C_{x_{b}}^{- 1}$ that need definition. We also need to define the uncertainty associated with a posteriori BOA BRF $R$ , $C_{r}$ .

Observational uncertainty $C_{\hat{y}}$

The observational uncertainty in Eq. () has three main components that we can call $C_{m}$ , $C_{Λ}$ , and $C_{G}$ . The uncertainty associated with the MODIS BRF predictions arises mainly from (i) uncertainty in the MODIS BRF predictions and the temporal interpolation strategy, and (ii) the spectral transformations. The uncertainty in $r_{b} (Λ_{c})$ is given as $σ_{r_{b}}$ in Sect. . The uncertainty from the spectral transformations $C_{Λ}$ is calculated as the RMSE following Appendix . The observational uncertainty of the L1C product convolved with estimated ePSF, $C_{G}^{- 1}$ , is assumed to be much smaller than $C_{m}^{- 1}$ and is ignored in the estimation of $X$ . Any other uncertainties arising from the inadequacy of the radiative transfer model are also ignored. We assume $C_{\hat{y}}$ to be a diagonal matrix, so we need only define the variance terms $σ_{\hat{y}}^{2}$ .

Following , we apply an additional spectral weighting over relative wavelength $W_{Λ_{m}} = Λ_{m}^{- 1.6} / \overline{Λ_{m}^{- 1.6}}$ , where $\overline{Λ_{m}^{- 1.6}}$ is the mean of central wavelengths of the target sensor in $Λ_{m}$ over all bands raised to the power of $- 1.6$ . This has the effect of giving higher weight to shorter wavelengths to emphasise sensitivity to AOT. $C_{\hat{y}}^{- 1}$ is taken as a diagonal matrix, with elements B1 $σ_{\hat{y}}^{- 2} = W_{Λ} (σ_{r_{b}}^{- 2} + δ_{Λ_{m}}^{- 2})$ for each waveband in $Λ_{m}$ , defined on $G_{c}$ .

A priori state uncertainty $C_{x_{b}}^{- 1}$

We take the CAMS estimates of atmospheric state at 40 km $\times$ 40 km to be the mean values in $x_{b}$ . report that, globally, AOT has a small bias ( $- 0.04$ ) and an RMSE between forecast and observation of 0.17 versus AERONET match ups. report higher values around 0.23 over China, which suggests that we should take a more conservative approach in defining the uncertainty; so in SIAC, the a priori standard deviation for AOT $σ_{x_{b (aot)}}$ is a function of AOT, with a minimum threshold. A similar process of comparing CAMS TCWV with AERONET matchups is used to arrive at a relative uncertainty of $0.3$ for TCWV. $C_{x_{b}}^{- 1}$ in Eq. () is assumed to be a diagonal matrix. We first define standard deviation terms for the variables in $X$ : B2 $\begin{aligned} σ_{x_{b (aot)}} & = max (0.5 x_{b (aot)}, 0.02), \\ σ_{x_{b (tcwv)}} & = 0.3 x_{b (tcwv)}, \end{aligned}$ where $x_{b (aot)}$ and $x_{b (tcwv)}$ are the a priori estimates of atmospheric state in $X_{b}$ on $G_{c}$ . We assume no correlation between uncertainty in AOT and TCWV.

The inverse covariance function given in Eq. () is parameterised by smoothness $γ$ and is applied on the grid $G_{c}$ , with the first difference operator $D$ applied across rows and columns

The effect of applying a smoothness constraint in this way is similar to the combined prior and smoothness constraint used in EO-LDAS , GRASP , and but with an extra normalisation factor $k$ that allows the physical meaning of the inverse correlation and variance terms to be maintained for different degrees of smoothness.

. The normalisation factor

k^{2}

is the mean eigenvalue for

{(I + γ^{2} D^{T} D)}^{- 1}

, which can be given as B3

k^{2} = \sum_{i = 0}^{n} {[1 + γ^{2} (2 - 2 cos⁡ (\frac{i π}{n}))]}^{- 1} / n,

where

n

is the number of rows (columns) of pixels within the S2 and L8 spatial extents at the MODIS spatial grid

G_{c}

, as the term is applied in the row (column) direction.

From the results, for a cross-validation study (reported in Appendix ), we use a $γ$ of 5 for S2 and L8 AOT, a value of 5 for S2 TCWV, and 0.1 for L8 TCWV.

A posteriori state uncertainty $C_{x}$

Under the assumption that the log posterior is Gaussian, the mean of the a posteriori state $X$ is given by a value of $x$ that minimises $J = J_{obs} + J_{prior}$ , and the posterior covariance is approximately given by the inverse of the Hessian at the minimum point : B4 $C_{x} = {(H^{' ⊤} C_{\hat{y}}^{- 1} H^{'} + C_{x_{b}}^{- 1})}^{- 1},$ where $H^{'}$ represents the partial derivatives of $\hat{y}$ with respect to $x$ . The diagonal of $C_{x}$ is extracted as the posterior uncertainty for $X$ .

B4 Uncertainty in surface BRF

Define an augmented state vector $X_{a}$ on grid $G_{m}$ containing the atmospheric state variables in $X$ (AOT and TCWV) and observations $Y$ . Let $Δ_{i}$ be the partial derivative of the inverse observation operator $H^{- 1} (x_{a})$ given in Eq. () with respect to variables $i$ in $X_{a}$ , $i \in (AOT, TCWV, y)$ :

B5 $Δ_{i} = \frac{\partial H^{- 1} (X_{a})}{\partial i} .$ Define $Δ_{p_{j}}$ as the partial derivative of the Lambertian coupling term $p_{j}$ from Eq. () for $j \in {a, b, c}$ with respect to augmented state variable $i$ : B6 $Δ_{p_{j}} = \frac{\partial p_{j}}{\partial i},$ then B7 $Δ_{H_{i}^{- 1}} = - 1 [\frac{- y Δ_{p_{a}} + Δ_{p_{c}} (p_{b} - y p_{a})^{2} + Δ_{p_{b}}}{(y p_{a} p_{c} - p_{b} + 1)^{2}}] .$ The partial derivative of y with respect to TOA reflectance $y$ is B8 $Δ_{y} = \frac{p_{a}}{[p_{c} (y p_{a} - p_{b}) + 1]^{2}} .$ So the uncertainty of estimated surface reflectance $σ_{r}^{2}$ is B9 $σ_{r}^{2} = Δ_{H_{aot}^{- 1}}^{2} σ_{aot}^{2} + Δ_{H_{tcwv}^{- 1}}^{2} σ_{tcwv}^{2} + Δ_{y}^{2} σ_{y}^{2} .$

Appendix C Implementation details

C1 Cloud, shadow, snow, and large water body masking

The emphasis in this first version of SIAC is on mapping the land surface, so the L1C TOA S2 and L8 data need to have masks applied for areas of cloud, shadow, snow, and large water bodies. We describe the approach to this in this section.

Recall from Eq. () that estimates of $R_{b}$ are used to provide sample estimates of $\hat{Y}$ , which are used to match against aggregated observations $Y_{c}$ to estimate $X$ in the observational cost function in Eq. (). We need to avoid using samples in $R_{b}$ and $Y$ that are likely to introduce biases in this. An obvious filtering is to avoid pixels that are influenced by extraneous factors such as cloud or cloud shadow. To do this, we calculate masks of these from the TOA reflectance data $Y$ on the original data grid $G_{m}$ and reject samples from $Y_{c}$ that would be impacted by these.

For the cloud and cloud shadow mask, we trained a U-NET convolutional neural network (CNNs) with TensorFlow following , with training datasets including the Spatial Procedures for Automated Removal of Cloud and Shadow (SPARCS) dataset , L8 Biome data ;,the Sentinel-2 Cloud Mask Catalogue , and Sentinel-2 reference cloud masks . The overall accuracy obtained is around 0.9, which is similar to that of . A probability of more than 30 % is used for the flag cloud, while a 50 % threshold is used to for cloud shadow. Finally, a $3 \times 3$ kernel is repeated 10 times to dilate the cloud and shadow mask and to avoid cloud and shadow edge contamination to try to minimise contamination effects.

We also need to consider that some estimates of $R_{b}$ from the MODIS data may be unreliable. These are likely to be (i) pixels that are poorly constrained in the MODIS product, and (ii) those undergoing rapid changes during the integration period used in the MODIS product. The first of these should largely have a low QA value if the MODIS sampling data are poor and should be down-weighted, as specified in Appendix . However, we also note that we do not expect the BRDF kernels used in the product to be able to deal well with water bodies (they are designed for vegetation canopies) , so one should be wary of water pixel predictions. The second will largely be associated with sudden events that have a large impact on reflectance, such as snow fall or melting.

In this version of SIAC then, large water bodies are masked out using the ESA global 150 m water products , and snow pixels are masked by $(NDSI > 0.15) & (NIR > 0.11) & (GREEN > 0.1)$ from the TOA observations , where NDSI is the Normalised Difference Snow Index bands 3 and 11 for S2 and 3 and 6 for L8. Here, NIR refers to band 8 for S2 and 4 for L8. GREEN refers to band 3 for both S2 and L8.

To avoid erroneous spatial features over large water bodies introduced by the excessive extrapolation from distant land pixels, a conservative estimate of atmospheric parameters over water bodies is used, this being the median value of the retrieval from the rest of the image. This means that SIAC retrievals over water bodies may not be as accurate as over the land surface.

In the retrieval process, if a MODIS pixel on grid $G_{c}$ contains a medium resolution pixel of grid $G_{m}$ that is identified as cloud, shadow, snow, and/or large water bodies in the S2 or L8 image, the MODIS pixel will be discarded from the observational constraint. This is quite a conservative approach, and the current version of SIAC may have some limitations for or near water bodies and for snow-covered areas. But because of the Bayesian design of the algorithm, even if there is no information available from the observational data, a mostly good estimate of the atmospheric parameters is available from the a priori constraint.

Filtering of MCD43-simulated reflectance $r_{b}$

The previous masking removes most of the pixels likely to be unreliable for estimates of $R_{b}$ , but we apply a further filtering step to ensure the robustness of the samples used in the observational constraint. This is based on comparing modelled and measured BRF using a rough initial estimation of AOT and a priori values of water vapour.

This estimate is obtained by deriving a coarse per-pixel (on the $G_{c}$ grid of the MODIS data) estimate of AOT, then regularising this with a robust smoother to average any noise and to remove the influence of any outliers.

A coarse look-up table in AOT (AOT 0–2.5 in steps of 0.05) is used to compute $J_{obs} + J_{prior}$ for every pixel on grid $G_{c}$ that passes the initial masking. AOT values corresponding to the minimum cost value for every pixel are used to form an approximate AOT per-pixel map. This is then filtered with a robust spatial smoother , with a smoothness value $s$ of 20 to provide a smooth initial estimation of AOT. The robust metric used eliminates the influence of outliers in the AOT field that may arise from unreliable values of $R_{b}$ or other effects.

The smooth AOT estimate then forms part of a rough estimate of atmospheric state $X$ along with other information from $X_{a_{c}}$ . This is used to generate a first-pass atmospheric correction of observations on the grid $G_{c}$ , $Y_{c}$ , to BOA reflectance $R_{c}$ . This is compared with the MODIS estimate $R_{b}$ to derive a residual for each waveband. If the absolute value of this is smaller than 0.02 for visible bands and smaller than 0.03 for NIR-SWIR bands, the pixel will be used in further processing, otherwise it is masked and effectively discarded from consideration.

Although we expect the multiple constraints used in SIAC to provide some degree of robustness to any biases in $R_{b}$ for occasional pixels, it is found to be best to filter out gross errors using this approach. The multiple constraints in any case mean that we do not have to provide measures of $Y_{c}$ and $\hat{Y}$ for each pixel in $G_{c}$ , and only a sample is required.

Appendix D Spectral mapping

D1 Spectral libraries

Spectral correlation over most natural surfaces suggests that transformations between different spectral domains are possible. In , a set of linear transformations is used to transform from narrowbands (sensor) to broadbands. The linear relationship is conditional to the spectral library used, but the actual variation of land surface reflectance is much wider than the variation given by spectral libraries, so there is a need to include realistic spectra outside the spectral libraries. Improving on the linear transformation, a localised spectral interpolation based on $K$ -nearest neighbours is implemented in SIAC.

Figure D1

Example of spectra selection and comparison between the MCD43-simulated reflectance and mean spectra-simulated reflectance.

[Figure omitted. See PDF]

The dataset used to define these transformations is derived from merging multiple spectral libraries covering the target spectral range, re-sampled to 1 nm resolution. To emphasise the importance of vegetation and soils, simulated vegetation spectra using the PROSAIL model and a soil database were also used. In total, more than 6 500 spectra were used. The libraries used are shown in Table .

Table D1

List of spectral libraries used to define spectral transformations.

Library	Target type	Notes
USGS v7	Multiple	N/A
ASTER	Natural & manmade materials	N/A
KLUM	Urban	N/A
ICRAF-ISRIC	Soils	Only first 1 000 samples used

N/A: not applicable.

Figure D2

S2 (a) and L8 (b) reflectance predicted by MODIS Aqua-selected spectra from the spectral library. The uncertainty is shown with 1 $σ$ range. The original values are from the direct simulated reflectance by applying S2 and L8 RSR to the individual spectrum in the spectra library.

[Figure omitted. See PDF]

Given that the MODIS land bands are designed to capture most of the land surface properties, spectra selected with MODIS bands should be able to predict other optical sensors' reflectances with similar bands. Although there is a large number of spectra in the spectra library introduced in Table , it is still not sufficient to cover the vast variations of reflectance seen over the land surface. To deal with this limitation, the spectral searching procedure is split into two parts: visible and infrared spectral region. The spectral selection and comparison between the MODIS reflectance and mean spectra-simulated MODIS reflectance are shown in Fig. .

A set of five closest spectra from the spectral library are used to compute the weighted mean using an inverse distance weighting. This added number of samples introduces robustness to errors in both the MODIS surface reflectance input and the spectral library. Other numbers of selected spectra were tested, but it shows that spectra vastly different from the target spectrum are likely to be included if more than 10 spectra are used. Once the mean spectrum is calculated, it is then convolved with the target sensor-relative spectral responses (RSRs) to obtain the simulated surface reflectance at $Λ_{map} (r_{m})$ . The difference between MODIS-measured reflectance and the mean predicted reflectance in the MODIS bands is used as an indicator of uncertainty.

To test the effectiveness of the proposed spectral mapping method, the MODIS, S2, and L8 reflectances are simulated with individual spectra from the spectral library. Then the MODIS-simulated reflectance is used to get $K$ -nearest ( $K = 6$ in this case) neighbours spectra from the spectral library and to discard the spectra used for the computation of MODIS input reflectance, so the remaining spectra are independent from the input reflectance. The mean spectra are computed with weights computed from 1 divided by the standard Euclidean distance, and the mean spectra-simulated reflectance for MODIS, S2, and L8 are computed by convolving with their RSR function. The difference between mean spectra-simulated reflectance to the reflectance simulated with an individual spectrum in the spectra library for MODIS is used as a measure of standard error of the mean spectra-simulated reflectance. The result is shown in Fig. .

The spectral mapping results for S2 and L8 show that, over most of the cases, our spectral mapping can simulate both sensors well with high correlation (over 0.99 for all the bands), low RMSE (lower than 0.03 for all bands and 0.015 for the first 5 bands), and no bias introduced. The SWIR band around 2 200 nm shows the largest dispersion, which is attributed to the large variation in reflectance in this spectral region and the large difference in the band pass functions between MODIS and S2 and L8, as shown in Fig. .

The standard error estimated from MODIS reflectance provides a reasonably good estimation of the mean spectral estimated reflectance for S2 and L8, since a large discrepancy between simulations and observations implies that the input reflectance is not well represented in the spectral library.

D2 Results of spectral mapping

Results of the spectral mapping approach are shown in Fig. over four representative land cover types (vegetation, desert, urban, and snow). In these plots, the input reflectance is derived from the MODIS MCD43 BRDF products using Eq. (). The predicted reflectance is in line with the observations, with most of the observations falling within the predictive uncertainty envelope. In the DA system, poor matches to the spectral database will have large uncertainty, and those pixels will have a smaller impact on the inference.

Appendix E Spatial mapping

Due to the large differences in the spatial resolution between the MODIS (500 m) and S2 and L8 (10, 20, and/or 30 m), the measured reflectance values from them cannot be directly compared. We model the MODIS data effective PSF and use this to convolve the high resolution data in order to make it comparable with the MODIS products. Ideally, the MODIS cross-track direction PSF is triangular and rectangular in along-track direction as a result of optical ${PSF}_{opt}$ , detector ${PSF}_{\det}$ , image motion ${PSF}_{im}$ , and electronics ${PSF}_{el}$ :

E1 ${PSF}_{net} (x, y) = {PSF}_{opt} * {PSF}_{\det} * {PSF}_{im} * {PSF}_{el},$ where $*$ is a spatial convolution operator. In the MODIS MCD43 BRDF product, a number of individual observations are inverted together within a temporal window of 16 d. Each of the individual observations has a PSF described by Eq. (), but the combined product will have an effective PSF resulting from the combination of individual measurements with different angles and scanning geometries. In line with , , , and , we assume that the effective or equivalent PSF (ePSF) for the combined product is given by a two-dimensional Gaussian function in along-track ( $x$ direction) and across-track ( $y$ direction) directions, as shown in Fig. ; E2 $ePSF (x, y) = exp⁡ (- \frac{(x + Δ_{x})^{2}}{2 σ_{x}^{2}}) \cdot exp⁡ (- \frac{(y + Δ_{y})^{2}}{2 σ_{y}^{2}}),$ where $σ_{x}$ and $σ_{y}$ are the standard deviation of the Gaussian function expressed over the satellite image pixels unit; $Δ_{x}$ and $Δ_{y}$ represent the shifts in along and cross directions. According to and , there is also an angular deviation between the satellite orbit and the true north, which is given by E3 $tan⁡ θ = \frac{cos⁡ i - (1 / κ) {cos⁡}^{2} φ}{\sqrt{{cos⁡}^{2} φ - {cos⁡}^{2} i}},$ where $θ$ is the angular deviation, $i$ is the inclination angle, $φ$ is the latitude, and $κ$ is the daily recurrence frequency. Then, the rotation matrix ( $R_{θ}$ ) is E4 $R_{θ} = [\begin{array}{cc} cos⁡ θ & - sin⁡ θ \\ sin⁡ θ & cos⁡ θ \end{array}] .$ We now have an expression that will permit the comparison of the high-resolution TOA reflectance with the coarse-resolution predictions of surface reflectance obtained from the MCD43 product propagated through the atmosphere. This step is fundamental in defining how the proposed method solves for atmospheric composition, and the equality requires that the high-resolution data (S2 and/or L8) are convolved with the ePSF. Given the disparity of spatial resolutions, the S2 and L8 PSF effect is averaged out during the aggregation and has been neglected in the modelling process.

Figure E1

A typical MODIS ePSF on the spatial resolution of S2, i.e. a unit of 1 represents 10 m on the $X$ – $Y$ plane, and it follows the same notations as in Eqs. ()–(). The shaded areas on the two sides represent the Gaussian functions used for $x$ and $y$ directions, with 1 $σ$ shown with vertical dashed lines.

[Figure omitted. See PDF]

The spatial convolution is calculated in the frequency domain for efficiency.

Results of spatial mapping (PSF)

An effective point spread function (ePSF) of the MODIS MCD43 BRDF product is simulated with a two-dimensional Gaussian function, with $σ_{x}$ and $σ_{y}$ controlling the widths of Gaussian in along-track and across-track directions, respectively. Shifts in those two directions are also accounted for with estimated $Δ_{x}$ and $Δ_{y}$ to deal with the geolocation errors in the MCD43 products. Both of the $σ$ and $Δ$ are in the units of pixels for the target sensor, i.e. S2 or L8.

Figure E2

Comparison between MCD43-simulated surface reflectance after spectral mapping $r_{c} (Λ (m))$ with S2 TOA reflectance $\hat{y}$ and S2 TOA reflectance after spatial mapping $\hat{y} (G_{c})$ on 13 April 2016 S2 50SLH tile. Top row is the $r_{c} (Λ (m))$ and S2 TOA reflectance $\hat{y}$ , with the scatter plot between the corresponding pixels (pixel's centre geolocation) on the right side. Bottom row is the $r_{c} (Λ (m))$ , with $\hat{y} (G_{c})$ and their scatter plot.

[Figure omitted. See PDF]

We show an example of the PSF modelling in Fig. , where we compare the MCD43-predicted reflectance after spectral mapping $r_{c} (Λ (m))$ with S2 TOA reflectance $\hat{y}$ and $\hat{y} (G_{c})$ after the spatial mapping at a wavelength around 2 200 nm. $r_{c} (Λ (m))$ and $\hat{y} (G_{c})$ show broadly similar coarse patterns, but higher spatial details exist $\hat{y}$ . Per-pixel level comparison of them shows that $r_{c} (Λ (m))$ and $\hat{y} (G_{c})$ have a much stronger correlation, a slope very close to unity, and a bias close to 0, indicating that modelling the spatial mismatch is a required step in combining these two datasets.

Figure E3

Per-band comparison between the MCD43-simulated surface reflectance $r_{c} (Λ (m))$ and $\hat{y} (G_{c})$ at six MODIS bands.

[Figure omitted. See PDF]

We have assumed that, for a given scene, a single Gaussian PSF is required, in line with the findings of , and we assume further that we can use the PSF derived for the 2 200 nm band for all other bands. At this wavelength, the atmosphere is assumed to be spatially transparent with respect to AOT, and under the spectral similarity assumption of the BRF, the results should be similar for other bands . We illustrate the effect of atmospheric scattering by comparing the ePSF-convolved S2 TOA reflectance $\hat{y} (G_{c})$ with the MCD43-derived BOA reflectance predictions after spectral mapping $r_{c} (Λ (m))$ in Fig. . The pattern shows that there are consistent higher atmospheric effects for the shorter wavelengths that result in both a clear bias due to the important effect of aerosols in the intrinsic path radiance and a slope different to unity (due to the effect of aerosols on upward and downward atmospheric transmission and spherical albedo). For the longer wavelength bands after the NIR plateau, aerosol effects are less important, and the slope is close to unity and the bias close to 0, with the correlation generally being very high, and this also proves the validity of using PSF derived for the 2 200 nm band for all other bands.

Figure E4

The correlation map between $r_{c} (Λ (m))$ with $\hat{y} (G_{c})$ , with different $σ_{x}$ , $σ_{y}$ , $Δ_{x}$ , and $Δ_{y}$ values, in which the red dots represent the largest correlation values' positions.

[Figure omitted. See PDF]

Figure E5

The density scatter plots of solved PSF parameters, $σ$ (a, c) and $Δ$ (b, d) in $x$ and $y$ direction for L8 (a, b) and S2 (c, d), where the red markers stand for the median of those parameters. The units of the $x$ and $y$ axis are the number of pixels.

[Figure omitted. See PDF]

After solving for the ePSF parameters over a large number of S2 and L8 scenes globally, we note that some simplifications in the processing are possible. First, we see that the cost function is very flat around the minimum. Figure shows an example of this: for both S2 and L8, the region around the maximum correlation point has similar values (in excess of 0.98) to the maximum, suggesting that the cost function has limited sensitivities to the ePSF widths when it is close to some optimal values. A second important point is that the width of the ePSF over a large number of scenes tends to be well defined (see Fig. for an example of this). For S2, the median of $σ_{x}$ is around 26 (260 metres), and the median of $σ_{y}$ is around 34 (340 m). For L8, these numbers are similar; only, in this case, the number of pixels is 3 times larger to account for the higher spatial resolution of S2. The shift, however, appears to be more scene dependent and also shows a larger influence on the correlation cost function.

The points made above suggest that a fixed value of 260 m for $σ_{x}$ and 340 m for $σ_{y}$ may be used for all images, but the effect of the pixel shift needs to be inferred on a scene-by-scene basis. We have not said much of rotation angle $θ$ (introduced in Eq. ). In all cases we studied, its value is small (around $\pm 8$ $^{\circ}$ ), and because of the large size of the ePSF, its effect can be effectively compensated by $Δ_{x, y}$ . It is important to note that the aim here is to provide an estimate of an effective PSF that allows a comparison between estimates of coarse-resolution surface reflectance propagated to TOA and measurements of TOA reflectance convolved with the ePSF. To further reduce the computational burden of calculating the ePSF parameters, we have taken the median of $σ_{x}$ and $σ_{y}$ as a reference, assumed the rotation angle of ePSF to be 0 $^{\circ}$ , and only solved for $Δ_{x, y}$ on a scene-by-scene basis.

Appendix F Results of atmospheric parameter inversion and atmospheric correction

In this section, we illustrate cases of SIAC being used to infer atmospheric composition parameters. Due to S2 and L8 having bands outside of the strong O $_{3}$ absorption region, TCO $_{3}$ is taken from CAMS, and only AOT and TCWV are inferred from the data. Figure shows a demonstration of the procedure. Here, an image captured over the North China Plain (tile 50SMH) by S2 on 10 February 2016 has been processed. The CAMS prior mean AOT in Fig. is around 1 and is approximately constant over the scene. The TOA reflectance for band 1 of the S2 sensor (shown in log-transformed units to enhance the dynamic range) shows two clear high-aerosol stripes. The true-colour TOA image shows a very strong atmospheric effect, consistent with the expectation of high AOT. The retrieved AOT (bottom left panel in Fig. ) shows a marked departure from the prior value. Two very high aerosol stripes are clearly resolved. The result of applying the atmospheric correction results in an important reduction of the atmospheric effects is particularly evident from the BOA true-colour composite (bottom right panel of Fig. ).

Figure F1

The prior and posterior AOT over S2 50SMH on 10 February 2016 and their shared colour bar are in the first column. Figures in the second column are band 1 TOA and surface reflectance, while the TOA and BOA RGB images are shown in the third column for the same tile over the same time.

[Figure omitted. See PDF]

Figure F2

Example of retrieval on S2 data over (a) Amazon ATTO Tower site (S2 Tile 21MTT, 27 June 2019, top row) and (b) UACJ UNAM OR site (S2 tile 13SCR, 14 January 2019, bottom row). Top row of each panel, left to right: AOT prior mean from CAMS, TCWV prior mean from CAMS, blue band TOA reflectance, TOA RGB composite. Bottom row of each panel, left to right: a posteriori AOT mean, a posteriori TCWV mean, blue band BOA reflectance, BOA RGB composite.

[Figure omitted. See PDF]

Figure F3

Example of retrieval on S2 data over (a) XiangHe site (S2 tile 50TMK, 13 May 2019, top row) and (b) Evora site (S2 tile 29SNC, 29 July 2018). Top row of each panel, left to right: AOT prior mean from CAMS, TCWV prior mean from CAMS, blue band TOA reflectance, TOA RGB composite. Bottom row of each panel, left to right: a posteriori AOT mean, posterior TCWV mean, blue band BOA reflectance, BOA RGB composite.

[Figure omitted. See PDF]

Some artefacts are also apparent. In the bottom right corner of the scene, the AOT map reverts to the prior value from CAMS, which results in a poorer correction of the atmospheric effects. This is caused by lack of high quality MCD43 retrievals in this area at this time, which results in the AOT estimate being strongly driven by the prior from CAMS as well as some spatial diffusive effects from areas where the algorithm performs well. A second artefact is some visible stripes (visible in the middle top and bottom panels). These are caused by the combinations of observations from different detectors and have no relationship with the atmospheric correction method. It is also worth noting that, when solving for the ePSF parameters for this scene, the optimal linear correlation was only around 0.55, but clearly, the system is still able to produce reasonable results in this challenging environment.

We note that the scene shown in Fig. is a challenging one: at this time of the year, most of the soil is bare, and aerosol loading is very high. Methods that rely on dark dense vegetation (DDV) exploit an empirical relationship between the reflectance around 2 200 nm and the blue and red band reflectance. If no vegetation patches are available in the scene (or if their spatial sampling is limited and aerosol spatial dynamics are high), this leads to an inability to obtain a reliable AOT estimate. The Deep Blue AOT algorithm, used in MODIS aerosol retrieval, has been developed to overcome the shortcoming of the DDV method over bright land surfaces, and the combined products deliver global coverage . But those two methods have different assumptions and a resulting inconsistency in the merged product, especially over the transition regions which have comparatively low vegetation cover .

As a further illustration of the approach on Sentinel 2 data, we show similar visualisations of AOT and TCWV priors, the associated posteriors, TOA and BOA blue band reflectance, and TOA and BOA true-colour composites for a number of different sites spanning the globe in Fig. (Amazon ATTO Tower and UACJ UNAM OR), Fig. (XiangHe and Evora). While the effect of the atmospheric correction is evident in all these cases, it is important to note that the prior mean is significantly updated when the posterior mean of both AOT and TCWV are calculated. Spatial patterns are clearly visible in all the examples for both parameters, and in many cases, the average value from CAMS changes substantially when the proposed method is deployed. Since the retrieval are made with land pixels only, large water body pixels are filled with median aerosol values from all the land pixel retrievals, and the edge between land and water is expected.

Appendix G Spatial smoothness parameter estimation

To estimate atmospheric parameters, an estimate of surface reflectance is needed. This estimate is different from the actual surface reflectance and is likely to contain spatial artefacts, which will result in an unrealistic and noisy estimation of atmospheric parameters if an independent pixel-level retrieval strategy is used. To counter this, most practical approaches average the estimation of surface reflectance over a spatial window of fixed size. Within this window or block, atmospheric composition is assumed to be constant and inferred to reduce the noise in the estimated surface reflectance. This is pragmatic in many ways and compartmentalises the processing requirements to blocks of that window size. However, the block structure imposed can introduce spurious artefacts if the spatial gradient of atmospheric parameters is large and fails to estimate atmospheric parameters when no valid targets are found within the specified box.

Figure G1

$γ$ cross validation for (a) S2 AOT, (b) S2 TCWV, (c) L8 AOT, and (d) L8 TCWV. The $x$ axis in the subplots is $γ$ values ranging from $1 \times 10^{- 5}$ (essentially, no smoothness constraint) to 1 000 (very high smoothness). The $y$ axis shows $J_{obs}$ normalised by the $J_{obs}$ with a smallest $γ$ value of $1 \times 10^{- 5}$ . The red dots show mean normalised $J_{obs}$ over a set of 20 S2 and L8 tiles. The blue fill shows $\pm$ 1 standard deviation of normalised $J_{obs}$ over the 20 samples.

[Figure omitted. See PDF]

Within SIAC, the broad-scale (40 km) variations of atmospheric parameters are estimated from CAMS. But there are often finer-scale features that may impact our interpretation of surface reflectance, and we wish to be able to resolve these. To this end, we assume an effective resolution of $G_{c}$ (500 m) for atmospheric parameters, with the smoothness constraint expressed in Eq. () and controlled by $γ$ . This spatial constraint allows for gradual changes in atmospheric parameters $X$ at the spatial resolution of $G_{c}$ over the S2 and L8 image spatial extents. The values in $X$ that we solve for in SIAC are controlled by a weighting of the location and information content of samples in $Y$ and the assumed uncertainty of the a priori constraint that is, in essence, blurred over $G_{c}$ . The degree of smoothing imposed, and so the resultant correlation length of the derived atmospheric parameters, is mainly controlled by $γ$ . Its squared inverse, $1 / γ^{2}$ , a measure of roughness, can also be phrased as the expectation that there is no change at a scale of 500 m . Despite this physical understanding of $γ$ , it is not straightforward to arrive at a reasonable global estimate of this parameter. We know that too low a value means that $X$ may become oversensitive to the sample location of the observational constraints and fail to exploit the spatial correlations we know to exist in atmospheric parameters. Too high a value will overly smooth $X$ , and it will not be responsive to actual variations over the scene. Fortunately, we know from previous studies (e.g. ) that results should not be very sensitive to the precise value used and that there should be quite a wide range of tolerance. In that case, we should select the lowest tolerable value of $γ$ . In this context, we can use a cross validation approach over a representative set of scenes to gauge a useful global value. What we would expect to assess from such an experiment is the range of tolerable $γ$ values we might use, from which we can select the lowest tolerable value to use within SIAC. In ideal circumstances, we would see a broad but well-defined minimum in cross-validation cost. An absence of that would suggest a lack of sensitivity of the results to the selection of $γ$ . Ideally, we would use a consistent $γ$ value across different sensors, so that we could have the same assumed degree of smoothing.

We performed a cross validation study using 20 scenes in L8 and in S2, selected to cover a good dynamic range in AOT (0–2). We sampled over $γ$ values from $1 \times 10^{- 5}$ (very low) to 1 000 (very high). For each scene and for each $γ$ value we randomly masked half of the samples in $Y_{c}$ , then solved for $X$ using SIAC. With this $X$ , we calculated the observation cost $J_{obs}$ according to Eq. () for the masked samples in $Y_{c}$ . This gives a measure of observation prediction error (relative to uncertainty) for samples that have atmospheric state interpolated by the correlation function for each scene for given $γ$ . For very low $γ$ , the influence of observational samples on the predicted atmospheric state at masked locations will be low, and at these sites, $X$ would effectively be $X_{b}$ . As $γ$ increases, the influence of observational samples is higher, and we expect to get a better-resolved estimate of $X$ , until a point where we are smoothing so much that we lose that resolution. In our experiments, we performed the calculation of $J_{obs}$ over 10 random realisations of the masking to obtain an aggregate discrepancy for each scene, normalise this measure by that of the value for the lowest $γ$ used, then examine the mean and standard deviation of this measure of cross-validation error over the 20 scenes. The results are shown in Fig. .

There is mostly more variation in $J_{obs}$ over the scenes as $γ$ increases, partly due to the normalisation performed, and some sampling noise in the average cost (red points and line); but, as expected, we see a broad minimum region for AOT for S2 and L8 and for TCWV for S2. For L8 TCWV, the result is more complex, and we see an increase in cross-validation error for values of $γ$ above 0.1. This is because there is very limited sensitivity in the L8 spectral bands to TCWV. For low $γ$ (up to 0.1 here), we are effectively seeing the cross-validation error resulting from $X_{b}$ only. Beyond this point, we are over-smoothing the information in $X_{b}$ , so a global $γ$ of 0.1 for L8 TCWV is appropriate. For the other conditions, a compromise value of 5 can be considered as providing a consistent value for use over AOT for both sensors as well as S2 TCWV, although lower values of $γ$ for S2 TCWV might also be considered from these results. In the main results section, we use other approaches to verify that these choices are appropriate using other criteria.

Appendix H Optimal TOA uncertainty

We show the $σ$ of normalised error as a function of TOA reflectance uncertainty in Fig. . By changing the TOA reflectance uncertainty to 3 %, except for B12 (4.5 %), we can achieve the optimal uncertainty for the surface reflectance when the normalised error distributions have $σ$ of 1. This is in line with the validation of L1C products, where the TOA reflectance can achieve 3 % (target) of uncertainty most of the time . The result for L8 is similar to S2, but it is worth mentioning that the optimal uncertainties for L8 TOA are less than 3 % for all the bands, with the max being around 2.5 %. This could be attributed to the fact that fewer co-incident L8 and RadCalNet measurement samples are available due to its longer global revisit time.

Figure H1

Surface reflectance normalised error distribution as a function of TOA reflectance uncertainty. The red dot indicates the point by changing the TOA uncertainty to reach the optimal uncertainty ( $σ$ of 1 for the normalised error distribution). B6 for L8 in (b) has $σ$ smaller than 1 for all the time and so cannot determine the optimal $σ$ .

[Figure omitted. See PDF]

Appendix I Improvement over prior correction

We show the correction done by only using the prior values from CAMS predictions in Figs. , , and . For most of the time, the CAMS prior-corrected reflectances match the RadCalNet measurements well, but they are worse than the corrections done with SIAC, shown in Figs. , and , though the differences are small. This is because those RadCalNet sites are located in places with homogeneous landscapes and mostly low aerosol loading . Considering the low sensitivity of surface reflectance to low aerosol loading atmosphere condition, the improvement made on AOT estimation will not improve much on the mean of the estimated surface reflectance. This suggests that the current RadCalNet sites can only be used as a minimum quality check of atmospheric correction or land surface reflectance. More challenging and heterogeneous surface conditions are required for the evaluation of surface reflectance quality. The uncertainty comparison between the prior (CAMS) and posterior (SIAC) corrected surface reflectance in Fig. shows that the improvements on the uncertainty budget are around 10 % for visible to near-infrared bands and 5 % for SWIR bands for TOA uncertainty of 5 % to 2.5 %. The uncertainty improvement result for L8 is much more subtle; this could be attributed to the small sample size. It is also interesting to note that the surface reflectance uncertainty is a function of TOA uncertainty, and the proportion of improvements depends on the relative weight between the atmospheric parameters' uncertainty and the TOA uncertainty.

Figure I1

Comparison between the S2 (a) and L8 (b) TOA reflectance and RadCalNet simulated nadir-view TOA reflectance (top row), and the surface reflectance after correction with CAMS prior against RadCalNet nadir-view surface reflectance (bottom row) at Gobabeb. The blue lines on the left are different spectra measurements from RadCalNet, and the red dots with blue error bars (1 standard deviation) are the TOA or surface and TOA reflectance with uncertainty. The threshold uncertainty is given as black dashed lines in the scatter plots. The regression line is drawn as a red line, and the 1-to-1 reference line is drawn as a thick black dashed line in the middle.

[Figure omitted. See PDF]

Figure I2

Same as Fig. but for La Crau site.

[Figure omitted. See PDF]

Figure I3

Same as Fig. but for Railroad Valley Playa site.

[Figure omitted. See PDF]

Figure I4

Surface reflectance uncertainty comparison between CAMS prior corrections and SIAC corrections. The ratio is calculated as CAMS prior-corrected surface reflectance uncertainty divided by SIAC surface reflectance uncertainty for different values of TOA uncertainty. The red dot indicates the uncertainty ratio when 5 % of the TOA reflectance uncertainty is used, while the red square indicates the uncertainty ratio when 3 % of the TOA reflectance is uncertainty used.

[Figure omitted. See PDF]

Code availability

The code for SIAC is written in Python and is released under the GPLv3 open source licence. The code is available through the Zenodo from 10.5281/zenodo.6651964 .

Data availability

The input data are publicly available through references listed in Table , and the validations results over AERONET and RadCalNet sites are uploaded to Zenodo: 10.5281/zenodo.6652892 .

Author contributions

FY and PEL conceptualised the study. FY developed the code, prepared the datasets, and analysed the results. FY and PEL wrote the manuscript. JLGD suggested experiments and contributed to the drafting of the manuscript. All authors contributed to editing the paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Acknowledgements

The authors would like to acknowledge financial support from the European Union Horizon 2020 research and innovation programme under grant agreement no. 687320 MULTIPLY (MULTIscale SENTINEL land surface information retrieval Platform) and from the European Space Agency (ESA) under contract no. 4000112388/14/I-NB SEOM SY-4Sci Synergy. Feng Yin, Philip E. Lewis, and Jose L. Gómez-Dans were supported by the Natural Environment Research Council's (NERC) National Centre for Earth Observation (NCEO) (project no. 525861). Feng Yin and Philip E. Lewis were supported by Science and Technology Facilities Council (STFC) of UK-Newton Agritech Programme (project no. 533651).

Financial support

This research has been supported by the European Commission, Horizon 2020 (MULTIPLY (grant no. 687320)), the European Space Agency (grant no. 4000112388/14/I-NB SEOM SY-4Sci Synergy), the National Centre for Earth Observation (grant no. 525861), and the Science and Technology Facilities Council (STFC) of UK-Newton Agritech Programme (project no. 533651).

Review statement

This paper was edited by Le Yu and reviewed by three anonymous referees.

Word count: 17094

Show less

© 2022. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

Mitigating the impact of atmospheric effects on optical remote sensing data is critical for monitoring intrinsic land processes and developing Analysis Ready Data (ARD). This work develops an approach to this for the NERC NCEO medium resolution ARD Landsat 8 (L8) and Sentinel 2 (S2) products, called Sensor Invariant Atmospheric Correction (SIAC). The contribution of the work is to phrase and solve that problem within a probabilistic (Bayesian) framework for medium resolution multispectral sensors S2/MSI and L8/OLI and to provide per-pixel uncertainty estimates traceable from assumed top-of-atmosphere (TOA) measurement uncertainty, making progress towards an important aspect of CEOS ARD target requirements.

A set of observational and a priori constraints are developed in SIAC to constrain an estimate of coarse resolution (500 m) aerosol optical thickness (AOT) and total column water vapour (TCWV), along with associated uncertainty. This is then used to estimate the medium resolution (10–60 m) surface reflectance and uncertainty, given an assumed uncertainty of 5 % in TOA reflectance. The coarse resolution a priori constraints used are the MODIS MCD43 BRDF/Albedo product, giving a constraint on 500 m surface reflectance, and the Copernicus Atmosphere Monitoring Service (CAMS) operational forecasts of AOT and TCWV, providing estimates of atmospheric state at core 40 km spatial resolution, with an associated 500 m resolution spatial correlation model. The mapping in spatial scale between medium resolution observations and the coarser resolution constraints is achieved using a calibrated effective point spread function for MCD43. Efficient approximations (emulators) to the outputs of the 6S atmospheric radiative transfer code are used to estimate the state parameters in the atmospheric correction stage.

SIAC is demonstrated for a set of global S2 and L8 images covering AERONET and RadCalNet sites. AOT retrievals show a very high correlation to AERONET estimates (correlation coefficient around 0.86, RMSE of 0.07 for both sensors), although with a small bias in AOT. TCWV is accurately retrieved from both sensors (correlation coefficient over 0.96, RMSE $< 0.32$ g cm $^{- 2}$ ). Comparisons with in situ surface reflectance measurements from the RadCalNet network show that SIAC provides accurate estimates of surface reflectance across the entire spectrum, with RMSE mismatches with the reference data between 0.01 and 0.02 in units of reflectance for both S2 and L8. For near-simultaneous S2 and L8 acquisitions, there is a very tight relationship (correlation coefficient over 0.95 for all common bands) between surface reflectance from both sensors, with negligible biases. Uncertainty estimates are assessed through discrepancy analysis and are found to provide viable estimates for AOT and TCWV. For surface reflectance, they give conservative estimates of uncertainty, suggesting that a lower estimate of TOA reflectance uncertainty might be appropriate.

Details

Title

Bayesian atmospheric correction over land: Sentinel-2/MSI and Landsat 8/OLI

Author

Yin, Feng¹; Lewis, Philip E¹

; Gómez-Dans, Jose L¹

¹ Department of Geography, University College London, Gower Street, London WC1E 6BT, United Kingdom; National Centre for Earth Observation (NCEO), Space Park Leicester, Leicester LE4 5SP, United Kingdom

Pages

7933-7976

Publication year

2022

Publication date

2022

Publisher

Copernicus GmbH

ISSN

1991962X

e-ISSN

19919603

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.5194/gmd-15-7933-2022

ProQuest document ID

2732416828

Bayesian atmospheric correction over land: Sentinel-2/MSI and Landsat 8/OLI

Jump to:

Full text

Abstract

Details

Suggested sources