1 Introduction
The rate of projected future warming in northern Europe is amongst the highest in the world, driven to a large extent by the strong feedback involving snow and ice as the climate warms . As a consequence, the hydrological cycle intensifies , leading to more precipitation as well as more intense extreme events
Coupled atmosphere–ocean general circulation models (GCMs) remain our main source of information for projections of future climate. However, these have spatial resolutions that are too coarse for assessing the often localized impacts of changing precipitation patterns. Regional climate models (RCMs) at a spatial resolution of 10–15
Figure 1
The proposed two-stage weather generator approach for simulations of fine-scale daily precipitation in a future climate.
[Figure omitted. See PDF]
To obtain reference results for the current climate, impact models are commonly applied to high-resolution historical data products such as the Nordic Gridded Climate Dataset (NGCD,
This paper proposes a two-stage weather generator (WG) approach to generate high-dimensional simulations of future climate on a fine-scale grid. Specifically, a stochastic model describing a high-resolution data product in a reference period is combined with climate change projections based on a lower-resolution RCM. Weather generators are commonly used to generate spatially and temporally correlated fields of daily precipitation, with the early work of paving the way for many current approaches. illustrate the use of a generalized linear model (GLM) to describe daily precipitation series at individual sites, using a logistic regression model for the occurrence and a gamma model for the amounts. More recently, propose an approach relying on two latent Gaussian random fields to generate spatially correlated occurrence and intensity, with spatial heterogeneity described through both spatially varying covariates and regression parameters. propose a more computationally efficient approach, where a single latent Gaussian random field is used to describe the spatial correlation in both precipitation occurrence and intensity.
With applications related to hydrological impacts in mind, we consider a case study of nine different catchments in central Norway. The simulation of daily fine-scale precipitation for a catchment requires daily simulations of spatially correlated random fields on a high-resolution grid with roughly 1000–5500 grid cells, depending on the size of the catchment. As the catchments are located in different climatic zones, the stochastic model is estimated independently for each catchment. Spatial heterogeneity within a catchment is introduced via spatially varying covariates for both the occurrence and the intensity models, where the covariate contribution to the precipitation intensity may vary smoothly in space. Additionally, temporal aspects are modelled with seasonal effects and linear trends in the marginal distributions as well as an autoregressive component in the residual process. Climate change information from an RCM output may be added in a transparent manner by updating each component of the weather generator based on estimated climate change in the corresponding component at the coarser RCM scale. propose a similar model for obtaining high-resolution daily mean temperature projections.
As demonstrated in Fig. , the stochastic model generates realizations of future precipitation occurrence and intensity that are correlated in space and time, thus combining four separate components: spatial and temporal correlation structures and marginal models at each grid-cell location for probability of occurrence and intensity. The fine-scale spatial correlation structure is assumed constant over time, while climate change information from the RCM can be used to update the other three components in terms of both overall level as well as seasonal patterns. In addition to being stochastic in nature, the method provides a transparent way to add a climate change signal to the precipitation simulations. The success of the model producing realistic realizations for a future climate depends on two factors: the RCM must be able to correctly capture the climate change signal in the model components and the scale of the fine-scale change must be close enough to that of the RCM scale for climate change effects to be transferrable between the two scales.
The remainder of the paper is organized as follows. Section introduces the datasets and the study area. Details of the two-stage WG approach are given in Sect. together with a description of a reference method based on empirical quantile delta mapping as well as the evaluation methods used to compare the two approaches. The models are estimated based on data from the period 1957–1986 and the estimates are used to simulate data for the period 1987–2005. The results of this analysis and comparison of the various approaches are given in Sect. . The paper then concludes with a brief summary and discussion in Sect. .
2 Data and study areaWe apply our methodology to daily precipitation simulations from two RCMs from the EURO-CORDEX-11 ensemble. One (referred to as RCM1 in the following) combines the COSMO Climate Limited-area Model (CCLM) from the Potsdam Institute for Climate Research with boundary conditions from the CNRM-CM5 Earth system model developed by the French National Centre for Meteorological Research , whereas the other (referred to as RCM2) combines the CCLM model with boundary conditions from the MPI Earth system model developed by the Max Planck Institute for Meteorology . The RCM simulations are conducted over Europe at a spatial resolution of 0.11 or about 12 . In the historical period up to 2005 the outputs are simulated based on recorded emissions and are thus comparable to observed climate.
For observational reference data, we use the seNorge gridded data product version 2018 produced by the Norwegian Meteorological Institute as a subset of the Nordic Gridded Climate Dataset for Norway. The data result from a multi-scale spatial interpolation of measurements from 500 to 700 surface weather observation stations for the period 1957 to the present. The data have a daily temporal resolution and a spatial resolution of 1 over an area covering the Norwegian mainland and an adjacent strip along the Norwegian border. Compared with previous versions of the data product
Grid-cell precipitation is an areal average of sub-grid precipitations and, at a daily timescale, each value in a time series is an accumulation over 24 h. We upscale the fine-scale seNorge values to the coarse-scale RCM grid by calculating the weighted average over all seNorge grid cells within a given RCM grid cell, where the weights equal the proportion of each seNorge cell within the given RCM cell. The precipitation data have unit , which is approximately equivalent to ; we then set all values less than 0.1 to 0 before other processing.
Figure 2
The study area is located in Trøndelag in central Norway, covering the entire Trøndelag and a small part of neighbouring Sweden, and consists of 695 RCM grid cells (rectangular-like polygons) and 109 514 seNorge grid cells (within the polygons, not shown). For stochastic simulations of gridded daily precipitation, nine catchments within Trøndelag with catchment areas from 144 to 3084 (shaded in grey) are used; see also Table .
[Figure omitted. See PDF]
Table 1Characteristics of the nine catchments in Trøndelag, Norway, considered in the stochastic simulations of gridded daily precipitation.
Catchment | ID | Size | Downscaling | Median elevation |
---|---|---|---|---|
() | area () | () | ||
Gaulfoss | A | 3084 | 5479 | 734 |
Aamot | B | 286 | 1112 | 460 |
Krinsvatn | C | 206 | 1108 | 349 |
Oeyungen | D | 245 | 952 | 295 |
Trangen | E | 852 | 2327 | 558 |
Veravatn | F | 176 | 1101 | 514 |
Dillfoss | G | 484 | 1863 | 506 |
Hoeggaas | H | 491 | 1853 | 505 |
Kjeldstad | I | 144 | 940 | 578 |
For the study area, we consider the Trøndelag area in central Norway; see Fig. . The area comprises 695 RCM grid cells and 109 514 seNorge grid cells. The extraction of the climate change signal is performed at the RCM scale, while the fine-scale daily precipitation fields are generated at nine hydrological catchments within the domain; see Fig. and Table . Two of the catchments, Krinsvatn and Oeyungen, have a maritime climate, while the others have a continental climate. For each catchment, the modelling is performed over all seNorge grid cells within the RCM grid cells that cover the catchment, the spatial dimensions of which vary between approximately 940 and 5500 grid cells at 1 resolution. Both historical RCM simulations and seNorge observations are available over the time period 1957–2005. We use the time period 1957–1986 as a training period to estimate model parameters and perform an out-of-sample evaluation over the remaining 19 years 1987–2005. As a result, the training period consists of 10 950 , while the test period comprises 6935 .
Additionally, we use explanatory variables, or covariates, to describe the spatial variations in the statistical characteristics of the daily precipitation distributions. We consider latitude, longitude, and elevation as potential geographic covariates. Elevation information for the seNorge data is obtained from a digital elevation model based on a 100 -resolution terrain model from the Norwegian Mapping Authority . We upscale these data in the same manner as the daily mean precipitation to obtain the elevation at the RCM scale. Note that this is not equal to the orography information provided by EURO-CORDEX.
3 MethodsAs mentioned in the introduction, the aim of this study is to provide realistic projections of daily precipitation at a fine spatial scale over large areas. We apply a parametric weather generator approach that belongs to the class of models proposed by and . For computational feasibility, we apply the approach proposed by , where a discrete-continuous distribution with a single latent field is used to simultaneously model the marginal precipitation occurrence, intensity on wet days, and the space–time dependence. Specifically, we employ a combination of a latent non-stationary Gaussian space–time random field and a gamma distribution with parameters that vary in space and time, with each model component estimated independently. The precipitation process at the RCM scale is described using a similar statistical model, and the climate change signal is added to the fine-scale model by relating the models at the two spatial scales.
3.1 Marginal models for precipitation occurrence and intensity
Denote precipitation occurrence in grid cell at time by if there is precipitation and otherwise, where denotes the number of grid cells and the number of days in a given dataset. We follow and relate the pattern of wet and dry days to a latent Gaussian variable with mean and variance 1. Precipitation intensity (i.e. the amount conditional on ) is assumed to be gamma distributed with a constant shape and scale that varies over space and time, following e.g. and . Formally, we write Precipitation processes often show different features depending on the time of the year, and neighbouring sites tend to share a similar precipitation climate. Such systematic variations are modelled by letting the parameters and of the above distributions change smoothly across time and space. We describe this through three additive components: a spatial effect, a seasonal effect, and a linear climate change effect. In particular, we set where, in their simplest form, the three effect functions are given by for . Here, models the spatially varying baseline of the parameters, with being latitude, longitude, and mean elevation of grid cell . Seasonal changes are described by , with returning the calendar day of time point and capturing the potential linear trend, with returning the calendar year normalized so that describes a decadal trend in the data. This modelling framework corresponds to a GLM framework.
While the linear spatial effect function in Eq. () can capture the spatial variations in the occurrence at both spatial scales as well as the intensity at the RCM scale, we find that this model is too simple to capture the spatial variations in the intensity across a catchment at the finer 1 1 scale. At the finer scale, we thus expand Eq. () so that the covariate contribution varies smoothly in space , expanding the model to a generalized additive model (GAM; ). That is, we set for the two largest catchments A (Gaulfoss) and E (Trangen) where and are smooth functions, and the slightly simpler for the other catchments. This substantially improved the in-sample fit for all the catchments. Alternatively, propose spatially varying regression parameters.
To estimate the parameter of the latent Gaussian model specified in Eqs. () and ()–(), we transform the data to a binary dataset with if the observed value fulfils and if . We then estimate using probit regression with and , where denotes the cumulative distribution function (CDF) of the standard normal distribution. The estimation is performed using the function
3.2 Space–time correlation structure
The marginal models for precipitation occurrence and intensity defined in the previous section describe changes in the marginal distributional properties across space and time. For realistic simulations of daily precipitation fields, we additionally need to account for space–time correlations of individual realizations. Here, for computational feasibility given the dimensionality of our data, we follow the approach proposed by and define a single latent Gaussian process that drives the correlation in both occurrence and intensity. We further assume that spatial and temporal correlations can be estimated separately, with the parameters of each component allowed to vary over the year to account for potential seasonality in the correlation structure. In practice, this is performed by obtaining independent estimates for each calendar month and, subsequently, fitting a smooth function of the type given in Eq. () to the monthly estimates to obtain daily smoothly varying estimates. Furthermore, the correlation models are estimated independently for each catchment to account for differences between the different climatic zones.
The estimation of the correlation structure within frameworks with underlying assumptions of normality is complicated by the shape of the precipitation distribution, with its point mass in zero and the skewness of the positive part. To account for this, propose to estimate the Kendall rank correlation coefficient from the data and, subsequently, transform into the Pearson correlation by the identity . For the spatial correlation structure, we use this approach to estimate the correlation between all pairs of grid cells within a catchment using the
The Matérn correlation between two grid cells with Euclidean distance at time point is given by
9 where is the gamma function and is the modified Bessel function of the second kind. The nugget , partial sill , and range are assumed to vary over the year, while is assumed constant. An optimal value of is chosen such that the sum of squared errors of the fitted models over all 12 months is minimized. Then, a Matérn correlation function with a fixed value of is fitted again for each month to obtain monthly estimates of and . Here, we assume , so that the resulting matrix is a correlation matrix.
In the literature, spatial dependencies in intensity and occurrence are commonly modelled separately assuming two latent Gaussian fields, one driving the occurrence and the other the intensity. For correlations in intensity, parametric models include the exponential and power exponential models as well as the simple strategy of having constant intersite correlation . Correlations in occurrence are more challenging to model, as appropriate transformation from binary occurrence to marginal normality is less straightforward. illustrates an empirical approach to find a link between the unobservable correlation (from a Gaussian model) and observable but unknown correlation (from a bivariate binary model) for each pair of sites. use an exponential covariance function in a similar approach. propose to model the number of wet sites by a beta-binomial model and then utilize empirical conditional probabilities to allocate the positions of wet sites.
Following , we introduce the short-term autocorrelation through temporal dependence in the underlying spatial random field. Here, temporal correlation is assumed to follow an autoregressive (AR) process of order 1. At each grid cell, Kendall's is calculated for each month; the monthly value for the entire catchment is then taken as the median value over all grid cells in the catchment. Subsequently, a smooth function of the form in Eq. () is fitted to the 12 monthly values to obtain smoothly changing daily estimates . Stochastic simulation models for precipitation commonly assume an autocorrelation of order 1
To summarize, denote by the vector of random noise defined in Eq. () in all the grid cells at time . The random noise is assumed to follow a space–time correlation structure of the form where is a Matérn correlation matrix and the correlation coefficient is obtained as described above.
3.3 Relating models from two spatial scalesMarginal models outlined in Sect. are fitted to the coarser RCM-scale data for both the training and test periods, where the significance of coefficients is tested at the 0.05 level. In particular, for data from the test period, we incorporate the training-period estimates of the coefficients into the three model components in the following manner: (1) the baseline is fixed to be the sum of its estimated value and the increment due to the estimated linear trend in the training period; (2) for the seasonality and the potential linear trend , we use the training-period coefficients as a reference and effectively estimate and test the significance of the changes in these terms. In
Figure 3
seNorge estimates of the seasonality component in Eq. () in the training period 1957–1986 for all catchments at both spatial scales. Top: the estimated seasonality in the mean of the latent Gaussian field estimated by probit regression. Bottom: the estimated seasonality in the mean of the gamma distribution estimated within a GLM/GAM framework.
[Figure omitted. See PDF]
The models outlined in Sects. and are fitted to the finer seNorge scale data only for the training period. In order to obtain model parameter estimates at the finer scale in the test period, we need to relate the models at the two scales so that model changes between the training and test periods at the coarser scale can be used to infer model changes at the finer scale. Specifically, we may update the mean of the latent field in Eq. (), the parameters of the gamma distribution and in Eq. (), and the autocorrelation coefficient in Eq. (), while the structure of the spatial correlation matrix in Eq. () is assumed constant for the aforementioned reason.
For and , we may update each of the terms in Eqs. () and (), respectively. Here, the seasonality Eq. () and the potential linear trend component Eq. () of (and similar for ) are adjusted so that the average adjustment over all the time points in the test period fulfils where te indicates the test period, tr indicates the training period, and is a fine-scale grid cell located within a coarse-scale grid cell .
Table 2The estimated trend coefficient in Eq. () for each catchment based on data from 1957 to 1986 for in the probit model (left) and in the gamma model (right). Estimates are given for both 1 seNorge data and seNorge data upscaled to 12 resolution.
seNorge | seNorge | seNorge | seNorge | |
---|---|---|---|---|
Catchment | 1 1 | 12 12 | 1 1 | 12 12 |
Gaulfoss | 0.002 | 0.002 | 0.003 | 0.004 |
Aamot | 0.009 | 0.011 | 0.046 | 0.045 |
Krinsvatn | 0.035 | 0.036 | 0.023 | 0.020 |
Oeyungen | 0.020 | 0.019 | 0.045 | 0.047 |
Trangen | 0.001 | 0.000 | 0.038 | 0.039 |
Veravatn | 0.051 | 0.049 | 0.016 | 0.018 |
Dillfoss | 0.022 | 0.020 | 0.026 | 0.025 |
Hoeggaas | 0.010 | 0.010 | 0.024 | 0.023 |
Kjeldstad | 0.003 | 0.003 | 0.013 | 0.013 |
Figure shows the training-period estimates of the seasonality component given in Eq. (). While the seasonality patterns vary substantially across the different catchments as well as between the two model parts, the estimates are very consistent across the two spatial scales. We thus infer seasonality components for the fine scale during the test period by updating the fine-scale components from the training period according to the estimated changes between the training and test periods at the coarse scale. We see the same patterns for the trend coefficient in Eq. (); see Table . The trend coefficient and the correlation coefficient are thus updated in the same manner as the seasonality component. Finally, the shape parameter of the gamma distribution may be updated so that the ratio of the estimates in the training and test periods at the fine scale equals the ratio of the two estimates at the coarser scale.
In Sect. various versions of the method are compared, where individual model components are either updated according to information based on an RCM output or assumed stationary over the entire time period.
3.4 Daily fine-scale precipitation generatorWith the adjustments described above, the marginal models and the space–time Gaussian random field together form a precipitation generator for use on the fine-scale grid in the test period. The parameters of the generator are obtained using seNorge data in the training period and adjusted based on RCM data spanning both the training and test periods. Assume we want to simulate data at all grid-cell locations and time points , a total of locations and time points. Data simulation from the generator consists of the following steps, with the superscript indicating adjusted parameter estimates.
-
For each time point , spatially correlated but temporally independent random vectors of size are drawn from the multivariate Gaussian distribution with mean vector and correlation matrix specified by the Matérn correlation function, i.e. .
-
Temporal correlation is introduced by setting .
-
At grid cell and time , the probability of precipitation is . The precipitation amount is set as if and otherwise.
That is, as mentioned above, the fine-scale spatial correlation structure described by is the single part of the model that is not adjusted based on information from the RCM.
Table 3
Integrated quadratic distance (IQD) values comparing simulated and seNorge distributions over all days in 1987–2005. The results are averaged over all 1 1 grid cells in each catchment. The simple method seNorge uses the daily values over the period 1957–1986 as a prediction, WGs assumes trends estimated for 1957–1986 continue in 1987–2005, WG1.1 and WG2.1 include seasonality and trend estimates from RCM1 and RCM2, respectively, in the gamma model, while for WG1.2 and WG2.2, RCM information is included in both the gamma model and the probit model. Results of the reference method are denoted EQM1 for RCM1 and EQM2 for RCM2. The best method for each catchment is indicated in bold.
Catchment | seNorge | WGs | WG1.1 | WG2.1 | WG1.2 | WG2.2 | EQM1 | EQM2 |
---|---|---|---|---|---|---|---|---|
Gaulfoss | 3.46 | 3.99 | 2.87 | 3.10 | 3.91 | 2.97 | 3.73 | 2.80 |
Aamot | 2.23 | 1.64 | 2.90 | 2.37 | 2.37 | 2.86 | 2.67 | 2.33 |
Krinsvatn | 8.18 | 1.94 | 3.02 | 1.96 | 2.54 | 1.79 | 12.27 | 7.62 |
Oeyungen | 5.52 | 5.94 | 7.14 | 7.46 | 4.90 | 6.44 | 11.20 | 4.91 |
Trangen | 9.37 | 5.56 | 5.12 | 5.50 | 6.12 | 5.49 | 10.72 | 7.84 |
Veravatn | 11.26 | 2.66 | 2.37 | 2.24 | 2.77 | 2.22 | 15.45 | 8.12 |
Dillfoss | 5.17 | 6.59 | 4.73 | 4.27 | 6.97 | 4.23 | 5.58 | 3.05 |
Hoeggaas | 2.65 | 5.84 | 3.54 | 3.21 | 6.15 | 3.17 | 3.21 | 1.46 |
Kjeldstad | 6.96 | 6.71 | 4.32 | 4.00 | 6.51 | 3.96 | 7.38 | 3.50 |
Overall | 4.88 | 4.50 | 3.60 | 3.65 | 4.61 | 3.54 | 5.82 | 3.83 |
To assess the performance of the proposed method, we use the empirical quantile delta mapping method as a reference. The RCM outputs of approximately 12 12 resolution are first re-gridded to the 1 1 seNorge grid using bilinear interpolation, as implemented in the
Table 4
Estimated changes in the trend coefficient in Eq. () between the training period 1957–1986 and the test period 1987–2005, for in the probit model (left) and in the gamma model (right). Estimates for three different data sources at 12 resolution are shown: upscaled seNorge data and two RCM outputs.
Catchment | seNorge | RCM1 | RCM2 | seNorge | RCM1 | RCM2 |
---|---|---|---|---|---|---|
Gaulfoss | 0.026 | 0.022 | 0.000 | 0.034 | 0.040 | 0.025 |
Aamot | 0.000 | 0.018 | 0.013 | 0.081 | 0.034 | 0.013 |
Krinsvatn | 0.014 | 0.044 | 0.000 | 0.043 | 0.037 | 0.014 |
Oeyungen | 0.019 | 0.044 | 0.000 | 0.103 | 0.021 | 0.021 |
Trangen | 0.080 | 0.012 | 0.000 | 0.012 | 0.020 | 0.000 |
Veravatn | 0.093 | 0.029 | 0.000 | 0.039 | 0.018 | 0.023 |
Dillfoss | 0.021 | 0.033 | 0.000 | 0.069 | 0.031 | 0.028 |
Hoeggaas | 0.000 | 0.033 | 0.000 | 0.057 | 0.039 | 0.034 |
Kjeldstad | 0.039 | 0.022 | 0.000 | 0.038 | 0.040 | 0.040 |
Evaluation and comparison of the different approaches are performed by comparing various aspects of the resulting datasets. For an overall ranking of the approaches, we employ the proper evaluation metric integrated quadratic distance (IQD) that compares the full distributions of observed and modelled precipitation . That is, denote by the empirical cumulative distribution function (ECDF) of seNorge precipitation over all time points in the test set at a given grid cell and by the corresponding ECDF from one of the modelling approaches. The distance between and as measured by the IQD is then given by The overall performance of the model at a catchment is then calculated as the average IQD over all grid cells in the catchment area, with a lower value indicating a better performance. The IQD fulfils the property that the true data-generating process is expected to obtain an IQD value of 0 when compared against ECDFs based on data samples of any size. It is thus an appropriate metric for ranking competing methods . For the WG approach, we can easily obtain a precise approximation of the marginal distribution in each grid cell by simulating multiple realizations from each daily distribution. For the EQM approach, however, the marginal distribution in a grid cell is estimated by combining one value for each day in the time period of interest.
For an improved understanding of the behaviour of the models, we further perform several empirical diagnostics. To analyse the marginal distributions at each grid cell, we compare means of daily precipitation, wet-day frequency given by the number of wet days, wet-day intensity as measured by the mean and standard deviation of the precipitation on wet days only, and representation of heavy precipitation as measured by the 95th percentile of positive precipitation. Diagnostics of the temporal data structure are performed by assessing dry–wet temporal patterns and seasonal patterns of temporal autocorrelation coefficients, while empirical functions of Pearson's correlation as a function of distance are used to perform spatial data diagnostics.
4 Results
We perform model inference using data from 1957 to 1986 and infer climate change effects by comparing the coarse-scale RCM data from the two time periods 1957–1986 and 1987–2005. Simulations of fine-scale precipitation for the test set 1987–2005 are then compared against the seNorge data for the test period 1987–2005.
We consider three versions of the WG method, where we include varying degrees of climate change information derived from the RCM data. A stationary version, denoted by WGs, assumes that trends estimated for the seNorge data in the training period continue into the test period, with the remaining model components fixed at their estimates in the training period. That is, no RCM information is used. A version denoted by WG1.1 and WG2.1 for RCM information derived from RCM1 and RCM2, respectively, includes climate change information from the RCM in the seasonality and trend components of the gamma model for precipitation amount on wet days. Finally, a version denoted by WG1.2 and WG2.2 for RCM information derived from RCM1 and RCM2, respectively, includes climate change information from the RCM in the seasonality and trend components of both the gamma model and the probit model for precipitation occurrence. The various WG methods are compared against the reference method in Sect. denoted EQM1 and EQM2 derived from RCM1 and RCM2, respectively, as well as a simple method that uses the empirical distributions of the fine-scale seNorge data in the training period directly as predictions for the corresponding empirical distributions of the fine-scale seNorge data in the test period.
4.1 Marginal performance
We evaluate the marginal performance of the simulations by comparing empirical distributions of simulations and observations over all time points in the test set. Specifically, we compare the empirical distribution of the seNorge data in every 1 1 grid cell to simulations for that same grid cell using the IQD. The average IQD values over all grid cells in each catchments are given in Table . Overall, the WG methods that include RCM information perform better than the stationary approach, which again outperforms using the historical data directly. The WG simulations have better performance than the EQM for both RCM1 and RCM2. The best-performing simulation is WG2.2, where both the gamma model for precipitation amount and the probit model for the wet frequency are updated with climate change information from RCM2. The EQM based on RCM2 performs quite well, while the EQM based on RCM1 yields the worst-performing simulations.
Figure 4
Relative bias in various marginal summary statistics at the 1 1 scale in the largest catchment, Gaulfoss. The observed seNorge data in the training period 1957–1986, the stationary WGs simulation, and three simulations using climate change information from RCM2 are compared against the seNorge data in the test period 1987–2005.
[Figure omitted. See PDF]
The IQD values in Table vary substantially across the simulation methods for individual catchments. To investigate this further, we take a closer look at the trend coefficient estimates, as the estimated changes in seasonality are quite stable across catchments for a given RCM and model component (results not shown). The estimates of the trend coefficient in Eq. () based on the seNorge training data from 1957 to 1986 are given in Table in Sect. above. For the probit model, the trend estimates are positive in all but one catchment, the small inland catchment Kjeldstad, where a small negative trend is estimated. As a result, the probability of precipitation is expected to increase over time. The rate of the increase varies substantially for the different catchments, ranging from 0.001 in Trangen to 0.051 in Veravatn. For the gamma distribution, the trend coefficient estimates are highly varying across catchments, with negative estimates for three catchments and positive estimates for six catchments, indicating no consistent trend pattern in the amount of daily precipitation on wet days. When fitting these models to the RCM data in the training period, we found insignificant trend estimates for the probit model in seven catchments based on RCM1 and five based on RCM2, while the number of cases for the gamma model is six based on RCM1 and four based on RCM2.
Figure 5
Average annual precipitation (a) in the period 1957–2005 and the digital elevation map (b), both at the 1 1 scale in the catchment Gaulfoss.
[Figure omitted. See PDF]
Figure 6
Empirical spatial correlation of precipitation amount at the catchment Gaulfoss for each month of the year. Results are shown for the seNorge data in the test period 1987–2005 (red dots) and for the EQM simulation based on RCM2 (cyan dots). The Matérn spatial correlation estimated with the WG method based on seNorge data in the training period 1957–1986 is indicated in grey, with the width of the bar indicating the spread of the daily estimates within the month.
[Figure omitted. See PDF]
The estimated changes in trend coefficients at the 12 12 scale between the training and test periods are listed in Table . The zeros in the table indicate that the changes are not significantly different from 0 at the 0.05 level. The seNorge estimates for the probit model are mostly positive, corresponding to a higher trend estimate in the test period than the training period. The estimates based on RCM1 are consistently negative, while no change is estimated based on RCM2 except for Aamot. For the gamma model, approximately as many positive and negative values are observed, while estimates in all catchments are positive by both RCMs. Note that the stationary simulation WGs assumes the same trends in the training and test periods, corresponding to values of 0 in Table .
The simulations WGs, WG1.1, and WG2.1 share the same probit model for precipitation occurrence, while the gamma model for the precipitation amount differs. For the gamma model, five catchments have a strong positive climate change signal according to the upscaled seNorge data, where both RCMs project a change in the same direction. Looking at the IQD values in Table , we see this translates directly into lower IQD values compared to the WGs simulations. IQD values are higher than WGs in the three catchments closest to the coast (Aamot, Krinsvatn, and Oeyungen), where both RCMs project a positive change against the observed negative change. For Trangen, WG2.1 and WGs have similar IQD values because they both apply no change in the trend. In general, both RCMs provide useful climate change information for the gamma model, which makes the overall performance of WG1.1 and WG2.1 better than WGs.
Figure 7
Empirical spatial correlation of precipitation amount at the catchment Kjeldstad for each month of the year. Results are shown for the seNorge data in the test period 1987–2005 (red dots) and for the EQM simulation based on RCM2 (cyan dots). The Matérn spatial correlation estimated with the WG method based on seNorge data in the training period 1957–1986 is indicated in grey, with the width of the bar indicating the spread of the daily estimates within the month.
[Figure omitted. See PDF]
A similar effect can be seen when comparing the IQD values for Gaulfoss, Trangen, and Kjeldstad based on the simulations WG1.1 and WG1.2. While these two simulations share the same gamma model, WG1.1 assumes a stationary probit model and WG1.2 applies climate change information from RCM1 to the precipitation occurrence. Here, the climate change estimates from RCM1 are negative, going in the opposite direction to the seNorge data, and accordingly WG1.2 is worse than WG1.1, which assumes no change in the trend. The negative change applied in WG1.2 in Hoeggaas can also relate to the reduced performance compared with WG1.1. In Veravatn and Dillfoss, however, the estimates based on RCM1 are in the same direction as the observed ones, but this somehow does not translate into a better performance of WG1.2. For Aamot, where no change is estimated by the seNorge data, a negative change by RCM1 seems to make WG1.2 better than WG1.1, and a positive change by RCM2 makes it the only catchment where WG2.2 is worse than WG2.1. In the other catchments, WG2.2 is slightly better than WG2.1 given that they both apply no change in the trend of the probit model; this indicates that the changes in the seasonality projected by RCM2 are generally reasonable, and only the effect seems limited in most catchments.
Figure 8
Proportion of different 2 d dry–wet patterns for the seNorge data in the training period 1957–1986 and the test period 1987–2005 as well as for six different simulations of the test period. The results are aggregated over all grid cells in the catchments Gaulfoss (a) and Oeyungen (b). Dry days are indicated with 0 and wet days with 1. For ease of interpretation, horizontal dashed lines are drawn at the levels of the test set.
[Figure omitted. See PDF]
Further analysis of the marginal performance of four of the simulations as well as the seNorge reference is shown in Fig. for the largest catchment, Gaulfoss, while the climatology and elevation information is given in Fig. . The leftmost plot in Fig. a shows that the frequency of wet days for the seNorge data is generally lower in the training period than the test period. This again results in a significant bias in the overall mean (see Fig. b), while the general correspondence between the amount distributions on wet days is quite good. Here, the IQD value is 3.46 for seNorge, 3.99 for WGs, 3.10 for WG2.1, 2.97 for WG2.2, and 2.8 for EQM2. WG2.1 and WG2.2 share the same distribution for the precipitation amount on wet days, and given that RCM2 projects zero change in the trend of the probit model, performance of the two simulations is different solely due to the different seasonality, which again is minimal; see Fig. a. While EQM2 has the lowest IQD value, it appears that this method overestimates the wet frequency (see Fig. a), the spread on wet days (Fig. d), and thus also the 95th percentile on wet days (Fig. e). However, the IQD score is less sensitive to these errors than to the erroneous overall mean.
Figure 9
Smoothly changing daily estimates of the correlation coefficient in Eq. () for each catchment, estimated based on the seNorge data in the training period 1957–1986 (green dotted lines), inferred by adding the climate change information from RCM1 (cyan dashed lines) and RCM2 (purple dashed lines) for the test period 1987–2005, and as a reference the values estimated based on the seNorge data in the test period 1987–2005 (red solid lines).
[Figure omitted. See PDF]
4.2 Spatial and temporal correlation structureThe spatial correlation structure at the 1 1 scale cannot be inferred from the 12 12 RCM data, and we thus assume that the fine-scale spatial correlation estimated based on the training data also holds for the test data. This is assessed in Fig. for the largest catchment, Gaulfoss, and in Fig. for the smallest catchment, Kjeldstad. The Matérn correlation function estimated based on the training data appears to capture the overall structure of the test data, indicating no large deviations in spatial structure between the two time periods. However, there are some smaller deviations, indicating smaller changes in the seasonal pattern of the spatial structure. In particular, the estimated correlation is slightly higher than that observed in February and somewhat lower in autumn, especially at Kjeldstad. For both catchments, the largest spread of the daily estimates of the correlation function is in the spring months of April and May.
The spatial structure of the EQM simulation differs somewhat from that of the data. The correlation is too strong in the winter months of December, January, and February and too weak in June. It further appears that the EQM is more successful in modelling the spatial correlation of the data from the larger catchment Gaulfoss than the data from the small catchment Kjeldstad, whose area of 144 is approximately 5 % of the area of Gaulfoss at 3084 .
In order to assess the temporal correlation structure of the various simulations, first consider the 2 d dry–wet patterns shown in Fig. . For the inland catchment Gaulfoss, the proportions of 2 consecutive dry days and 2 consecutive wet days is approximately equal in the training set, while the test set has fewer instances of 2 consecutive dry days, with a corresponding increase in 2 consecutive wet days. The proportions of 2 consecutive dry or wet days for the simulations are mostly in between the values for the seNorge training and test sets, except for EQM2, which has the highest frequency of wet days; see also Fig. a. At the coastal catchment Oeyungen, nearly 50 % of all the 2 d patterns observed in the training period, and over 50 % in the test set, are 2 consecutive wet days. Here, all the simulations yield a lower proportion of 2 consecutive wet days than the observed test data, while the proportions of pairs with 1 wet day and 1 dry day is higher. The results shown here for the WG method are based on a single simulation for each model version. We found that these results may vary slightly between realizations from the same model (results not shown). In addition, we have compared the sequencing of dry days generated by different methods and found that the distribution of dry spells is similar across all simulations for a given catchment, where the majority consist of the short-term cases and a drought event longer than 2 weeks is rare (results not shown).
The temporal correlation applied in the daily fine-scale precipitation generator for the test period is assessed in Fig. . As described in Sect. , the short-term autocorrelation of the WG model is introduced through the temporal dependence in the underlying spatial random field. Data at both spatial scales have the same temporal dimensionality, and we thus assume that the fine-scale temporal correlation coefficients can be updated by the changes projected by an RCM between the training and test periods. Estimates based on seNorge data in the training period indicate higher temporal dependence in spring and winter and lower dependence in summer. In the test period, dependence becomes lower in spring and summer and higher in October and November. The changes in spring are generally not realistically projected by RCMs, except for RCM2 in Trangen, while the changes in summer and early winter are better captured by RCM2 than RCM1 in most catchments.
5 Conclusions and discussion
This paper proposes a two-step stochastic downscaling and bias-correction approach for future projection of daily precipitation. In a first step, a stochastic weather generator for a high-resolution grid is developed using a historical gridded observation-based data product. In a second step, the weather generator is inferred for a future climate by using only the projected changes between a historical reference period and a future period based on a coarser-scale RCM. In the current application, the observation-based data product is available on a 1 1 grid, and the climate change information stems from an RCM on a 12 12 grid. In this setting, there appears to be good correspondence between catchment-scale seasonality and linear trend patterns at the two spatial resolutions, making the transformation of information between the two scales feasible.
The WG approach is applied to data from nine hydrological catchments in central Norway, with each study area ranging in size from approximately 1000 to 5500 and compared against an EQM and a simple persistence reference method. The methods are trained on daily data from 1957 to 1986 and tested on out-of-sample data from 1987 to 2005. Based on an evaluation of the resulting marginal distributions, the WG method overall outperforms the EQM approach, both in terms of the IQD score and based on empirical assessment of marginal summary statistics. However, all the simulation methods show large variations in the performance between individual catchments. The WG method furthermore yields realistic temporal and spatial correlation structures.
The historical RCM runs used here are available until 2005, and the observation-based data are available from 1957, yielding a dataset with 49 years of data. With 30 years of data used to train the models, this leaves only 19 years of data for the out-of-sample evaluation. With only 19 years of data in the test period, we may expect to see some effects of natural variability when comparing the seNorge data product and the largely free-running RCMs. Looking at the linear trend coefficient in the probit model, it seems that the seNorge data upscaled to 12 resolution are generally able to capture the change where there are proportionally more wet days in the test period than in the training period, while the RCM data either project strong negative changes or simply no change in most catchments. For the gamma model, however, both RCMs seem to have projected correct changes in the trend and seasonality. Overall, we see that all versions of the WG method yield better performance than the marginal persistence reference method based on seNorge data from 1957 to 1986, and including RCM information improves upon the stationary WG approach. Furthermore, the transparent way in which the RCM information is included in the WG simulations allows for a direct assessment of this information and its plausibility .
In our case study, the training and test periods are two consecutive time periods. However, in climate change impact studies, there is commonly a large gap of the order of decades between the historical period and the future period of interest. In this case, it may be necessary to expand our proposed model to also account for large-scale climate oscillation or teleconnection patterns, such as the El Niño–Southern Oscillation (ENSO) and the Indian Ocean Dipole (IOD), particularly in regions where rainfall climatologies are dominated by such patterns
While the application in this paper focuses on climate projections, the modelling framework proposed here provides a more general approach to computationally efficient stochastic downscaling of precipitation. Other potential applications include seasonal and decadal weather and climate predictions. The availability of computationally efficient downscaling methods is especially important in settings where large ensembles are needed in order to achieve prediction skill; see e.g. .
Code availability
Code is available upon request from the authors.
Data availability
The seNorge version 2018 data are available at
Author contributions
All the authors defined the scientific scope of this study together. TLT and QY formulated the methodology of the paper. QY prepared the R code for the statistical modelling, simulations, and evaluations of the proposed method. TLT provided support in many parts of the R code. WKW provided the results of the reference model and Fig. 2. TLT and QY contributed to the write-up of the manuscript. All the authors provided ideas and suggested improvements during the entire process of conducting the research.
Competing interests
The authors declare that they have no conflict of interest.
Disclaimer
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Acknowledgements
This work was supported by the Research Council of Norway through project no. 255517 “Post-processing Climate Projection Output for Key Users in Norway”. The work of Thordis Thorarinsdottir was additionally supported by the Research Council of Norway through project no. 309562 “Climate Futures”.
Financial support
This research has been supported by the Research Council of Norway (Norges Forskningsråd, grant nos. 255517 and 309562).
Review statement
This paper was edited by Thomas Kjeldsen and reviewed by two anonymous referees.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2021. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Climate change impact assessment related to floods, infrastructure networks, and water resource management applications requires realistic simulations of high-resolution gridded precipitation series under a future climate. This paper proposes to produce such simulations by combining a weather generator for high-resolution gridded daily precipitation, trained on a historical observation-based gridded data product, with coarser-scale climate change information obtained using a regional climate model. The climate change information can be added to various components of the weather generator, related to both the probability of precipitation as well as the amount of precipitation on wet days. The information is added in a transparent manner, allowing for an assessment of the plausibility of the added information. In a case study of nine hydrological catchments in central Norway with the study areas covering 1000–5500
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 Norwegian Water Resources and Energy Directorate, Oslo, Norway; Department of Geosciences, University of Oslo, Oslo, Norway
2 Norwegian Computing Center, Oslo, Norway
3 Norwegian Water Resources and Energy Directorate, Oslo, Norway
4 Department of Geosciences, University of Oslo, Oslo, Norway