Introduction
A quantitative appraisal of precipitation () amount and of its spatiotemporal distribution is essential for many scientific and operational applications, including but not limited to increasing our understanding of the hydrological cycle, assessing the hydrological impacts of human activities, assessing water resources, irrigation planning, and forecasting of droughts and floods . However, is also one of the meteorological variables that is most difficult to estimate, due to its high spatiotemporal heterogeneity . There are currently four principal measurement or modeling sources to determine : (i) ground-based gauge observations, (ii) ground-based radar remote sensing, (iii) satellite remote sensing, and (iv) atmospheric retrospective-analysis models.
Due to the highly localized nature of gauge observations, the accuracy of interpolation-based spatial estimates is highly dependent on gauge network density and the degree of spatial coherence, both of which are highly variable globally . In addition, gauge observations are typically affected by systematic biases in mountainous environments due to the elevation bias in gauge placement and in snow-dominated regions due to wind-induced under-catch . Ground-based radars offer high temporal and spatial resolution but their coverage is limited to developed regions . Moreover, very few merged radar data products are readily available.
Satellites are capable of observing large areas instantaneously at a high resolution. They are particularly suitable for rainfall estimation in the tropics, which exhibit highly heterogeneous rainfall patterns due to the importance of convective storms . However, satellite retrieval approaches are susceptible to systematic biases, relatively insensitive to light rainfall events, and tend to fail over snow- and ice-covered surfaces . Finally, atmospheric reanalysis models are ideally suited for simulating the evolution of large-scale (synoptic) weather systems, but poorly represent the variability associated with convection due mainly to their relatively low resolution and deficiencies in the parameterizations of sub-grid processes . Cloud-resolving models have only recently started to tackle the convection- link (e.g., ). Although only available regionally, ground-based gauges and radars are generally regarded as the most reliable way of estimating and thus have frequently been used for evaluation and improvement of satellite- and reanalysis-based estimates .
Table lists a selection of 21 publicly available
gridded datasets suitable for (quasi-)global hydrological studies (for
more exhaustive overviews, see
Non-exhaustive overview of freely available (quasi-)global gridded datasets. If a particular dataset is available in different spatial resolutions or in different variants we only listed the “best” one (e.g., we list the gauge-adjusted 3B42 variant of TMPA rather than the non-gauge-adjusted 3B42RT variant). The datasets are sorted first by source and then alphabetically by short name. MSWEP has been added for the sake of completeness.
Short name | Full name and details | Data source(s) | Spatial | Spatial | Temporal | Temporal | Reference(s) | |
---|---|---|---|---|---|---|---|---|
resolution | coverage | resolution | coverage | |||||
1 | CPC Unified | Climate Prediction Center (CPC) Unified | Gauge | 0.5 | Global | Daily | 1979–present | , |
2 | CRU | Climatic Research Unit (CRU) Time-Series (TS) | Gauge | 0.5 | Global | Monthly | 1901–2014 | |
3 | GPCC | Global Precipitation Climatology Centre (GPCC) Full Data Reanalysis | Gauge | 0.5 | Global | Monthly | 1901–present | |
and First Guess | ||||||||
4 | PREC/L | PRECipitation REConstruction over Land (PREC/L) | Gauge | 0.5 | Global | Monthly | 1948–present | |
5 | UDEL | University of Delaware (UDEL) | Gauge | 0.5 | Global | Monthly | 1901–2014 | |
6 | CHIRPS | Climate Hazards group Infrared Precipitation with Stations (CHIRPS) | Gauge, satellite | 0.05 | 50 N–50 S | Daily | 1981–present | |
7 | CMORPH | CPC MORPHing technique (CMORPH) | Gauge, satellite | 0.25 | 60 N–60 S | Daily | 1998–present | |
8 | GPCP-1DD | Global Precipitation Climatology Project (GPCP) 1-Degree Daily | Gauge, satellite | 1 | Global | Daily | 1996–2015 | |
(1DD) Combination | ||||||||
9 | GSMaP-MVK | Global Satellite Mapping of Precipitation (GSMaP) Moving Vector | Gauge, satellite | 0.1 | 60 N–60 S | Hourly | 2000–present | |
with Kalman (MVK) | ||||||||
10 | IMERG | Integrated Multi-satellitE Retrievals for GPM (IMERG) | Gauge, satellite | 0.1 | 60 N–60 S | 30 min | 2014–present | |
11 | PERSIANN-CDR | Precipitation Estimation from Remotely Sensed Information using | Gauge, satellite | 0.25 | 60 N–60 S | 6 hourly | 1983–2012 | |
Artificial Neural Networks (PERSIANN) Climate Data Record (CDR) | ||||||||
12 | TMPA 3B42 | TRMM Multi-satellite Precipitation Analysis (TMPA) 3B42 | Gauge, satellite | 0.25 | 50 N–50 S | 3 hourly | 1998–present | |
13 | MERRA-Land | Modern Era Retrospective-Analysis for Research and Applications | Gauge, reanalysis | 0.5 0.67 | Global | Hourly | 1979–present | |
(MERRA)-Land | ||||||||
14 | PFD | Princeton global meteorological Forcing Dataset | Gauge, reanalysis | 0.25 | Global | 3 hourly | 1948–2012 | |
15 | WFDEI | WATCH Forcing Data ERA-Interim (WFDEI) | Gauge, reanalysis | 0.25 | Global | 3 hourly | 1979–2014 | |
16 | NCEP-CFSR | National Centers for Environmental Prediction (NCEP) Climate | Reanalysis | 0.3125 | Global | Hourly | 1979–2010 | |
Forecast System Reanalysis (CFSR) | ||||||||
17 | ERA-Interim | European Centre for Medium-range Weather Forecasts ReAnalysis | Reanalysis | 0.25 | Global | 3 hourly | 1979–2014 | |
Interim (ERA-Interim) | ||||||||
18 | JRA-55 | Japanese 55-year ReAnalysis (JRA-55) | Reanalysis | 1.25 | Global | 3 hourly | 1959–present | |
19 | CHOMPS | Cooperative Institute for Climate Studies (CICS) High-Resolution | Satellite | 0.25 | 60 N–60 S | Daily | 1998–2007 | |
Optimally Interpolated Microwave Precipitation from Satellites | ||||||||
(CHOMPS) | ||||||||
20 | SM2RAIN-ASCAT | Based on Advanced Scatterometer (ASCAT) data | Satellite | 0.5 | Global | Daily | 2007–2015 | |
21 | CMAP | CPC Merged Analysis of Precipitation (CMAP) | Gauge, satellite, | 2.5 | Global | 5 day | 1979–present | |
reanalysis | ||||||||
22 | MSWEP | Multi-Source Weighted-Ensemble Precipitation (MSWEP) | Gauge, satellite, | 0.25 | Global | 3 hourly | 1979–2015 | This study |
reanalysis |
0.25 spatial resolution for the conterminous USA. 1 spatial resolution for 2014–present. 80 km effective spatial resolution (i.e., the resolution of the employed atmospheric model).
The datasets have been designed for different applications and provide sometimes widely varying estimates , even among gauge-adjusted datasets . Despite several studies that intercompared and evaluated these datasets in different regions (for non-exhaustive overviews, see and ), so far no clear consensus has emerged on which estimation approach is superior overall. Indeed, each approach comes with potential disadvantages that may restrict its usefulness for global hydrological modeling. Common issues include the following:
With the exception of CMAP, all datasets listed in Table employ only one or two of the main data sources – either gauge and satellite, or gauge and reanalysis – and thus do not take full advantage of the complementary nature of satellite and reanalysis data identified in previous studies (e.g., );
Many of the listed datasets do not explicitly and fully account for gauge under-catch and/or orographic effects, and consequently underestimate in many regions around the globe (e.g., );
Many datasets (numbered 6–12 and 14–21 in Table ) do not incorporate gauge observations or do so on a monthly basis, and hence may not make optimal use of valuable information on the daily variability provided by gauges;
Some datasets (1–5, 8, 13, 16, 18, 20, and 21) have a spatial resolution that is much coarser ( 0.5) than desirable for hydrological applications, given the importance of non-linear responses to rainfall;
Several datasets (6, 7, 9–12, and 19–20) have a quasi-global spatial coverage, precluding truly global terrestrial applications;
A few of the datasets (2–5 and 21) have a monthly temporal resolution which is insufficient for most dynamic hydrological modeling applications;
Finally, several datasets (7, 9, 10, 12, 19, and 20) cover a relatively short period ( 20 years), which is problematic when assessing hydrological changes in a long-term context.
Flow chart summarizing the main steps carried out to produce MSWEP. The different colors represent different timescales. refers to normalized (i.e., anomalies).
[Figure omitted. See PDF]
This research was motivated by the opportunity to develop a global gridded dataset with a 3-hourly temporal and 0.25 spatial resolution for the period 1979–2015, characteristics that make it more suitable for hydrological applications for the above-mentioned reasons. The dataset has been named Multi-Source Weighted-Ensemble Precipitation (MSWEP) as it seeks to optimally combine data from various gauge, satellite, and reanalysis data sources.
MSWEP methodology
Figure presents a flow chart summarizing the main steps carried out to produce MSWEP. The methodology described here corresponds to version 1.1 of the dataset. In brief, we first derived a long-term bias-corrected climatic mean (explained in Sect. ). Next, several gridded satellite and reanalysis datasets were evaluated in terms of temporal variability to assess their potential inclusion in MSWEP (explained in Sect. ). The long-term climatic mean was subsequently temporally downscaled in a stepwise manner first to the monthly, then to the daily, and finally to the 3-hourly timescale using weighted averages of anomalies derived from gauge, satellite, and reanalysis datasets to yield the final MSWEP dataset (explained in Sect. ). This method was used since it (i) is relatively easy to understand and implement; (ii) accommodates the inclusion of datasets with 3-hourly, daily, as well as monthly temporal resolutions; (iii) is largely data-driven (i.e., the weights are based on gauge observations); (iv) accounts for cross-correlation among datasets of the same type (satellite or reanalysis); (v) treats random (i.e., temporally variable) and systematic (i.e., long-term) errors separately; (vi) accounts for gauge under-catch and orographic bias; and (vii) yields reliable estimates (as the comparative performance evaluation described in Sect. will demonstrate).
Bias correction of CHPclim
The long-term mean of MSWEP was based on the recently released Climate Hazards Group Precipitation Climatology (CHPclim) dataset (0.05 resolution; version 1.0; ), a global climatology based on gauge observations and satellite data. The CHPclim data were replaced with more accurate Parameter-elevation Relationships on Independent Slopes Model (PRISM) data for the conterminous USA (1 km resolution; ) and data for New Zealand (0.05 resolution). For brevity, however, we refer to this combination of gauge-analysis products as CHPclim. Although CHPclim has been corrected for orographic effects it has not been adjusted for gauge under-catch, and is thus expected to underestimate the actual , especially in snow-dominated regions. Two approaches were tested to correct for this.
Bias correction using catch-ratio equations
The first approach uses country-specific catch-ratio (CR) equations for snowfall from , as summarized by , and the rainfall CR equation for unshielded gauges from . The mean wind speed data required as input were derived from a global 5 km map of mean wind speed at 80 m height based on atmospheric reanalysis model output . To estimate the wind speed value at 5 m height we used the wind profile power-law relationship defined as follows: where (m s) represents the wind speed at height (m), and (m s) represents the wind speed at reference height (m). The exponent (unitless) is an empirical coefficient that depends on surface roughness and atmospheric stability and was set to 0.3 following . A global map of the snowfall fraction of was computed based on air temperature () data from the gauge-based WorldClim monthly climatic dataset (1 km resolution; version 1.4 release 3; ) using a threshold of 1C to distinguish between rain and snow. It is noted that the CR equations were not designed for application at monthly timescales. However, the necessary daily station observations of , , and wind speed are unfortunately generally not available in mountainous regions where they are needed the most .
Bias correction based on observations
The second approach tested to bias-correct CHPclim involves the use of observation-based estimates of long-term streamflow () and potential evaporation () to infer the “true” (cf. ). For this purpose we employed the relationship, which has been frequently used to estimate long-term actual evaporation () from long-term and values. Here we use it in an inverse manner to estimate, only for regions with snowfall and/or complex topography, the underestimation of long-term from long-term and data under the assumption that the long-term equals the difference between long-term and and that changes in water storage are negligible. The relationship was chosen over several other similar relationships because it includes an empirical parameter related to the plant-available water capacity termed (unitless). This parameter was set to the “shortgrass and crops” value of 0.5 to produce a conservative correction of . The relationship is given by the following: where , , and are measured in millimeters per year (mm yr). Assuming that , the relationship reformulated to yield is given by the following: where is measured in millimeters per year (mm yr) and and are defined as and respectively.
Following this method, we inferred long-term from records for
13 762 regions (12 233 catchments and 1529 interstation catchment areas)
across the globe. Long-term estimates for these catchments were computed
from the same three sources as those used by , namely (i) the
US Geological Survey (USGS) Geospatial Attributes of Gages for Evaluating
Streamflow (GAGES)-II database , (ii) the Global Runoff
Data Centre (GRDC;
To assess the sensitivity of the values inferred using Eq. () to biases in , we used estimates derived using three formulations: (i) the temperature-based equation, (ii) the radiation-based approach, and (iii) the FAO-56 Penman–Monteith combination equation . The data were produced by using meteorological inputs from the WFDEI dataset. Since the resulting values generally differed by 10 % depending on the formulation, the mean of the three estimates was adopted.
Next, we computed bias correction values for each catchment area as the ratio
of inferred from Eq. () over the original, unadjusted .
The resulting values were subsequently interpolated to produce a continuous
global map by calculating, for each 0.25 grid cell with snowfall
and/or complex topography, the median bias correction factor of the
10 “closest” catchments. Note that in calculating the median we also included
catchments with bias correction factors 1. An empirical distance measure
was developed as follows:
where (km) is the longitudinal distance to the catchment centroid, (km) is the latitudinal distance to the catchment centroid, and (m) is the
difference between the grid-cell mean elevation and the catchment-mean elevation
(note the different horizontal and vertical length units). By
including , the bias correction factors obtained for mountainous
catchments exert less influence on the adjacent flat terrain. By using the
median, the impact of potential outlier errors in the observations,
estimates, and catchment boundary data is limited. Bias
correction factors were only calculated for regions with snowfall, defined by
a snowfall fraction 0.25, and/or complex topography, defined by an
“effective terrain height” 100 m . The snowfall fraction
was computed using WorldClim data using a threshold of
1C, while the elevation data necessary for calculating and the
effective terrain height were derived from Consultative Group for
International Agricultural Research (CGIAR) Shuttle Radar Topography Mission (SRTM)
data (90 m resolution; version 4.1; ) for latitudes
60 N and GTOPO30 (1 resolution;
Overview of the (quasi-)global datasets considered for inclusion in MSWEP.
Short name | Full name and details | Data source | Spatial | Temporal | Spatial | Temporal span | Reference |
---|---|---|---|---|---|---|---|
resolution | resolution | coverage | |||||
CMORPH | CPC MORPHing technique (CMORPH) | Satellite | 0.25 | 3 hourly | 60 N– | Jan 1998 | |
version 1.0 | 60 S | to Jan 2015 | |||||
GSMaP-MVK | Global Satellite Mapping of Precipitation | Satellite | 0.1 | Hourly | 60 N– | Mar 2000 | |
(GSMaP) Moving Vector with Kalman | 60 S | to Dec 2010; | |||||
(MVK) standard versions 5 and 6 | Mar 2014 | ||||||
to Oct 2015 | |||||||
TMPA 3B42RT | TRMM Multi-satellite Precipitation Analysis | Satellite | 0.25 | 3 hourly | 50 N– | Jan 2000 | |
(TMPA) 3B42RT version 7 | 50 S | to Oct 2015 | |||||
PERSIANN | Precipitation Estimation from Remotely | Satellite | 0.25 | 3 hourly | 60 N– | Mar 2000 | |
Sensed Information using Artificial Neural | 60 S | to Feb 2015 | |||||
Networks (PERSIANN) | |||||||
SM2RAIN-ASCAT | Based on ASCAT data | Satellite | 0.5 | Daily | Global | 2007–2015 | |
NCEP-CFSR | National Centers for Environmental | Reanalysis | 0.31 | Hourly | Global | Jan 1979 | |
Prediction (NCEP) Climate Forecast System | to Mar 2011 | ||||||
Reanalysis (CFSR) | |||||||
ERA-Interim | European Centre for Medium-range Weather | Reanalysis | 0.25 | 3 hourly | Global | Jan 1979 | |
Forecasts ReAnalysis Interim (ERA-Interim) | to Dec 2014 | ||||||
JRA-55 | Japanese 55-year ReAnalysis (JRA-55) | Reanalysis | 1.25 | 3 hourly | Global | Jan 1959 to | |
present | |||||||
CPC Unified | Climate Prediction Center (CPC) Unified | Gauge | 0.5 | Global | Daily | 1979– | , |
version 1.0 and real time | present | ||||||
GPCC | Global Precipitation Climatology Centre | Gauge | 0.5 | Global | Monthly | 1901– | |
(GPCC) Full Data Reanalysis and First | present | ||||||
Guess version 7 |
Selected to be included in MSWEP. 80 km effective spatial resolution (i.e., the resolution of the employed atmospheric model). 0.25 spatial resolution for the conterminous USA. 1 spatial resolution for 2014–present.
Gauge-based evaluation of gridded datasets
Eight non-gauge-adjusted satellite and reanalysis datasets were evaluated in terms of temporal variability to assess their potential inclusion in MSWEP (see Table ). These datasets were specifically chosen for their lack of gauge adjustments, allowing us to evaluate their performance using gauge observations. For the evaluation, we considered daily data from 99 444 gauges included in the Global Historical Climatology Network-Daily (GHCN-D) database and 16 912 gauges included in the Global Summary of the Day (GSOD) database (amounting to 116 356 stations in total). All gridded data were resampled to 0.5 for consistency, and if there were multiple gauges located within a single 0.5 grid cell the mean gauged was calculated. Only stations with 4 months of data (not necessarily consecutive) between March 2000 (marking the start of GSMaP-MVK and PERSIANN) and December 2010 (marking the end of GSMaP-MVK version 5) were used, and if a particular station was included in both databases we only kept the one in the GHCN-D.
Previous evaluation studies have used a large variety of evaluation metrics (see the overview by ). Here we use Pearson correlation coefficients computed between 3 day mean rainfall and snowfall time series from the gauges and the gridded datasets. Correlation coefficients are highly sensitive to large events, which is a desirable feature since large events are more important in the context of hydrological modeling. We used 3-day averages instead of daily averages to account for mismatches in the 24 h measurement period between the datasets and the gauges and to reduce the influence of erroneous measurements. Snowfall and rainfall were evaluated separately, and were distinguished using a threshold of 1C based on daily mean data from the ERA-Interim atmospheric reanalysis dataset resampled to 0.5 using ”nearest neighbor” and offset to match the long-term mean of WorldClim. Gauges with correlation coefficients 0.4 for all datasets were deemed unreliable and excluded from the analysis (cf. ). The performance of each dataset in terms of bias was not evaluated since MSWEP relies on CHPclim for its long-term average.
Merging procedure
The procedure to generate the MSWEP dataset involves four distinct stages of merging (see Fig. ). In the first, we produced a 0.25 global map of long-term mean based on the CHPclim dataset, which was replaced with more accurate regional datasets for the USA and New Zealand, and corrected for bias (see Sect. ).
In the second merging stage (see Fig. ), for the monthly, daily, and 3-hourly timescales separately, and for the satellite and reanalysis datasets separately, we merged anomalies from the different datasets by weighted averaging to yield one anomaly estimate at each timescale for each data source. The weight maps for each dataset were derived for rainfall and snowfall separately from the 3-day correlation coefficients obtained in the evaluation using gauge observations (see Sect. ). The resulting 3-day correlation coefficients were first squared to yield the explained variance and subsequently interpolated to produce continuous global weight maps by calculating, for each 0.25 grid cell, the median of the 10 most nearby gauges. The rainfall interpolated weight was used for grid cells with 3C, while the snowfall interpolated weight was used for grid cells with 3C. The weights for satellite data were set to 0 for grid cells with 3C. data were taken from ERA-Interim, resampled to 0.25 using nearest neighbor, and offset to match the long-term means of WorldClim.
In the third merging stage (see Fig. ), for the monthly, daily, and 3-hourly timescales separately, we merged the anomalies from the different data sources by weighted averaging to yield one anomaly estimate at each timescale. For the satellite and reanalysis data sources, the weights were set to the maximum of the weights assigned to the individual datasets in the second merging stage. For example, if for a particular grid cell the GSMaP-MVK dataset was assigned the highest weight in the second merging stage, we used the GSMaP-MVK weight for the satellite data source in the third merging stage. Weights for the gauge data were empirically calculated as a function of gauge density by means of the following expression: where (unitless) represents the weight for grid cell , (km) the distance to grid cell , (km) the range of influence, and the number of stations for grid cell . The summation is over all grid cells 1, 2, …, within a 1000 km radius of grid cell . was set to 25 km using trial and error. Typical values for range from 0 in completely ungauged regions to 1 for grid cells containing only one isolated station and approximately 2.5 in densely gauged regions (e.g., in southeast Australia). As the available gauge observations vary over time, the associated weight maps vary accordingly.
In the fourth and final merging stage (see Fig. ), the long-term mean was downscaled in a stepwise manner using the anomalies first to the monthly timescale, then to the daily timescale, and finally to the 3-hourly timescale to yield the MSWEP dataset.
MSWEP performance evaluation
Evaluation using FLUXNET gauge observations
Observed data from 125 tower sites included in the FLUXNET dataset were used to evaluate eight datasets: (i) the final MSWEP, (ii) the gauge-only MSWEP, (iii) the satellite-only MSWEP, (iv) the reanalysis-only MSWEP, (v) WFDEI (adjusted using CRU TS3.1), (vi) GPCP-1DD (version 1.2), (vii) TMPA 3B42 (version 7), and (viii) CPC Unified (version 1.0 and real time; see Table for details). The FLUXNET data were used for this purpose because they are completely independent; they have not been used in the development of any of the datasets. The different MSWEP variants were obtained by setting the weights of the other two sources to zero (e.g., for obtaining the gauge-only MSWEP, the satellite and reanalysis weights were set to zero). All data with a resolution 0.5 were resampled to 0.5 using averaging for consistency. Only stations with 120 days of data (not necessarily consecutive) during the period 1998–2015 for all datasets were used. The list of stations is provided in the Appendix.
HBV model parameter descriptions and calibration ranges.
Parameter (units) | Description | Minimum | Maximum |
---|---|---|---|
TT (C) | Threshold temperature when is simulated as snowfall | 2.5 | 2.5 |
CWH (–) | Water-holding capacity of snow | 0 | 0.2 |
CFMAX (mmC day) | Melt rate of the snowpack | 0.5 | 5 |
CFR (–) | Refreezing coefficient | 0 | 0.1 |
FC (mm) | Maximum water storage in the unsaturated-zone store | 50 | 700 |
LP (–) | Soil moisture value above which reaches | 0.3 | 1 |
BETA (–) | Shape coefficient of recharge function | 1 | 6 |
UZL (mm) | Threshold parameter for extra outflow from upper zone | 0 | 100 |
PERC (mm day) | Maximum percolation to lower zone | 0 | 6 |
K0 (day) | Additional recession coefficient of upper groundwater store | 0.05 | 0.99 |
K1 (day) | Recession coefficient of upper groundwater store | 0.01 | 0.8 |
K2 (day) | Recession coefficient of lower groundwater store | 0.001 | 0.15 |
MAXBAS (day) | Length of equilateral triangular weighting function | 1 | 5 |
Three performance metrics were considered. The first is the Pearson correlation coefficient (), which ranges from 1 to 1, with higher values corresponding to better performance. The second is the root mean square error (RMSE), which ranges from 0 to , with lower values corresponding to better performance. The third is the absolute bias (), defined as follows: where is the long-term mean of the gridded estimates and the long-term mean of the station record. Bias values range from 0 to 1, with lower values corresponding to better performance. The and RMSE values were computed based on daily (rather than 3-day mean) data.
Evaluation using hydrological modeling
The performance of the previously mentioned eight datasets (see Sect. ) was further evaluated using hydrological modeling for 9011 catchments ( 50 000 km), by calibrating the simple conceptual hydrological model HBV using from each of the different datasets. For the calibration, we used the common efficiency (NSE) objective function computed between simulated and observed time series. All data with a resolution 0.5 were resampled to 0.5 using averaging for consistency. The observed daily and associated catchment boundary data were obtained from the same three sources as those used for the CHPclim bias correction (see Sect. ). Catchments were required to have an area 50 000 km and a record length 1 year between 1998–2015 (not necessarily consecutive).
HBV operates at a daily time step, has two groundwater stores and one unsaturated-zone store, and uses a triangular weighting function to simulate channel routing delays. The model was chosen because of its flexibility, low computational cost, and successful application under a large variety of climatic and physiographic conditions (e.g., ). Besides daily time series, the model requires daily time series of and as inputs. was calculated using the equation as given by with daily net radiation, , atmospheric pressure, wind speed, and relative humidity derived from WFDEI, and surface albedo from monthly climatic data derived from the European Space Agency (ESA) GlobAlbedo dataset . data were also taken from WFDEI. Only a single “elevation/vegetation zone” was considered for each catchment. Table lists the model parameters including their calibration range. The stores were initialized by running the model for the first 10 years of the record if the record length was 10 years, or by running the model multiple times for the entire record if the record length was 10 years.
The model was recalibrated for each dataset for the period with simultaneous observed and input data in a lumped fashion to minimize the computational time. For the calibration we used the ( ) evolutionary algorithm implemented using the Distributed Evolutionary Algorithms in Python (DEAP) toolkit . The calibration procedure was almost the same as in ; however, to limit the computational time we reduced the population size () to 12 (from 24), the recombination pool size () to 24 (from 48), and the number of generations to 10 (from 25).
Global maps of (a) bias correction factors estimated using country-specific equations, (b) bias correction factors inferred using the relationship from observations and estimates, (c) bias correction factors computed by interpolating the -based values, and (d) mean annual (mm yr) from CHPclim bias-adjusted using the interpolated bias correction factors and used for MSWEP. The data points in (b) represent the centroids of the catchments ( 12 493) and the interstation regions ( 1651). Note that there are many values in (b) , but these are not of interest here and thus are shown as 1.
[Figure omitted. See PDF]
Results and discussion
Bias correction of CHPclim
The CHPclim dataset is used to determine the long-term mean of MSWEP. Although CHPclim has been adjusted for orographic effects, it does not explicitly account for wind-induced gauge under-catch and is thus likely to underestimate , particularly in snow-dominated regions. We tested two approaches to bias-correct CHPclim. Figure a shows bias correction factors computed using country-specific CR equations and a global 5 km map of mean wind speed. The spatial patterns agree reasonably well with those from three previous studies based on interpolation of CR values computed from daily station observations of , , and wind speed . Figure b shows bias correction factors inferred for the 13 762 regions using the relationship in combination with observations and estimates (see Sect. and Eq. ). These values were subsequently interpolated to obtain a continuous global map (see Fig. c). The resulting bias correction factors are highest in Alaska, northern Canada, the Andes, Scandinavia, the Central Asian mountain ranges, and northeastern Russia (Fig. c).
There is little agreement between Fig. a and c, particularly in western Russia where the use of corrected using the country-specific equations to force a hydrological model would probably result in appreciable overestimation. Although further research is needed to ascertain the exact cause of this discrepancy, these results suggest firstly that the country-specific CR equations are unsuitable for global application, and secondly, that there is probably still a substantial amount of orographic unaccounted for in CHPclim. Indeed, almost all regions with high bias correction values ( 1.25) in Fig. c exhibit a low density of gauges predominantly located at low elevations in regions with complex topography which might have confounded the estimation of orographic in CHPclim. The global patterns of Fig. c agree reasonably well with those from , who estimated the bias in due to orographic effects at global scale using a similar approach based on observations from 357 (mountainous) catchments ( 10 000 km).
Figure d presents the CHPclim adjusted using the interpolated bias correction factors shown in Fig. c. The bias correction of CHPclim resulted in an increase in global terrestrial mean annual (Antarctica excluded) from 817 to 877 mm yr, amounting to an increase of 7.4 %. This number is lower than the 17.9 % increase reported by , possibly because CHPclim has already been corrected for orographic effects. Our estimate is close to the 850 mm yr cited by based on the GPCC dataset, which includes orographic corrections and accounts for gauge under-catch using .
Gauge-based evaluation of gridded datasets
We evaluated eight non-gauge-adjusted gridded datasets in terms of temporal variability using observations from thousands of gauges around the world, to assess their value for inclusion in MSWEP (see Table ). All datasets with a resolution 0.5 were resampled to 0.5 for this exercise for consistency. Figure shows 3-day temporal correlation coefficients calculated between rainfall derived from the datasets and the gauges for 0.5 grid cells containing at least one gauge. The satellite datasets CMORPH, GSMaP-MVK, and TMPA 3B42RT exhibit very similar performance at global scale (Fig. a–c). In agreement with previous regional studies (e.g., ), PERSIANN performed rather poorly (Fig. d), possibly because it is predominantly based on cloud-top IR observations. SM2RAIN-ASCAT performed similarly to PERSIANN (Fig. e). Conversely, found that SM2RAIN-ASCAT performed similarly to TMPA 3B42RT based on 5-day correlations. The lower performance obtained here for SM2RAIN-ASCAT could be due to the use of only 4 years of data for SM2RAIN-ASCAT (2007–2010) vs. 11 years for TMPA 3B42RT (2000–2010). However, the performance of SM2RAIN-ASCAT appears to be somewhat complementary to the other satellite datasets in that it performs best in the western USA where the other satellite datasets perform rather poorly (Fig. ). Better results could potentially be obtained by applying the SM2RAIN technique to a multi-satellite soil moisture product, for example the Essential Climate Variable Soil Moisture (ECV-SM) dataset .
The three reanalysis datasets exhibited very similar performance patterns overall, although ERA-Interim consistently performed slightly better than the other reanalysis datasets, in line with several earlier regional studies evaluating different atmospheric variables (e.g., ). Although NCEP-CFSR performed reasonably well, it was not incorporated in MSWEP due to the presence of wave-like artifacts around mountain ranges (cf. ). Patterns of temporal correlations obtained for the USA for the satellite and reanalysis datasets (Fig. ) correspond well with those obtained by and . Figure presents the difference in 3-day rainfall correlation coefficients between TMPA 3B42RT and ERA-Interim, highlighting the complementary performance of satellite and reanalysis datasets reported in previous large-scale studies . Based on these results, we decided to use CMORPH, GSMaP-MVK, TMPA 3B42RT, ERA-Interim, and JRA-55 for determining the rainfall variability for ungauged regions in MSWEP.
Correlation coefficients calculated between 3-day mean rainfall derived from the datasets and the gauges. Each data point represents a 0.5 grid cell containing at least one gauge.
[Figure omitted. See PDF]
The difference in 3-day rainfall correlation coefficients between TMPA 3B42RT and ERA-Interim. Each data point represents a 0.5 grid cell containing at least one gauge ( 6668). Red and yellow indicate TMPA 3B42RT performs better, whereas blue indicates ERA-Interim performs better.
[Figure omitted. See PDF]
Figure shows the 3-day temporal correlation between snowfall derived from the datasets and the gauges for 0.5 grid cells containing at least one gauge. All five satellite datasets consistently exhibit very low correlation coefficients (Fig. a–e), due to the well-documented failure of retrieval techniques over snow- and ice-covered surfaces, and more generally their inability to detect snowfall in all conditions (e.g., ). Conversely, the reanalysis datasets showed high correlation coefficients for most grid cells (Fig. f–h), even exceeding those obtained for rainfall estimation (Fig. f–h), possibly because almost all snow originates from (non-convective) large-scale synoptic weather systems which tend to be represented well by atmospheric models. Based on these results, we decided to use ERA-Interim and JRA-55 for determining the snowfall variability for ungauged regions in MSWEP.
Correlation coefficients calculated between 3-day mean snowfall derived from the datasets and the gauges. Each data point represents a 0.5 grid cell containing at least one gauge.
[Figure omitted. See PDF]
Weight maps for MSWEP were calculated for each of the selected datasets based on the 3-day correlation coefficients. The weight maps for the satellite and reanalysis datasets are static in time, although we used the rainfall weight for grid cells with 3C and the snowfall weight for grid cells 3C. Conversely, the weight maps for the gauges were computed based on gauge density and thus vary daily depending on the available gauge observations. Figure a shows the sum of the weights used to merge the gauge, satellite, and reanalysis components of MSWEP during merging stage 3 (see Fig. ) at the daily timescale on an arbitrarily chosen day (25 April 2006). The relative contributions of the gauge, satellite, and reanalysis components to the total weight are shown in Fig. b–d, respectively.
Map (a) shows the sum of the gauge weight (see Eq. ) and the satellite and reanalysis weights (derived from 3-day correlation coefficients at surrounding gauges) used during merging stage 3 (see Fig. ) at the daily timescale on an arbitrarily chosen day (25 April 2006; note the non-linear color scale). The contributions of the gauge, satellite, and reanalysis components to the total weight on the same day are shown in (b)–(d), respectively.
[Figure omitted. See PDF]
Evaluation of MSWEP using FLUXNET gauge observations
Observed data from 125 FLUXNET stations were used to evaluate the final MSWEP, the MSWEP gauge, satellite, and reanalysis components, and four existing state-of-the-art gauge-adjusted datasets (WFDEI-CRU, GPCP-1DD, TMPA 3B42, and CPC Unified). Figure presents box plots of the distribution of , RMSE, and values obtained for each dataset. The merged MSWEP performed better than the individual MSWEP components in terms of both and RMSE (Fig. a and b, respectively), confirming the effectiveness of the employed merging scheme. The merged MSWEP also appears to perform better overall than all four existing datasets in terms of both and RMSE (Fig. a and b, respectively), providing the best estimates in terms of for 60.0 % of the stations and in terms of RMSE for 68.8 % of the stations. Interestingly, MSWEP performed average in terms of absolute bias (Fig. c), obtaining the best value for only 9.6 % of the stations, which is somewhat unexpected given the sophisticated interpolation techniques employed in producing the CHPclim dataset . A possible reason for this is that the FLUXNET data are generally not corrected for gauge under-catch. Moreover, some of the records may be subject to measurement bias – at the Chokurdakh station in northeastern Russia (site code RU-Cok; 70.82 N, 147.49 E), for example, snowfall was not measured at all (A. Budishchev, personal communication, 2016). The relatively good performance of the reanalysis-only MSWEP and WFDEI-CRU datasets (Fig. a and b) confirms the usefulness of ERA-Interim for gap-filling of FLUXNET records .
Box plots of the distribution of (a) daily correlation (), (b) root mean square error (RMSE), and (c) absolute bias () values computed between observed and estimated at the 125 FLUXNET sites. Note that for the satellite-only MSWEP only the 113 stations located at 60 N/S were used, while for TMPA 3B42 only the 87 stations located at 50 N/S were used. The locations of the 125 FLUXNET stations and the values obtained by MSWEP are shown in (d). The list of stations is provided in the Appendix.
[Figure omitted. See PDF]
Evaluation of MSWEP using hydrological modeling
We further evaluated the performance of MSWEP based on hydrological modeling for 9011 catchments ( 50 000 km). Figure presents daily NSE scores obtained after the calibration of HBV using daily data from the final MSWEP, the gauge-only MSWEP, the satellite-only MSWEP, and the reanalysis-only MSWEP. The respective datasets attained the highest calibration NSE for 28.7, 53.1, 6.7, and 11.5 % of the catchments. The good results for the gauge-only MSWEP are, however, somewhat misleading since the large majority of the catchments are located in regions with very dense measurement networks (the USA, Europe, and Australia; cf. Figs. b and a). Figure b shows median calibration NSE values for 5 latitude bands. These results once again confirm the complementary nature of satellite and reanalysis data clearly demonstrated in the previous evaluation of datasets using gauge observations (see Sect. ). For most latitudes, the final MSWEP performs either better or only slightly worse than the best-performing single-source MSWEP (Fig. b), confirming the efficacy of the merging scheme. The final MSWEP performs particularly good at latitudes 17.5 N/S and 67.5 N (Fig. b) where gauge densities are relatively low. The reanalysis-only MSWEP exhibits especially low scores for the 17.5–22.5 N latitude band (Fig. b), mainly reflecting the low performance achieved for Puerto Rico, perhaps due to the coarse resolution of the atmospheric models used to produce the reanalysis datasets.
The dataset with the best daily NSE score obtained after calibration of HBV is shown in (a). Also shown are (b) median NSE scores for 5 latitude bands and (c) the number of catchments () for each 5 latitude band. Each data point in (a) represents a catchment centroid ( 9011). The NSE scores have been computed between simulated and observed time series.
[Figure omitted. See PDF]
To examine in more detail how the final MSWEP, the gauge-only MSWEP, the satellite-only MSWEP, and the reanalysis-only MSWEP perform in regions with dense vs. parse gauge networks, Fig. a plots the median calibration NSE for the datasets as a function of catchment-mean distance to the closest gauge. For the 7953 well-gauged catchments, defined here by catchment-mean distance to the closest gauge 25 km, the final MSWEP performed slightly worse than the gauge-only MSWEP. However, only 16.1 % of the global land surface (excluding Antarctica) has a gauge located at 25 km distance. For the 1058 sparsely gauged catchments, defined here by catchment-mean distance to the closest gauge 25 km, MSWEP performs overall considerably better than the gauge-only MSWEP. Contrary to expectations, the median NSE scores increase with increasing distance to the closest gauge for several of the datasets, which is primarily because the more sparsely gauged groups contain less (semi-)arid catchments, which tend to exhibit lower NSE scores. The overall low performance of the satellite-only MSWEP reflects the lack of tropical catchments in our catchment set.
The change in median calibration NSE for the datasets as a function of catchment-mean distance to the closest gauge. The distance to the closest gauge was computed using the GPCC and CPC Unified datasets. The NSE scores have been computed between simulated and observed time series.
[Figure omitted. See PDF]
Figure presents daily NSE scores obtained after calibration of HBV using daily data from MSWEP and four existing state-of-the-art gauge-corrected datasets (WFDEI-CRU, GPCP-1DD, TMPA 3B42, and CPC Unified). MSWEP obtained the highest calibration NSE for 40.4 % of the catchments, while the other datasets obtained the highest calibration NSE for 12.2, 4.1, 11.3, and 32.1 % of the catchments, respectively (noting that the spatial coverage of TMPA 3B42 is limited to 50 N/S). The median calibration NSE score obtained by MSWEP is 0.49 vs. 0.30, 0.22, 0.35, and 0.45 for the other datasets, respectively. However, similar to the gauge-only MSWEP (see Fig. ), the scores for CPC Unified are probably somewhat inflated since most catchments are located in regions with very high gauge densities. Indeed, the picture changes somewhat when we examine median calibration NSE scores for 5 latitude bands; MSWEP performs either similar to or substantially better than the other datasets for all latitudes (Fig. b). The good performance of WFDEI-CRU for the central Interior Plains of North America (Fig. a) appears to be attributable to overestimation by the other datasets including MSWEP.
The dataset with the best daily NSE score obtained after calibration of HBV is shown in (a). Also shown are (b) median NSE scores for 5 latitude bands and (c) the number of catchments () for each 5 latitude band. Each data point in (a) represents a catchment centroid ( 9011). The NSE scores have been computed between simulated and observed time series.
[Figure omitted. See PDF]
Figure b plots the median calibration NSE for the five datasets (MSWEP, WFDEI-CRU, GPCP-1DD, TMPA 3B42, and CPC Unified) as a function of catchment-mean distance to the closest gauge. For the 7953 well-gauged catchments (defined by catchment-mean distance to the closest gauge 25 km), MSWEP obtained a median NSE of 0.49 vs. 0.30, 0.21, 0.35, and 0.47 for the other datasets, respectively, with MSWEP obtaining the best NSE in 39.0 % of these catchments. However, as mentioned previously, only 16.1 % of the global land surface (excluding Antarctica) has a gauge located at 25 km distance. For the 1058 sparsely gauged catchments (defined by catchment-mean distance to the closest gauge km), MSWEP obtained a median NSE of 0.52 vs. 0.30, 0.32, 0.39, and 0.29 for the other datasets, respectively, with MSWEP obtaining the best NSE in 51.6 % of these catchments. Thus, MSWEP performs slightly better than CPC Unified in well-monitored regions, whereas it performs considerably better than all other datasets in sparsely gauged regions which comprise most (83.9 %) of the global land surface.
Figure presents daily NSE scores obtained after calibration of HBV forced with MSWEP. The patterns obtained for the USA exhibit good agreement with those obtained in six previous studies using different hydrological models and forcing data . The NSE values are consistently lower in (semi-)arid regions in North America, Africa, and Australia, likely due to the highly non-linear rainfall-runoff response, the high transmission losses, and the flashy nature of the under these conditions . The lower scores obtained for polar catchments in North America and Iceland are attributable to underestimation, suggesting that there is still some unaccounted for. Note that a low score for a particular catchment does not necessarily mean that MSWEP is unreliable as errors in the , , or catchment boundary data or flow regulation could also be responsible.
Daily NSE scores obtained after calibration of HBV forced by MSWEP. The NSE scores have been computed between simulated and observed time series. Each data point represents a catchment centroid ( 9011).
[Figure omitted. See PDF]
Hydrological modeling has been used in numerous previous studies to evaluate the quality of datasets (e.g., ). However, these previous studies were generally regional in nature and used observations from a relatively small number of stations (7, 23, 9, 1, 22, 300, and 1, respectively, for the cited studies), which may have led to conclusions with limited global-scale applicability. Moreover, some of these studies did not recalibrate the hydrological model for each dataset , which may have led to additional uncertainty since hydrological models are inevitably imperfect representations of reality and thus produce uncertain estimates even with “perfect” meteorological forcing. Conversely, we used observations from a large set of 9011 stations around the globe, spanning all terrestrial latitudes and covering a broad range of physiographic and climatic conditions, which allowed us to examine how -gauge proximity influences the calibration NSE. Furthermore, we recalibrated the model for each catchment and dataset, thereby minimizing the uncertainty in model parameters.
One could argue that the evaluation of MSWEP using hydrological modeling presented here is not completely independent, since the observations used for the evaluation have also been used in the development of MSWEP for the bias correction of CHPclim (see Sect. ). However, the evaluation is in fact nearly completely independent since the results for a particular catchment will not change much, if at all, if that catchment was excluded from the bias adjustment exercise, since the bias correction factors were computed for each grid cell as the median of the 10 closest catchments. Moreover, the large majority of catchments are located in regions not affected by the bias correction (81.7 % of the catchments have an interpolated bias correction factor between 1 and 1.02; see Fig. c). Furthermore, it is noted that all data were resampled to 0.5 for consistency; both MSWEP and TMPA 3B42 may produce better calibration scores if the original 0.25 resolution data are used.
Caveats and future work
Although MSWEP overall performed better than existing state-of-the-art gauge-adjusted datasets, some caveats need mentioning:
As with many datasets, spurious temporal discontinuities can occur due to (i) the introduction of the satellite data source in 1998, (ii) satellite sensor degradation and instrument changes, (iii) differences in time coverage among stations, and (iv) changes in the observations assimilated in the reanalysis datasets (e.g., ). Thus, care should be exercised when using MSWEP for trend analysis.
Sub-daily temporal variability in tropical regions should be interpreted with caution prior to the launch of TRMM in 1998, since atmospheric models poorly represent the diurnal cycle in (sub-)tropical regions .
The estimated bias correction factors are influenced by uncertainties in the , , and catchment boundary data and particularly the interpolation. The spatial bias correction factors are available along with the data, so the original, non-corrected long-term averages can be reconstructed.
The use of PRISM data for the conterminous USA and the restriction of the bias correction to regions with snowfall and/or complex topography may result in spatial discontinuities in the long-term average of MSWEP.
Although MSWEP has a 0.25 spatial resolution (corresponding to 25 km at the equator), the effective spatial resolution reflects the gauge density in regions where the gauge component dominates (Fig. b), while it reflects the resolution of the atmospheric models ( 80–125 km) in regions where the reanalysis component dominates (Fig. d). In regions where the satellite component dominates, the effective resolution can be assumed to be 0.25 (Fig. c).
Future updates and improvements of MSWEP are intended to take advantage of new and improved data sources. For example, MSWEP may be updated with ERA5, the planned successor to ERA-Interim, while the TMPA dataset may be replaced by the Integrated Multi-satellitE Retrievals for GPM (IMERG) dataset once it has been extended backwards to the start of the TRMM era. Given the promising complementary information from SM2RAIN-ASCAT (Fig. ), any such dataset ingesting soil moisture retrievals from multiple satellites should also be considered for inclusion in MSWEP, if available. Moreover, the gauge component of MSWEP may be improved by accounting for the variable correlation length in the interpolation (cf. ), while the biases at high northern latitudes may be improved using Gravity Recovery and Climate Experiment (GRACE) data (cf. ). Finally, we hope to explore the possibility of producing a near-real time variant of MSWEP. Many of the datasets incorporated in MSWEP are available in near-real time, suggesting this is feasible.
Conclusions
We developed a global gridded dataset (1979–2015) with several novel features: (i) it takes advantage of a wide range of data sources, including gauge, satellite, and reanalysis data, to provide reliable estimates at global scale; (ii) it accounts for gauge under-catch and orographic effects using observations from an unprecedentedly large set of 13 762 stations worldwide; and (iii) it has a high 3-hourly temporal and 0.25 spatial resolution. We summarize our findings as follows:
The high-resolution global CHPclim climatic dataset, which explicitly accounts for orographic effects, still appears to contain widespread biases. These biases could not be satisfactorily corrected using country-specific equations that account for gauge under-catch, suggesting that CHPclim still underestimates orographic in some regions, probably mainly due to the topographic bias in gauge placement. We used the relationship in combination with observations from 13 762 stations around the globe to produce a bias-adjusted variant of CHPclim for use in MSWEP.
Eight non-gauge-adjusted datasets were evaluated in terms of 3-day temporal variability using observed data from thousands of gauges around the globe to assess their value for inclusion in MSWEP. For rainfall, CMORPH, GSMaP-MVK, and TMPA 3B42RT performed equally well, while PERSIANN and SM2RAIN-ASCAT performed less well. The satellite datasets consistently performed better in the tropics, while the reanalysis datasets consistently performed better at mid- to high latitudes. For snowfall, all satellite datasets performed poorly, while the reanalysis datasets performed even better than for rainfall. The obtained correlation coefficients were interpolated to produce global weight maps for the merging scheme of MSWEP.
Independent observed data from 125 FLUXNET tower stations around the globe were used to evaluate the performance of MSWEP as well as four existing state-of-the-art global datasets (GPCP-1DD, WFDEI-CRU, TMPA 3B42, and CPC Unified). The median value obtained by MSWEP was 0.67 vs. 0.44–0.59 for the other datasets, with MSWEP obtaining the best score for 60.0 % of the stations. MSWEP also performed best in terms of RMSE, but performed averagely in terms of absolute bias.
We further evaluated the performance of MSWEP using hydrological modeling for 9011 catchments ( 50 000 km) around the globe. Specifically, we calibrated the HBV hydrological model against daily observations using from MSWEP and the four previously mentioned existing datasets. For the 1058 sparsely gauged catchments (defined by catchment-mean distance to the closest gauge 25 km), representative of 83.9 % of the global land surface (excluding Antarctica), MSWEP obtained a median NSE of 0.52 vs. 0.29–0.39 for the other datasets.
Data availability
MSWEP is available via
FLUXNET stations used in this study
The 125 FLUXNET sites used in this study including the primary reference, where available, are: AR-SLu, AR-Vir, AT-Neu, AU-ASM , AU-Ade, AU-Cpr, AU-Cum, AU-DaP, AU-DaS, AU-Dry, AU-Emr, AU-Fog, AU-RDF, AU-Rig, AU-Tum, AU-Whr, BE-Bra, BE-Lon, BE-Vie, BR-Sa3, CA-Gro , CA-NS1 , CA-NS3 , CA-NS4, CA-NS5 , CA-NS6 , CA-NS7 , CA-Qfo , CA-SF1 , CA-SF2 , CA-SF3 , CA-TP2 , CG-Tch, CH-Cha , CH-Fru , CH-Oe1 , CN-Cha, CN-Cng, CN-Dan, CN-Din, CN-Du2, CN-Du3, CN-Ha2, CN-HaM, CN-Qia, CN-Sw2, CZ-BK1, CZ-BK2, DE-Akm, DE-Gri, DE-Hai, DE-Kli, DE-Lkb, DE-Obe, DE-RuS, DE-Spw, DE-Tha, DK-Eng, DK-NuF , DK-Sor, DK-ZaH , ES-LJu, ES-LgS, ES-Ln2, FI-Hyy, FI-Jok, FR-Gri, FR-Pue, GH-Ank, IT-CA1, IT-CA2, IT-CA3, IT-La2, IT-Lav, IT-PT1, IT-Ren, IT-Ro1, IT-Ro2, IT-Tor, JP-MBF, JP-SMF, MY-PSO, NL-Hor, NL-Loo, NO-Adv, PA-SPn, PA-SPs, RU-Che, RU-Cok, RU-Fyo, RU-Ha1, RU-Sam, RU-SkP, RU-Vrk, SD-Dem , SE-St1, US-AR1, US-AR2, US-ARM , US-Blo , US-CRT , US-Goo, US-Ha1 , US-IB2 , US-Ivo , US-Lin, US-Los , US-MMS , US-Me6 , US-Myb, US-Ne1 , US-Ne2 , US-Ne3 , US-Oho , US-SRM , US-Syv , US-Ton , US-UMd , US-Var , US-WCr , US-WPT , US-Whs , and US-Wkg , ZA-Kru, and ZM-Mon.
The authors declare that they have no competing interests.
Acknowledgements
We gratefully acknowledge the dataset developers for producing and making their datasets available. The GRDC and the USGS are thanked for providing most of the observed data. We further wish to thank Vera Thiemig, Feyera Aga Hirpa, and Jutta Thielen-del Pozo for fruitful discussions on the intercomparison and evaluation of datasets; Roxana Petrescu and Artem Budishchev for useful comments regarding the FLUXNET dataset; Emanuel Dutra for advice on the merging procedure; and Niko Wanders, Luca Brocca, Alessandro Cescatti, and an anonymous reviewer for reviewing an earlier draft. This work used eddy covariance data acquired and shared by the FLUXNET community, including these networks: AmeriFlux, AfriFlux, AsiaFlux, CarboAfrica, CarboEuropeIP, CarboItaly, CarboMont, ChinaFlux, Fluxnet-Canada, GreenGrass, ICOS, KoFlux, LBA, NECC, OzFlux-TERN, TCOS-Siberia, USCCC, and Swiss FluxNet. The FLUXNET eddy covariance data processing and harmonization was carried out by the ICOS Ecosystem Thematic Center, AmeriFlux Management Project and Fluxdata project of FLUXNET, with the support of CDIAC, and the OzFlux, ChinaFlux, and AsiaFlux offices. This research received funding from the European Union Seventh Framework Programme (FP7/2007–2013) under grant agreement no. 603608, “Global Earth Observation for integrated water resource assessment”: eartH2Observe. Vincenzo Levizzani acknowledges partial funding from the “Progetto di Interesse NextData” of the Italian Ministry of Education, University, and Research (MIUR). Diego Miralles acknowledges financial support from The Netherlands Organization for Scientific Research through grant no. 863.14.004. The views expressed herein are those of the authors and do not necessarily reflect those of the European Commission. Edited by: P. Gentine Reviewed by: N. Wanders, R. Wartenburger, and one anonymous referee
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2017. This work is published under http://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Current global precipitation (
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details




1 European Commission, Joint Research Centre (JRC), Via Enrico Fermi 2749, 21027 Ispra (VA), Italy
2 Fenner School of Environment & Society, The Australian National University, Canberra, Australia
3 National Research Council of Italy, Institute of Atmospheric Sciences and Climate (CNR-ISAC), Bologna, Italy
4 Inland Water Systems Unit, Deltares, Delft, the Netherlands
5 Laboratory of Hydrology and Water Management, Ghent University, Ghent, Belgium; Department of Earth Sciences, VU University Amsterdam, Amsterdam, the Netherlands
6 Laboratory of Hydrology and Water Management, Ghent University, Ghent, Belgium