Full text

Turn on search term navigation

Isaac Mugume 1 and Charles Basalirwa 1 and Daniel Waiswa 1 and Joachim Reuder 2 and Michel d. S. Mesquita 3 and Sulin Tao 4 and Triphonia J. Ngailo 5

Academic Editor:Aiguo Song

1, Department of Geography, Geoinformatics and Climatic Sciences, Makerere University, P.O. Box 7062, Kampala, Uganda

2, Geophysical Institute, University of Bergen, Allegaten 70, 5007 Bergen, Norway

3, Uni Research Climate, Bjerknes Centre for Climate Research, Bergen, Norway

4, School of Applied Meteorology, Nanjing University of Information Science and Technology, Nanjing, Jiangsu 21004, China

5, Department of General Studies, Dar es Salaam Institute of Technology, P.O. Box 2958, Dar-es-Salaam, Tanzania

Received 17 February 2016; Accepted 4 April 2016

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1. Introduction

The models are used in many fields such as engineering, agriculture, health, business, and weather and climate for simulation and prediction. They help to understand the different subprocesses underlying a given process and have undergone tremendous improvements due to developments in computing technology. These models range from simple (e.g., linear regression models) to complex models (e.g., weather and climate prediction models); Glahn and Lowry [1] categorized the models as dynamical and statistical. A combination of dynamical and statistical models is also used in operational forecasting especially using statistical techniques to correct output from a dynamical model.

The national meteorological services usually operate high resolution numerical weather prediction models so as to give accurate guidance to users of weather information [2]. The accuracy of a given model is the measure of how close the model predicted fields are compared to independently observed atmospheric fields [3, 4] but it can be affected by errors in initial conditions, imperfections in the model, and inappropriate parameterizations. When a model agrees with observations, the confidence in using the model is higher [5] but the present agreement does not necessarily guarantee the skill for the future model prediction.

The main advantage of models is their objectivity [1]. However, the presence of systematic errors is due to bias [6] which occurs due to difference in model response to external forcing [7] such as errors in initial conditions. This bias can manifest as overprediction or underprediction and is defined by the World Meteorology Organization as the mean difference between forecast values and mean actual observations [8] while Haerter et al. [9] define bias as time independent component of error in model output.

A couple of methods have been proposed to correct for the bias. Maraun [10] used quantile-quantile method and found that uncorrected regional climate models underestimated precipitation and produced many drizzle cases. Durai and Bhradwaj [11] investigated four statistical bias correction methods (namely, best easy systematic method, lagged linear regression, nearest neighbor, and running mean removal) and noted that the running mean and nearest neighbor methods improved the forecast skill. These methods attempt to reduce the bias in the next forecast using the information from the bias of the previous forecast [12]; however they influence the model output if prediction is based on bias corrected data [8] and they cannot correct improper representation of processes producing the model output [9].

Many studies have employed the parametric methods such as RMSE [13-15], MAE [14, 15], and ME [16] relative error [13, 16] to analyze the bias of numerical models but have put less emphasis on graphical tools as well as the nonparametric method. In the present study, we investigate the performance of the bias analysis methods on actual January 2015 temperature data and simulated temperature data using the Weather Research and Forecast (WRF) model (Tables 1 and 2). The rest of the paper is organized as follows: Section 2 describes the data sources, Section 3 presents overview of the methods of bias analysis, Section 4 presents results and discussion, and Section 5 gives summary and conclusion.

Table 1: Statistical bias measures of actual and model simulation for maximum temperatures.

Measure	arua	ebb	ksse	jinja	mbra	gulu
RMSE	2.19	7.50	4.41	8.86	2.47	2.37
MAE	2.01	7.22	4.05	8.60	2.24	1.98
ME	-2.01	-7.22	-4.05	-8.60	-1.92	-1.91
Rel. bias	-0.06	-0.26	-0.12	-0.28	-0.07	-0.06
BES	-2.06	-7.13	-4.03	-8.69	-2.21	-1.94
Skewness	0.80	-0.17	0.34	0.12	0.89	0.10
STM	-0.97	-1.00	-0.94	-1.00	-0.81	-0.81

Table 2: Statistical bias measures of actual and model simulation for minimum temperatures.

Measure	arua	ebb	ksse	jinja	mbra	gulu
RMSE	5.59	2.77	1.58	3.90	2.83	2.19
MAE	5.31	2.37	1.25	3.31	2.50	1.72
ME	5.31	-1.69	0.78	3.25	-2.39	0.21
Rel. bias	0.46	-0.09	0.05	0.21	-0.15	0.01
BES	5.23	-1.86	0.66	3.31	-2.29	0.11
Skewness	0.32	0.39	0.44	-0.06	-0.16	0.09
STM	1.00	-0.55	0.35	0.87	-0.94	-0.03

2. Data

We simulate January 2015 temperature using WRF model version 3.7 [17], with parameterizations schemes: WRF single moment 6-class scheme microphysics, the Kain-Fritsch cumulus parameterization, the Asymmetric Convective Model option for planetary boundary layer, the Rapid Radiative Transfer Model for longwave radiation, and the Dudhia scheme for shortwave radiation. This data is compared with observed January 2015 temperature (maximum and minimum temperature) data obtained from the Uganda National Meteorological Authority (UNMA). We use six stations (namely, Arua (arua), Entebbe (ebb), Kasese (ksse), Jinja (jinja), Mbarara (mbra), and Gulu (gulu)). For a given day and station, the maximum simulated temperature is compared with the maximum observed temperature and the minimum simulated temperature is compared with the minimum observed temperature.

3. Methods of Bias Analysis

In order to comprehensively investigate the performance of numerical models, it is important to evaluate them on many metrics other than using a single method [5]. In this section, we present the popular methods for analyzing bias of numerical models. The parametric methods are presented in Sections 3.1-3.6 while the nonparametric method considered is described in Section 3.7.

3.1. The Difference Measures

Willmott et al. [3] suggested a difference variable, D, given by the difference between the model predicted value, M, and observed value, O, that is, [figure omitted; refer to PDF] This is appropriate for point measurements. It is this measure that gives rise to other measures like the root mean square error (RMSE), the bias or mean error (ME), and the mean absolute error (MAE).

For a model j, with time-ordered data set (_Mi ) we define the difference _Dij as follows: [figure omitted; refer to PDF] where i is the ith data point and _Oi is the corresponding ith observed value from time-ordered actual observed data set (_Oi ). A positive (negative) value indicates that model output is higher (lower) than the actual values.

3.2. The RMSE

The RMSE is the square root of the average squared differences (^Dij2 ) and is a popular statistical measure for the performance of numerical model in atmospheric research [15]. For a model, j, the RMSE is thus defined as follows: [figure omitted; refer to PDF]

The RMSE is a good criteria to classify the accuracy of a model and a low index indicates higher accuracy.

3.3. The MAE

The MAE is the average of the magnitudes of differences (_Dij taken as positive) and is also a popular index for estimating bias in atmospheric studies. For a model, j, the MAE is defined as follows: [figure omitted; refer to PDF] and, just like RMSE, a low index indicates higher accuracy.

3.4. The Bias

The bias, also known as the mean error (ME), is obtained by averaging the differences (_Dij ) over the number of cases. For a given model output, _Mj , the ME is calculated from [figure omitted; refer to PDF] The magnitude of ME is equal to the MAE if all the predicted values of the model are higher (or lower) than the actual values. A value of bias close to zero indicates that model values are in fair agreement with actual values with zero implying no bias.

The relative bias is another bias measure suggested by Christakis et al. [16] in which ME is divided by average observations and given as follows: [figure omitted; refer to PDF] The bias given by (5) and (6) gives both the direction and probable magnitude of the error.

3.5. The Skewness Coefficient

The skewness coefficient is a moment measure based on symmetry [18]. Having obtained the differences between the model and actual values (_Dij ), positive (or negative) skewness indicates that model outputs are largely lower (or higher) than actual observations. The skewness coefficient is defined as follows: [figure omitted; refer to PDF] with _sj as the standard error of the sample biases forming a distribution {_D1j ,_D2j ,...,_Dnj } and calculated as follows: [figure omitted; refer to PDF]

3.6. The Bias Easy Systematic Method

The bias easy systematic (BES) method considers location measures (especially quartiles) and is given by Durai and Bhradwaj [11] as follows: [figure omitted; refer to PDF] where _q1 , _q2 , and _q3 are the sample lower quartile, median, and upper quartile, respectively, of the differences, _Dij , and it is commended for its robustness for taking care of extreme values by Woodcock and Engel [12].

3.7. Sign Test Method

The sign test method (STM) is a nonparametric method based on assigning a score, _θ^ij , that compares the prediction, _Mij , and observation, _Oi , at a given point. If the model predicts higher values than observation (_Mij >_Oi ), we assign positive one (i.e., _θ^ij =+1), if the model prediction is equal to observed value (_Mij =_Oi ), we assign zero (i.e., _θ^ij =0), and if the model predicts a value lower than observation (_Mij <_Oi ), we assign negative one (i.e., _θ^ij =-1); thus [figure omitted; refer to PDF]

For a model j forming a distribution of scores, (_θ^ij ), of size n, such that (_θ^1j ,_θ^2j ,...,_θ^nj ), the mean is computed as follows: [figure omitted; refer to PDF]

If the mean score, _Θ¯j , for a given model is positive, the model is generally considered to overpredict; if it is negative then the model underpredicts. Otherwise there is no significant bias. We suggest the hypothesis as [figure omitted; refer to PDF] and consider for unbiased model (i.e., zero bias) [figure omitted; refer to PDF] For a distribution of sample size less than 30 (n<30), we propose the use of Student's t-distribution and make approximation to normal distribution for large samples (n>30). The standard error is computed using [figure omitted; refer to PDF] The nonparametric statistic for measuring bias is then corrected and calculated using [figure omitted; refer to PDF] We can then test this for a given significance level and make statistical inferences.

4. Results and Discussion

In comparing model results with observations, we assume that observed values are accurate and that it is the model predicted values that contain error because, as explained by Piani et al. [19], the models have inconsistencies that are sometimes not solved by bias correction. This thus brings the necessity of clearly determining the direction and magnitude of the bias. The magnitude of the bias can be affected by other factors, namely, the geographical location and season [11]. These factors are not considered in the study but it is possible to compare spatial and temporal bias using the different bias analysis methods.

Table 1 presents bias estimation using maximum temperatures as simulated by WRF model and actual observed values for maximum temperature while Table 2 presents bias estimation using model simulated values and actual observed values for minimum temperature. These tables help to explore the different possible cases and we obtain a negative bias for all maximum temperatures (Table 1) and some stations have positive bias for some minimum temperatures (Table 2). These cases are also presented using time series figures (Figures 1-12). The time series figures help to investigate how the biases change with time and the greater the departure from the curves (model simulated curve and observed curve), the greater the bias. For Gulu, (Figures 11 and 12) the model and actual observations follow roughly the same trend. For Kasese (Figure 6) there is high variability for actual minimum temperatures compared to those presented by model. For Jinja (Figure 7) actual observations have increasing trend while model values have a decreasing trend over the period (20-30 days). These results imply that a given model can have varying performance in different geographical regions, hence bias.

Figure 1: Arua: max_temp.