Indian monsoon data assimilation and analysis

INTRODUCTION

The Indian Monsoon Data Assimilation and Analysis (IMDAA) project is a formal collaboration between the Met Office (MO), the National Centre for Medium Range Weather Forecasting (NCMRWF) and the India Meteorological Department (IMD). The project is funded by the Indian Ministry of Earth Sciences through the National Monsoon Mission. The principal aim of this 4‐year project is to develop and run, for the first time, a long‐period high‐resolution regional reanalysis over the Indian subcontinent. The development has been completed and production runs are underway. The reanalysis will produce a consistent data set of high‐resolution fields for a wide range of atmospheric variables, available from 1979 to 2016 (the satellite era). Production runs began in 2017, and as of May 2017, 10 years of computation have been completed.

The monsoon is the primary weather phenomenon affecting the Indian subcontinent and is distinguished by the seasonal reversal of wind and the associated changes in precipitation. There are several comprehensive reviews of the monsoon that describe its main characteristics, predictability and prediction (Webster et al., ; Gadgil, ; Goswami, ). Since the monsoon provides around 80% of annual rainfall (Turner and Annamalai, ), agriculture in the region is highly dependent on the monsoon's strength and onset date. Climate models yield uncertainty about how the changing climate is affecting the monsoon (Dobler and Ahrens, ). The IMDAA reanalysis will be a useful tool for increasing our understanding of the monsoon and of how it has changed over the past four decades, and will provide scientists with a better framework to understand future monsoon trends.

The prediction of the monsoon is notoriously difficult and there are many aspects of monsoon processes from the onset through the development and decay that are relatively poorly understood and represented in model simulations (Turner et al., ). The IMDAA reanalysis will produce a long‐term historical record of climate and extreme weather events over a region spanning the Indian peninsula and surrounding areas in the form of a high‐resolution data set, which can be exploited to better understand the characteristics of the monsoon. To be able to use this data confidently in studies it is paramount that the quality of the reanalysis is well understood.

The IMDAA reanalysis is produced on a limited area domain, allowing use of much higher resolution (12 km) than is typical of global reanalyses. The higher resolution grid allows better representation of real‐world characteristics such as orography and coastline. The hope is that the higher resolution will give improved representation of physical processes and improved use of high‐resolution observation data. In a similar reanalysis over Europe (EURO4M project, Jermey and Renshaw ()), it was found that the higher resolution model outperformed its global driving model particularly in simulating intense small‐scale rainfall events.

A pilot reanalysis was run for 2008 and 2009 prior to production runs. The 2 years were run separately as two streams, each with a 2‐week spin‐up and therefore each starting at the end of the preceding year. This paper outlines the system developed for the reanalysis and used in this pilot run, and presents the results of the pilot study. A follow‐on paper is being prepared extending the analysis to the first 10 years of the reanalysis (1979–1989).

Details of the IMDAA reanalysis system, including the forecast model, data assimilation and observations used are described in section 2. In section 3, initial results of the reanalysis system during a pilot study are presented. A brief summary of the conclusions and a discussion are provided in section 4.

MODEL AND DATA

Forecast model

The reanalysis system uses the Met Office Unified Model, UM (Davies et al., ), with the Even Newer Dynamics dynamical core (ENDGame), described in Wood et al. (). This dynamical core uses a semi‐implicit semi‐Lagrangian formulation to solve the non‐hydrostatic, fully compressible deep‐atmosphere equations of motion. Prognostic fields are discretized horizontally onto a regular latitude–longitude grid with Arakawa C‐grid staggering (Arakawa and Lamb, ), whilst the vertical discretization employs a Charney–Phillips staggering (Charney and Phillips, ) using terrain following hybrid height coordinates. The discretised equations are solved using a nested iterative approach centred about solving a linear Helmholtz equation.

The IMDAA reanalysis is produced with the Met Office UM Global Atmosphere 6.0 configuration. Full details of the configuration are described in Walters et al. (), which also outlines the parametrizations employed to represent sub‐grid scale processes in the atmosphere, such as convection, the surface, the boundary layer and mixed‐phase cloud physics.

The lateral boundary conditions are provided by the European Centre for Medium‐Range Weather Forecasts Interim Reanalysis (ERA‐Interim) (Dee et al., ). UM analyses were not used because the UM forecast model has evolved over the period and, unlike ERA‐Interim, is therefore not consistent. The model top boundary is a solid lid at 40 km. The Hadley Centre Ice and Sea Surface Temperature data set version 2 (HadISST2) (Titchner and Rayner, ) provides sea surface temperatures (SST) to the reanalysis system up to 2010. For the modern period, 2010 to present day, the MO Operational sea surface temperature and sea ice analysis (OSTIA) is used (Donlon et al., ) for SST, after first degrading it to the resolution of HadISST2.

The domain of the IMDAA Regional Reanalysis system is shown in Figure . The regional model domain includes more than just the Indian peninsula. The domain extends westwards out to 30°E to incorporate East Africa and eastwards to 120°E to include East Asia. The latitudinal extent is from 45°N to 15°S. This large domain was chosen to fully incorporate all the known areas of monsoon influences such as the East African highlands, Himalayas and Bay of Bengal. The vast extent of this domain also enables this reanalysis to be used not just in South Asian monsoon studies but also in analysing the East Asian monsoon.

IMDAA domain displaying the orography from the model in metres

The domain has horizontal resolution of the order of 12 km (or 0.11°), which is higher than currently available global reanalyses, including both ERA‐Interim (80 km) and the new Fifth European Centre for Medium‐Range Weather Forecasts Reanalysis (30 km, ERA5), (Hersbach and Dee, ). The reanalysis is produced on 63 model levels reaching to a height of approximately 40 km.
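As a rough check on the quoted grid spacing, the north–south distance spanned by 0.11° of latitude can be estimated on a spherical Earth. The mean Earth radius R ≈ 6371 km used here is an assumption brought in for illustration, not a value taken from the text.

```latex
\Delta y \;\approx\; \frac{\pi}{180^\circ} \times 0.11^\circ \times R
         \;\approx\; 0.01745 \times 0.11 \times 6371\ \text{km}
         \;\approx\; 12.2\ \text{km}
```

On a regular latitude–longitude grid the east–west spacing scales with the cosine of latitude, so under the same assumption it would shrink to roughly 8.6 km at the northern edge of the domain (45°N).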

Data assimilation

Four times a day, four‐dimensional variational (4DVAR) data assimilation (Rawlins et al., ) is performed, which estimates the optimal atmospheric state given the observations and background state within a 6‐hr window, assimilating both satellite and conventional observation data. These reanalyses of the atmospheric state are produced every 6 hr and (re)forecasts from these give atmospheric states for intermediate hours. Since the reanalysis is based on a full numerical weather prediction (NWP) system, a full set of physically consistent meteorological fields is produced at each analysis and forecast time.
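For readers unfamiliar with the method, the cost function minimised by a generic strong‐constraint 4DVAR can be written as below. This is the textbook form and is not necessarily the exact formulation used in the Met Office system. All symbols are notation introduced here for illustration: x_0 is the state at the start of the window, x_b the background state, B and R_i the background and observation error covariances, y_i the observations at time i, H_i the observation operator and M_{0→i} the forecast model integrated from the start of the window to time i.

```latex
J(\mathbf{x}_0) =
  \tfrac{1}{2}\,(\mathbf{x}_0 - \mathbf{x}_b)^{\mathrm T}\,\mathbf{B}^{-1}\,(\mathbf{x}_0 - \mathbf{x}_b)
  + \tfrac{1}{2}\sum_{i=0}^{N}
    \bigl(\mathbf{y}_i - H_i(M_{0\to i}(\mathbf{x}_0))\bigr)^{\mathrm T}\,
    \mathbf{R}_i^{-1}\,
    \bigl(\mathbf{y}_i - H_i(M_{0\to i}(\mathbf{x}_0))\bigr)
```

The first term penalises departures from the background and the second penalises departures from the observations distributed through the window, which here would be the 6‐hr window described above.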

Observations

The reanalysis takes advantage of the substantial work of the ECMWF ERA team in collating and archiving many decades of observation data. This data is available from the ECMWF MARS archive system. The observation types and number of observations assimilated per cycle (i.e., every 6 hr assimilation), for the 2008–2009 pilot were:

Surface stations (land, ship, buoy)—2,200
Upper air (radiosonde, pilot, wind profiler)—100
Aircraft (AMDAR, AIREP)—1,700
AIRS satellite radiances—3,500
ATOVS satellite radiances—11,500
IASI satellite radiances—2,500
GPS radio occultations (bending angle)
Atmospheric Motion Vectors (satellite winds)—1,000
Scatterometer winds—1,000

NCMRWF and IMD have worked to retrieve extra observation data from locally held archives. A substantial number of surface and upper air observations have been recovered from magnetic tape. Figure shows (in red) stations (for June 15, 1997) that were not available from the ECMWF archive for surface (map on left) and upper air (map on right) observations.

Location of surface (left) and upper air (right) observations for June 15, 1997 from the ECMWF archive (blue) and those recovered from tape by NCMRWF (red)

In any analysis it is important to exclude observations of poor quality. Two approaches were taken to reject bad observations. Bayesian quality control (Ingleby and Lorenc, ) rejects any individual observation that is judged to differ too greatly from the model. In addition, on a monthly basis, any poor‐quality data identified from observation‐minus‐background statistics are added to rejection lists and excluded from assimilation. Any station showing a significant bias or standard deviation relative to the background is rejected for the whole month. The system also calculates bias corrections for surface pressure and for aircraft and sonde temperature.
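The monthly rejection‐list step described above can be sketched as follows. This is a minimal illustration, not the IMDAA implementation: the text does not give the rejection criteria, so the function name `monthly_rejection_list`, the thresholds `bias_limit` and `std_limit`, the `min_reports` guard and the station identifiers are all hypothetical.

```python
from statistics import fmean, pstdev


def monthly_rejection_list(omb_by_station, bias_limit, std_limit, min_reports=2):
    """Return stations whose observation-minus-background (O-B) statistics
    for one month exceed the given limits.

    omb_by_station: dict mapping a station id to its list of O-B departures.
    bias_limit, std_limit: hypothetical thresholds (not taken from the paper).
    """
    rejected = []
    for station, departures in omb_by_station.items():
        if len(departures) < min_reports:
            continue  # too few reports to compute meaningful statistics
        bias = fmean(departures)     # mean departure from the background
        spread = pstdev(departures)  # standard deviation of the departures
        if abs(bias) > bias_limit or spread > std_limit:
            rejected.append(station)
    return sorted(rejected)


# Made-up surface-pressure departures (hPa) for one month
example = {
    "STN_A": [0.2, -0.1, 0.3, 0.0],   # small bias, small spread
    "STN_B": [2.9, 3.1, 3.0, 3.2],    # large persistent bias
    "STN_C": [-4.0, 4.0, -3.5, 3.5],  # no bias but large spread
}
rejected = monthly_rejection_list(example, bias_limit=1.0, std_limit=2.0)
```

With these made‐up numbers the biased station and the noisy station are both rejected, while the well‐behaved one is kept.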

For satellite data, usage was guided by the experience of the ERA reanalyses. As part of ERA‐40 and ERA‐Interim, the ECMWF reanalysis team have constructed lists of dates when individual instruments, and even individual channels, are and are not reliable. The IMDAA reanalysis uses these in its own selection. It also follows ECMWF in using VarBC (Variational Bias Correction) to apply bias correction to satellite radiances (Dee and Uppala, ). VarBC analyses bias corrections as part of the assimilation process. In this way the biases change with time so as to fit drifts in instrument bias.

Table summarises the fits to observations over the 2‐year pilot period. The mean of the root mean square (RMS) differences between the observations and the reanalyses (O–A) and the mean of the RMS differences between the observations and the background (O–B) are given for selected observations across the entire domain. The RMS O–A values are smaller than the RMS O–B values, indicating that the reanalyses are closer to the observed state than the background is.

Summary of fits to observations

Observation	O‐B	O‐A
SYNOP temperature (K)	1.55	1.11
SYNOP relative humidity (0–1)	0.09	0.06
SYNOP u component of wind (m/s)	1.71	1.57
SYNOP v component of wind (m/s)	1.70	1.56
SYNOP pressure (Pa)	89.19	79.65
AMV u component of wind (m/s)	3.58	3.42
AMV v component of wind (m/s)	3.21	3.04
Aircraft u component of wind (m/s)	2.71	2.22
Aircraft v component of wind (m/s)	2.71	2.23
Aircraft potential temperature (K)	1.41	1.24
Scatterometer u component of wind (m/s)	1.58	1.38
Scatterometer v component of wind (m/s)	1.64	1.44

The mean of RMS differences between the observations and the reanalyses (O–A) and the mean of RMS differences between the observations and the background (O–B) are given for selected observations across the entire domain. AMV are atmospheric motion vectors.
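The statistic reported in the table can be sketched as below, assuming that an RMS is computed per assimilation cycle and then averaged over cycles, with the background and analysis already interpolated to the observation locations. The function names and the sample values are illustrative only.

```python
import math


def rms(values):
    """Root mean square of a sequence of departures."""
    return math.sqrt(sum(v * v for v in values) / len(values))


def mean_rms_fit(cycles):
    """Average the per-cycle RMS of O-B and O-A.

    cycles: list of (obs, background, analysis) tuples, one per cycle,
    each element a list of values at the observation locations.
    Returns (mean RMS O-B, mean RMS O-A).
    """
    rms_ob, rms_oa = [], []
    for obs, background, analysis in cycles:
        rms_ob.append(rms([o - b for o, b in zip(obs, background)]))
        rms_oa.append(rms([o - a for o, a in zip(obs, analysis)]))
    return sum(rms_ob) / len(rms_ob), sum(rms_oa) / len(rms_oa)


# Made-up temperatures (K) for two cycles
cycles = [
    ([300.0, 301.0, 299.0], [301.0, 299.0, 299.0], [300.5, 300.0, 299.0]),
    ([295.0, 296.0], [296.0, 295.0], [295.5, 295.5]),
]
mean_ob, mean_oa = mean_rms_fit(cycles)
```

In this toy case the analysis sits closer to the observations than the background does, so the mean RMS O–A is smaller than the mean RMS O–B, which is the behaviour the table reports.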

Validation data

The quality of the regional reanalysis, IMDAA, is compared to its parent, ERA‐Interim. Both are compared with independent gridded observation data.

The NCMRWF‐IMD Merged Satellite‐Gauge (NMSG) data set (Mitra et al., ) provides daily precipitation estimates over India at 1° resolution (approximately 110 km), hereafter referred to as Indian gridded observations. It must be noted that, although this data set is referred to as gridded observations, it does not consist of observations in the strict sense of the word: it is created by merging rain gauge observations with satellite observations and thus involves certain assumptions and techniques. The observations used to create the data set are not assimilated by the IMDAA reanalysis or ERA‐Interim and therefore the merged data set can be used as an independent comparison for both systems.

The National Oceanic and Atmospheric Administration (NOAA) Climate Prediction Center Morphing Technique (CMORPH) (Joyce, Janowiak, Arkin, & Xie, ) provides global precipitation estimates at a quarter degree resolution (approximately 28 km) using satellite data. Again, these are independent of the IMDAA and ERA‐Interim reanalyses.

Neither the Indian gridded observations nor CMORPH are necessarily more accurate than the reanalysis data. Satellite‐derived precipitation estimates are known to have significant biases (Jiang, Ren, Yong, Yang, & Shi, ), which may make the data set less accurate than a high‐resolution reanalysis. Over land, CMORPH is derived from data from rain gauges, which provide accurate precipitation measurements at their point location, but may not be representative of the wider grid‐box. For the purposes of comparison, CMORPH may be considered accurate at larger scales over land and both gridded observational data sets may be considered accurate with position, but not intensity.

RESULTS

This section describes some of the results seen in the reanalysis. It specifically aims to show the main monsoon features in the precipitation and wind fields and to compare them against the Indian gridded observations (received from NCMRWF), ERA‐Interim and CMORPH.

Seasonal precipitation accumulation plots, calculated from June to September (JJAS), show good agreement overall between the global reanalysis, regional reanalysis and Indian gridded observations for both 2008 and 2009. Figures and show the seasonally accumulated precipitation for JJAS for 2008 and 2009 for (a) IMDAA reanalysis, (b) ERA‐Interim, (c) Indian gridded observations and (d) CMORPH. All the major precipitation areas for this season are depicted in all four data sets: the precipitation band along the Western Ghats, the band at the foothills of the Himalayas and the precipitation in the North Bay of Bengal. The rain shadow (area of little precipitation east of the Ghats) is also discernible in all four data sets. However, closer examination reveals subtle differences between the two reanalyses and the gridded observations. First, the lower resolution of the global reanalysis, ERA‐Interim, is evident in its smoothed precipitation field (Figures b and b): it broadly captures the precipitation but cannot represent the fine‐scale detail. The regional reanalysis (Figures a and a) depicts a more complex structure and generally higher maximum rainfall. There also appear to be finer mesoscale processes occurring, as evidenced by the filamental structure visible in the precipitation. It is difficult to assess whether this is correct as the gridded observations are at a much coarser resolution. Broadly, IMDAA matches the location of high precipitation shown in the two gridded observational data sets (Figures c,d and c,d) and its values of precipitation accumulation are also closer to those seen in the gridded data sets. There is a tendency for IMDAA to confine the precipitation in a narrower band than that seen in the gridded observations, although this may be an artefact of comparing data sets at different resolutions. However, if IMDAA is regridded to the coarser resolution of the Indian gridded observations (not shown), the high precipitation regions are still not as spread out as those seen in the gridded observational data sets. A more objective analysis, extended to more seasons, would prove useful.
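The effect of regridding on a narrow rain band can be illustrated with a simple block average. The text does not state which regridding method was used, so this sketch assumes an area‐mean coarsening by an integer factor on an unweighted grid. The ratio between 0.11° and 1° is not an integer, so a real comparison would need a proper conservative regridding tool; `block_mean` and the sample field are hypothetical.

```python
def block_mean(field, factor):
    """Coarsen a 2-D field (list of rows) by averaging factor x factor blocks."""
    n_rows, n_cols = len(field), len(field[0])
    if n_rows % factor or n_cols % factor:
        raise ValueError("grid dimensions must be multiples of factor")
    coarse = []
    for i in range(0, n_rows, factor):
        row = []
        for j in range(0, n_cols, factor):
            block = [field[ii][jj]
                     for ii in range(i, i + factor)
                     for jj in range(j, j + factor)]
            row.append(sum(block) / len(block))
        coarse.append(row)
    return coarse


# Made-up fine-grid accumulation with a one-cell-wide band of heavy rain
fine = [
    [0, 8, 0, 0],
    [0, 8, 0, 0],
    [0, 8, 0, 0],
    [0, 8, 0, 0],
]
coarse = block_mean(fine, 2)
```

In this toy field the one‐cell‐wide band of 8 becomes a two‐cell‐wide band of 4 on the coarse grid: averaging widens the band and lowers its peak, which is why some of the narrowness seen at high resolution is expected to disappear after regridding.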

Precipitation accumulation for June–September 2008 in millimetres for (a) IMDAA, (b) ERA‐Interim, (c) Indian gridded observations and (d) CMORPH

Precipitation accumulation for June–September 2009 in millimetres for (a) IMDAA, (b) ERA‐Interim, (c) Indian gridded observations and (d) CMORPH

Root mean square error (RMSE) maps of precipitation accumulation were calculated, for all the data sets against each other, for the monsoon seasons (June–September) for 2008 (Figure ) and 2009 (not shown). Concentrating on the RMSE maps calculated with respect to the Indian gridded observations (Figures a–c), the highest RMSE values are seen, as expected, along the western Indian coastline and in the foothills of the Himalayas, that is, where most of the precipitation accumulation over the season is seen and where it was identified that the models may be modelling the extent of the precipitation differently compared to the Indian gridded observations. Figure d shows the lowest RMSE overall, when the regional IMDAA model is being compared with ERA‐Interim. The two reanalyses model the precipitation similarly, with the highest RMSE differences visible around the foothills of the Himalayas where perhaps the IMDAA reanalysis was showing a more filamental structure in precipitation than that seen in ERA‐Interim. The similarity of the two models is not surprising considering IMDAA is nested within ERA‐Interim.
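An RMSE map of this kind can be sketched as follows. The text does not spell out how the maps were computed, so this sketch assumes that the RMSE at each grid point is taken over the daily values in the season and that both data sets have already been placed on a common grid. The function name and the sample data are illustrative.

```python
import math


def rmse_map(series_a, series_b):
    """Grid-point RMSE between two data sets over time.

    series_a, series_b: nested lists indexed [time][row][col] on a shared grid.
    Returns a 2-D list [row][col] of RMSE values.
    """
    n_times = len(series_a)
    n_rows = len(series_a[0])
    n_cols = len(series_a[0][0])
    result = [[0.0] * n_cols for _ in range(n_rows)]
    for i in range(n_rows):
        for j in range(n_cols):
            squared = sum(
                (series_a[t][i][j] - series_b[t][i][j]) ** 2
                for t in range(n_times)
            )
            result[i][j] = math.sqrt(squared / n_times)
    return result


# Made-up daily precipitation (mm) for 2 days on a 1 x 2 grid
model = [[[1.0, 2.0]], [[3.0, 2.0]]]
reference = [[[1.0, 0.0]], [[0.0, 0.0]]]
errors = rmse_map(model, reference)
```

Because the differences are squared, grid points with large accumulations, such as the Western Ghats and the Himalayan foothills, tend to dominate such a map even when the relative error is modest.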

RMSE maps of precipitation accumulation for June–September 2008 in millimetres for (a) IMDAA against Indian gridded observations, (b) ERA‐Interim against Indian gridded observations, (c) CMORPH against Indian gridded observations, (d) IMDAA against ERA‐Interim, (e) IMDAA against CMORPH and (f) CMORPH against ERA

In addition to RMSE maps, Pearson correlation maps were also computed between the different data sets for 2008 (Figure ) and 2009 (not shown because of its similarity to Figure ). Once again, the highest correlation is seen in Figure d, the correlation map between IMDAA and ERA‐Interim, where there is largely a positive association between the two models. Interestingly, the lowest correlation is between the two gridded observational data sets, and this is somewhat evident in the seasonal precipitation accumulation plots (Figures and ). CMORPH appears to underestimate precipitation over a season compared to the Indian gridded observations. This highlights the need to use as many different observations as possible for comparison with IMDAA, as it is evident that there are errors in the gridded observations themselves.

Pearson correlation maps of precipitation accumulation for June–September 2008 in millimetres for (a) IMDAA against Indian gridded observations, (b) ERA‐Interim against Indian gridded observations, (c) CMORPH against Indian gridded observations, (d) IMDAA against ERA‐Interim, (e) IMDAA against CMORPH and (f) CMORPH against ERA

Other features worthy of note in the precipitation accumulation plots (Figures and ) are the better representation, in the IMDAA reanalysis relative to ERA‐Interim, of precipitation over the east coast of Vietnam and South East China in the South China Sea, the Gulf of Thailand and an area of the Indian Ocean in the south of the IMDAA domain. These features are barely captured by the global reanalysis, whereas the regional reanalysis again captures their shape and is closer in value to the gridded observations. Also, the IMDAA reanalysis does not give the excess of precipitation seen in ERA‐Interim compared to the gridded observational data sets over Malaysia and Indonesia.

The patterns of June to September 2008 seasonal mean 850 hPa winds from the regional and global reanalyses are very similar, both showing westerly flow across the Indian peninsula and south‐westerly flow over the Arabian Sea and Bay of Bengal. Mean wind speeds are also consistent between the two reanalyses, and the blocking effect of the Himalayas on the flow is seen in wind patterns exhibited by the IMDAA reanalysis and ERA‐Interim (Figure a,b).

Figure shows the timeseries of all India rainfall (AIR), the mean daily precipitation area‐averaged over India, for the two monsoon seasons (JJAS) of 2008 and 2009. Foremost, CMORPH consistently shows reduced AIR compared with the three other data sets. This can be seen for both years and throughout the 4 months. However, CMORPH does appear to follow the same peaks and troughs of the rainfall through time as the other data sets. As mentioned earlier, we would expect CMORPH to be accurate with position but not intensity and this does indeed seem to be the case. Averaged precipitation maps of CMORPH for the monsoon seasons also show less precipitation over the Indian land area than either of the reanalyses or the Indian gridded observations (Figure a–d). The other three data sets show good agreement with each other in AIR, in both the changes in daily precipitation and the intensity. Encouragingly, IMDAA appears to match the intensity of the Indian gridded observations better than ERA‐Interim.

Timeseries of averaged daily precipitation over India (mm/day) for (a) JJAS 2008 and (b) JJAS 2009
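One day's AIR value can be sketched as an area‐weighted mean over the grid points that fall inside India. The text does not say how the area average was formed, so the cosine‐of‐latitude weighting and the Boolean mask used here are assumptions, and `area_weighted_mean` and the sample values are illustrative.

```python
import math


def area_weighted_mean(field, lats_deg, mask):
    """Area-weighted mean of a 2-D field over masked-in grid points.

    field: list of rows [row][col] of daily precipitation (mm/day).
    lats_deg: latitude in degrees of each row.
    mask: same shape as field, True where the point is inside the region.
    """
    total = 0.0
    weight_sum = 0.0
    for row, lat, mask_row in zip(field, lats_deg, mask):
        weight = math.cos(math.radians(lat))  # cell area shrinks with latitude
        for value, inside in zip(row, mask_row):
            if inside:
                total += weight * value
                weight_sum += weight
    if weight_sum == 0.0:
        raise ValueError("mask selects no grid points")
    return total / weight_sum


# Made-up 2 x 2 field; one point is masked out
field = [[2.0, 4.0], [6.0, 8.0]]
lats = [0.0, 60.0]
mask = [[True, True], [True, False]]
air_value = area_weighted_mean(field, lats, mask)
```

Repeating this for each day of the season would give a timeseries of the kind plotted in the figure. Over a region spanning only a modest range of latitudes the weighting changes the result only slightly, but it keeps the average consistent across grids of different resolution.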

The mean absolute error (MAE) and Pearson correlation coefficient were also computed between the different data sets for AIR. The results are presented in Table . In summary, the largest MAE for AIR is once again seen when comparing the two gridded observation sets, CMORPH and the Indian gridded observations. IMDAA and ERA‐Interim compare similarly against the Indian gridded observations, with IMDAA showing a slightly smaller MAE and a slightly higher correlation than ERA‐Interim. However, it is clear that more years need to be examined.

MAE in mm/day and the Pearson correlation coefficient between models and gridded observations for AIR for 2008 and 2009

	2008	2009
	MAE	Pearson correlation	MAE	Pearson correlation
IMDAA vs Indian gridded observations	0.992	0.932	1.248	0.905
ERA‐Interim vs Indian gridded observations	1.494	0.916	1.326	0.888
CMORPH vs Indian gridded observations	2.55	0.829	3.558	0.771
IMDAA vs ERA‐Interim	1.202	0.924	0.917	0.914
IMDAA vs CMORPH	2.340	0.814	2.860	0.768
CMORPH vs ERA‐Interim	1.820	0.767	2.820	0.714
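The two statistics in the table can be sketched for a pair of daily AIR timeseries as below. This is a generic illustration using made‐up values, not the authors' code; the function names are hypothetical.

```python
import math


def mean_absolute_error(a, b):
    """Mean absolute difference between two equal-length timeseries."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def pearson(a, b):
    """Pearson correlation coefficient between two equal-length timeseries."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


# Made-up daily AIR values (mm/day): the second series is uniformly 1 mm/day drier
reference = [5.0, 7.0, 9.0]
drier = [4.0, 6.0, 8.0]
mae = mean_absolute_error(reference, drier)
r = pearson(reference, drier)
```

The toy series differ by a constant offset, so the MAE is non‐zero while the correlation is perfect. This is why the two measures are complementary: a data set can follow the same peaks and troughs as another and still differ systematically in intensity.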

DISCUSSION AND CONCLUSIONS

This paper highlights the successful development of the first regional reanalysis model over India, set up in collaboration with NCMRWF and IMD as part of the Monsoon Mission Project. Production runs of the regional reanalysis are underway. In this paper, a pilot study covering 2008 and 2009 has been presented, revealing that the regional reanalysis performs as well as ERA‐Interim in precipitation and wind fields over the monsoon months. The IMDAA reanalysis is capable of capturing the main precipitation areas over India in detail and is able to capture the intensity of precipitation throughout the monsoon region.

Future work will examine the high‐resolution detail of the data more closely, in particular by using it in high‐resolution studies such as the INCOMPASS project. INCOMPASS stands for Interaction of Convective Organisation and Monsoon Precipitation, Atmosphere, Surface and Sea. This was a large‐scale observational campaign in India that took place during the 2016 monsoon and included the UK's Atmospheric Research Aircraft; it should be an excellent source of independent observations.

ACKNOWLEDGEMENTS

Funding for this work was provided by the National Monsoon Mission, Ministry of Earth Sciences, Government of India. The IMDAA regional reanalysis is the result of the work of many individuals who have contributed over many years to the scientific research and development of the systems employed in the regional reanalysis. We would also like to thank all those who provided technical support in the development of this complex system. The authors are also grateful to ECMWF for making available ERA‐Interim reanalysis data sets: model fields and gridded observations.

© 2018. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

A high‐resolution, long‐term regional reanalysis over the Indian subcontinent has been developed and is currently in production. The regional reanalysis has been produced as part of the Indian Monsoon Data Assimilation and Analysis (IMDAA) project and is the outcome of a collaboration between the Met Office (MO), the National Centre for Medium Range Weather Forecasting (NCMRWF) and the India Meteorological Department (IMD). The reanalysis will produce a consistent data set of high‐resolution fields for a wide range of atmospheric variables available from 1979 to 2016. Production runs started in 2017, and computations for 10 years have been completed as of May 2017. The entire production will be completed in early 2018. This article introduces the IMDAA regional reanalysis and describes the forecast model, data assimilation method and input data sets used to produce it. Results on the performance of the system from a pilot study run for 2008–2009 are presented, indicating that the regional reanalysis is able to capture major features of the monsoon, a key phenomenon in the Indian subcontinent.

Details

Title

Indian monsoon data assimilation and analysis regional reanalysis: Configuration and performance

Author

Mahmood, Sana¹; Davie, Jemma¹; Jermey, Peter¹; Renshaw, Richard¹; George, John P²; Rajagopal, E N²; Rani, S Indira²

¹ Weather Science, UK Met Office, Exeter, UK
² NCMRWF, Noida(U.P), India

Section

RESEARCH ARTICLES

Publication year

2018

Publication date

Mar 2018

Publisher

John Wiley & Sons, Inc.

e-ISSN

1530-261X

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1002/asl.808

ProQuest document ID

2017677578
