1. Introduction
Regular monitoring of the paddy area is vital as rice production supports rural livelihoods in Asia, where more than 1.21 billion tonnes of rice were harvested to feed 4.56 billion people in 2018 [1,2]. The rice production often struggles with the effects of climate change [3,4], water shortage [5,6], inadequate machinery supplies in developing countries [7], and soil degradation [8], which in turn requires intensified monitoring to ensure sustained food production. The common practice of monitoring rice crops in Indonesia is by using local government officers at a sub-district level to collect data based on field visits and information provided by the farmers. This data is expensive to collect, non real-time, and inefficient to handle spatio-temporal changes in rice-producing areas [9,10,11]. Moreover, the paddy area needs to be monitored in near-time due to the need for a continuous water supply from irrigation canals and fertiliser inputs at critical stages [12]. Thus, timely and accurate information on rice growth stages is vital for planning and management of the rice farming system, which is critical for sustainable food security at the regional and national scale [13,14].
Remote sensing can offer cost-effective near real-time solutions to analyse land-use changes [15,16,17], cropping patterns [18,19,20,21,22], growth stages [21,23,24], and crop detection over large areas [25]. Previous studies have widely used coarse-resolution multitemporal optical imagery, such as Moderate Resolution Imaging Spectrometer (MODIS) [26,27,28,29,30], for rice monitoring. MODIS has a significant advantage of daily revisit time and able to generate vegetation indices, such as enhanced vegetation index (EVI) and normalized different vegetation index (NDVI). These indices can be used to quantify rice growth stages accurately, and therefore to recreate a temporal evolution of rice production. This data can be fed in to establish regional and national productivity of rice yield inventories [29,31,32]. However, most of the other optical satellites have less frequent visit times, and they are prone to cloud cover and shadow, resulting in gaps in the temporal data, particularly in monsoon season [33]. Additionally, their medium to coarse spatial resolution (>250 m) hampers the capability to discriminate the rice growth stages within paddy fields. Moreover, paddy rice fields are often highly fragmented, resulting in mixed pixels. Some prior studies show that single-date rice monitoring with Landsat 7/8 can be done with better accuracy [24,34,35] with a revisited time of 16 days, which is difficult to get clear data useful at the operational level.
To avoid the cloud interference, microwave or synthetic aperture radar (SAR) data from RADARSAT [36,37], ALOS-PALSAR [38,39], and Sentinel-1 [40,41,42,43,44,45], have been explored for rice mapping. The backscattering of the microwave radiation can be used to detect the rice cropping pattern due to phenological changes of the rice canopy structure having characteristic microwave scattering properties [46]. The previous studies on the mapping of rice growth stages are limited to identifying rice/non-rice area [47,48,49], and the cropping pattern using various sensors [50]. Rudiyanto, Minasny, Shah, Che Soh, Arif, and Indra Setiawan [21] demonstrated the detection of rice phenology in Indonesia and Malaysia rice fields, using hierarchical clustering with vertical-horizontal backscatter from Sentinel-1. On the other hand, Phung et al. [51] were able to correlate between incident angles of backscattering and day of planting based on smoothed VH backscattering intensity images. These studies have utilised the fact that the inundation and vegetative rice growth stages typically have the lowest backscattering intensity due to the flooded soil that reflects the radar energy specularly away from the sensor. As the rice grows, the backscattering intensity becomes higher when the rice canopy is established (e.g., reproductive phase) as the canopy can give double-bounce scattering [52,53]. Finally, the backscattering intensity becomes lower again after harvesting due to a loss of biomass [54]. The utility of RADAR for rice growth stage monitoring can also benefit from the frequent satellite revisit times, and penetration through cloud cover affected by many tropical countries. The only global and free dataset of RADAR is provided by Sentinel-1 satellite imagery. Other RADAR-capable satellites, such as ALOS-PALSAR and TerraSAR-X, are limited in uses or available via paid subscriptions. Moreover, the speckle noise issue can be a problem in complex land-use [55] and can be minimized with the deep learning approach [56,57]. Importantly, microwave remote sensing has a significant time lag to be analysed near-real-time due to its slow processing time and high demand for local knowledge to classify the rice growth stages [21]. Moreover, the short time is vital for rice mapping for validating the rice production prediction on a national or local scale.
Considering the advantages from both optical and radar images, combining multiple remote sensing data (data fusion) has the potential to improve the accuracy with high temporal coverage. The fusion between active and passive remote sensing have been adopted [58,59,60,61], to overcome the missing data, and have higher accuracies due to the ability to distinguish specific crop area, especially near water bodies. According to Schmitt and Zhu [62], there are three types of data fusion: (1) raw data fusion, (2) feature extraction fusion, and (3) decision-making level. Feature extraction fusion is more attractive than the others because the fusion process can be carried out with multiple images with different temporal and spatial resolution. The feature fusion uses similar objects from multiple sources that are combined for further assessment. For example, Park et al. [63] developed a method by combining the data of Landsat, MODIS, and ALOS-PALSAR to map paddy area in South Korea, which increased the accuracy by 6–9%, compared to individual sensor information. Similarly, Cai et al. [64] reported high accuracy for mapping rice paddy area in China with multiple sensors (Sentinel-1/Sentinel-2/MODIS). The resultant high accuracy is due to the strong correlation between the leaf area index (LAI) and biomass with cross-polarization of microwave backscattering [65]. To date, the study of integration between optical and radar-based imagery on rice growth stages classification is not investigated thoroughly on the feature extraction and decision-making fusion level to increase accuracy.
Considering some limitations of previous studies, the actual demand for near-real time rice growth stage prediction models using remote sensing is high. Thus, this study aimed to develop an automatic workflow for generating 10-m-resolution multitemporal maps of rice growth stages by combining Sentinel-2, MOD13Q1, and Sentinel-1 imagery. The result of this study, both the method and the map products, can be used for mapping at national and regional scales to ensure food security and production with timely and accurate spatial information using multiple remote sensing data sources.
2. Background, Study Area, and Data
2.1. Rice Growth Stages
The growth stage of paddy rice in Asia, particularly Indonesia, can be split into five classes, which reflect the rice cultivation practices in any given time [66,67,68]. First, the bare land area is filled by water (flooding) and ploughed for (15–25 days) depending on the water irrigation schedule (Figure 1). The second stage is the vegetative stage, which the rice grows from seedling emergence until panicle development (55–65 days; Figure 1). The third stage is the reproductive stage where the leaf stem bulges with the panicle, also known as the booting stage (20–25 days; Figure 1). Flowering and pollination also occur in this stage. The final stage is the ripening stage, which includes fertilisation, the grain is filling/expanding, and the paddy grain becomes dry-brown (30–35 days; Figure 1). The total cultivation time is 120–150 days, depending on the varieties. Intercrops, growing other crops (maize, soybean, green beans) between the rice crop, are also adopted in some areas.
2.2. Study Area
In this study, we chose two distinctive regions with different climates (wet and dry area) and fragmented paddy area from Java Island, Indonesia (Figure 2). The West Area is located on West Java, which represents the paddy area, mainly irrigated area and low land, while the East Area is located with larger rain-fed paddy field and have a high fragmented area. Both areas have a monsoon climate with two seasons: the wet season and the dry season [69]. The wet season is from October to March, and the dry season is April to September (see Supplementary Figure S1). Typically, short-duration varieties, such as IR64, Ciherang, of glutinous rice are widely adopted in this area [19,21].
2.2.1. West Area
West Area consists of paddy area in three regencies, which are Karawang (96,482 ha), Subang (84,228 ha), and Indramayu (115,555), with a total area of 310,265 ha [70] (Figure 2). Most of the area is irrigated paddy field from Jatiluhur area (258,633 ha), established on alluvial and lithosol soil. The water distribution is maintained by the state-owned company easing water requirements. However, further issues arise from sedimentation, breaking drainage, and land-use change, especially in the dry season. The season comprises of two rice planting cycles, separated by one month of bare land. This bare land period is intended to rest the soil and stop the crop pest life cycle (Figure S2). The time of planting changes based on the catchment location, with the first planting season starting from November and the second in March in the upper catchment areas. The downstream areas start their season in February due to constant floods during the rainy season, while the second planting is in July.
2.2.2. East Area
East Area is located on West Java Province consists of four regencies, including Bojonegoro (78,677 ha), Lamongan (87,336 ha), Jombang (48,704 ha), and Nganjuk (42,918 ha), totalling an area size of 257,635 ha [71] (see Supplementary Table S1). The soil types of this region are much more diverse, including alluvial soils, grumusol, and regosol. The main cropping pattern can be divided into four types. The first cropping pattern is rice planted three times throughout the year with planting times in December, April, and August with water pumping from the Bengawan Solo river for year-round irrigation. This cropping pattern is only found in this one area near the Bengawan Solo river facilitating the farmer to cultivate during the dry season by pumping water. The second cropping pattern is planting rice twice annually with the first planting season in December and the second in April. The third cropping pattern utilises one rice planting period and a secondary crop, such as maize, soybean, and shallots, planted in the dry season. The fourth cropping pattern consists of only rice cultivation during the rainy season (Figure S1).
2.3. Satellite Imagery
This study is based on three satellite datasets from 1 June to 29 September 2018. The first dataset comes from Sentinel-2 A/B in Level-1C (L1C) format, which is a Top-Of-Atmosphere product from Copernicus Open Access Hub [72]. Sentinel-2 satellites provide 5 days of revisited time over the study area. The scenes of Level-1C were subjected to atmospheric corrections into Level-2A by using Sen2Cor [73]. The Sen2Cor produces surface reflectance, which includes seven bands with a 10-m resolution, four bands with a 20-m resolution, six bands with a 60-m resolution, and Scene Land Classification (SCL) with a 20-m resolution. Here, we used 11 bands as the predictors for the Sentinel-2 model (Table S2). Moreover, bands 5, 6, 7, 8A, 11, and 12 were resampled into a 10 × 10 m pixel size to match with high-resolution bands.
The second dataset was MOD13Q1 from Terra sensor with a 250-m resolution from Google Earth Engine (GEE), which was pre-processed to surface reflectance [74,75] and resampled in a 10 × 10 m pixel size. We used this dataset for filling the gap of Sentinel-2 imagery because it is a composite 16-day period of daily MODIS observation. MOD13Q1 consists of the normalized difference vegetation index (NDVI) and enhanced vegetation index (EVI), which are chosen with the highest NDVI value in the 16-day period. The calculation of NDVI and EVI is based on MODIS/Terra bands with the formula as follows:
(1)
(2)
The third dataset was Sentinel-1A, which was downloaded from GEE and pre-processed with Sentinel-1 toolbox to remove thermal noise, calibrating radiometric and terrain correction (
The total number of acquisitions used in this study were 200 scenes for Sentinel-2, 28 scenes for Sentinel-1, and 17 composite scenes for MOD13Q1 (Table S3). All of the dataset was masked with a high-resolution paddy field area from the Indonesia Ministry of Agriculture to reduce complexity with other land use before further processing.
3. Methods
The workflow of analysing multisource remote sensing data for multitemporal mapping of rice growth stages from June–September 2018 was illustrated in Figure 3. First, we labelled and synchronised data from multiple satellites (Sentinel-2, MOD13Q1, and Sentinel-1) with the collected field data. The next step is to build two separate prediction models for rice growth stages using the support vector machine methodology for the Sentinel-2 and another for fusion between MOD13Q1 and Sentinel-1 (MOD13Q1/Sentinel-1). After assessing the accuracy of both models (Sentinel-2 and MOD13Q1/Sentinel-1), the image data were combined into one time series to fill the cloud-obscured pixels of Sentinel-2. Additionally, rice planting detection was applied to separate rice cultivation and non-rice cultivation activity on the rice fields. The final rice growth stages map is the integration of the Sentinel-2/MOD13Q1/Sentinel-1 maps. Finally, we calculated the consistency percentage of the Sentinel-2 and Sentinel-2/MOD13Q1/Sentinel-1 maps over time. In the following subsection, the data conversion steps are described in detail.
3.1. Field Data and Dataset Labelling
The field data was collected as purposive and random sampling with a Global Positioning System (GPS)-enabled mobile phones and synchronised with a pocket GPS receiver between 20 July and 4 September 2018. The sampling points were selected randomly with a 500-m buffer distance using the official paddy area as the base map, which returned 227 points in the West Area and 171 points in the East Area in total. These sites were then visited in the field. The enumerator of the field data made sure that the sampling area has a dimension at least of 50 × 50 m and contained the same rice growth stage throughout to reduce impacts of mixed pixels. Each field site has associated field photos that were used for labelling. The field observations were then matched up with the closest and cloud-free satellite dataset’s date for extraction.
The flooding class has been underrepresented due to fieldwork being conducted in the dry season. To ensure the field dataset was balanced, 70 flooding locations were manually extracted from a Sentinel-2 false colour composite image, taken on 6 June 2018. In total, the number of Sentinel-2 datasets utilised was 426 observations, and MOD13Q1/Sentinel-1 had 468 observations. The difference is due to the changes in cloud cover within the datasets.
3.2. Prediction Models for the Rice Growth Stages
In this study, we used SVM because of its suitability in remote sensing data applications and its capacity to handle complex classification problems [81,82]. SVM maps the data points in high-dimensional feature space and separates the class using hyperplane with a kernel function, such as linear, polynomial, sigmoid, and radial function. The closest data points of the hyperplane become the support vectors, which determine the position and orientation of the hyperplane to find the maximum margin (the distance between the support vectors with the hyperplane) [83]. Here, we used the radial basis function (RBF) because of the high variability and complex predictors for rice condition [33,80]. Moreover, Kavzoglu and Colkesen [84] suggested that SVM with RBF has advantages, such as training pixels being needed less than with other classifiers and flexible with a statistical distribution range in remotely sensed data. Moreover, SVM with RBF has been applied successfully for rice growth stage mapping before and outperformed other classification methods, such as random forest and neural network [24]. There are two kernel parameters of SVM RBF can be used to increase the accuracy, i.e., cost penalty (C) was used to define trade-off between model complexity and error and Sigma for smoothing the vector [85].
Two SVM models were built in this study, using the R programming language with the caret package [86], as follows:
(a). Sentinel-2 model: this prediction model was based on Sentinel-2 input bands (Table S2) as predictors labelled based on the field data temporally closest to the Sentinel-2 imagery acquisition. The Sentinel-2 model to predict rice growth stages was trained using the field dataset with a 70:30% random split (i.e., 299 and 127 observation from the field dataset). The relationship between the bands of Sentinel-2 and the rice growth stages can be expressed as follows:
(3)
(b). MOD13Q1/Sentinel-1 model was developed by combining MOD13Q1 and Sentinel-1 with predictors of NDVI and EVI from MOD13Q1 and VH for three consecutive three-time lag series (e.g., Sentinel-1 image of VH on t day (T0), t-12 days (T1), and t-24 days (T2) in decibel (dB)). We found that using three consecutive VH values had better accuracy, which can be explained by the typical length of each rice growths stage of around 24 days. We used 330 points for the training dataset and 138 points for the test dataset. The relationship of the MOD13Q1 indices and multitemporal backscattering data of Sentinel-1 and the rice growth stages can be expressed as follows:
(4)
The two models were trained with the parameter cost penalty (2n with n = 1 to 10) to define trade-offs between model complexity and error and Sigma (2−25, 2−20, 2−15, 2−10, 2−5, 0) for smoothing the vector [85]. Moreover, the resultant models were cross-validated with the leave-one-out cross-validation method to ensure an unbiased result. The highest accuracy model was selected to be used to classify rice growth stages on the satellite dataset.
3.3. Generating Multitemporal Maps for Growth Stages by Integrating Sentinel-2/MOD13Q1/Sentinel-1
The final paddy growth stage maps were preferred to be based on Sentinel-2 classification results due to its better resolution than MOD13Q1. However, temporal gaps in the Sentinel-2 data frequently occur due to cloud coverage and shadows. These gaps were filled with the prediction maps generated from the integration of MOD13Q1 and Sentinel-1, which has the same spatial resolution using the resampling technique. The Sentinel-2-based rice growth stage classification was filtered for pixels that had not been bare land for a period 120 days (t1) and 150 days (t2) for the East Area and West area, respectively (Figure 4). The timeline of each processing multitemporal map for a 16-day period from 10 June–29 September 2018 can be seen in Table S4.
The differences between time windows were due to different rice cultivation between those areas during the dry season. The East Area has limited time to flood the rice field (<20 days) compared to the West Area (20–40 days). Cloud and shadow were removed using the scene classification band. The Sentinel-1 VH was stacked up to find the lowest VH value (VHmin) over the time window t1 and t2 [52]. Low radar backscatter signal can be associated with both the water/vegetative stage due to radar energy reflected off and bare land due to penetration radar energy on dry land [21], making classification using only backscattering sensor data challenging. This confusion can be reduced by utilizing Sentinel-2-based rice growth stage classification. If the VHmin was higher than −19 dB and the rice growth stages are reproductive or ripening, then the pixel value will be replaced by a non-rice planting class, and others would have the rice growth stages value from Sentinel-2 maps. This step is to separate the maize or soybean in the reproductive or ripening phase because the spectral profile is nearly the same as rice’s profile, as illustrated in Figure 5 [21,52].
In some instances, we found even the complete Sentinel-2 and MOD13Q1/Sentinel-1 was unable to have cloud-free data every 16-day period. Thus, we used rice’s detection result to replace the cloud pixels. If the VHmin was lower than −19 dB, the pixel would be marked as the rice planting class. Otherwise, the cloud pixels were marked as another crop class. This resulted in a seven-class map, including bare land, flooding, vegetative, reproductive, ripening, rice planting, and other crops. The integration of Sentinel-2/MOD13Q1/Sentinel-1 will have more rice growth stages data than the Sentinel-2 rice growth stage model alone.
3.4. Accuracy Assessment and Temporal Changes
The accuracy of the Sentinel-2, MOD13Q1/Sentinel-1 model, and rice planting detection were tested using the comparison of predicted values and ground-reference data. By using the confusion matrix, the overall accuracy, user’s accuracy (UA), and producer’s accuracy (PA) can be calculated as suggested by [87,88,89]. The test data for the Sentinel-2 model is 127 points, and MOD13Q1/Sentinel-1 is 138 points.
The temporal changes of predictions maps were reclassified into four classes: (1) No Change, there is no change between stages; (2) change correctly, the stages change into progressively according to the common practices of rice cultivation in Figure 1; (3) change incorrectly, the stages change into two stages forward or back to previous stages; and (4) cloud and shadows, one of the images has cloud/shadows. Thus, the average of no change, change correctly, and change incorrectly can be calculated. The consistency of the map of Sentinel-2 was tested by comparing two images of different 5, 10, 15, 20, 25, and 30 lag-days, and the Sentinel-2 / MOD13Q1 / Sentinel-1 maps were 16-day lag-days, respectively. The formulas of consistency and inconsistency as follows:
(5)
(6)
(7)
(8)
A high consistency value is better since the prediction model can predict the growth stages in current time and previous time accurately.
4. Results
4.1. Spectral Bands
Figure 6a provides the mean and deviation of reflectance values for different rice growth stages based on the ground truth dataset. The result indicates that five classes have distinctive spectral features with significant variability. The reflectance profiles of bare land and flooding are different from rice growth stages (vegetative, reproductive, and ripening) as the rice growth stages have the higher mean value on Red Edge 3, NIR, and SWIR1 band. The flooding class displays the lowest reflection on the NIR band due to almost no reflection off from the vegetation. Moreover, the spectral bands of rice growth stages are concurrent with a multi-angle spectrometer from Sun et al. [87], except for the flooding phase because of the different rice management in the study area. The spectral trend in vegetative and reproductive phases followed a similar pattern.
The averages of NDVI and EVI values can be seen in Figure 6b. It shows that bare land and flooding have similar NDVI values but are different on EVI, which indicated that EVI has a more significant role in separating bare land and flooding [90]. The NDVI and EVI value for rice growth stages follows the common values of rice growth stages where the vegetative stage is the lowest values, and the highest values are the heading stage due to the dense canopy. Lastly, the ripening stage is in the middle as it changes the vegetation to golden colour [91].
Figure 6c shows the relationship between rice growth stages in three consecutive acquisition dates. It shows that the bare land, ripening, and reproductive stages have backscattering values >−19 dB, meaning the surface has relatively dense biomass over 24 days. The bare land is decreasing, and otherwise, ripening and reproductive is decreasing. The main feature is the flooding class, which shows the lowest backscattering value on the previous 24 days and increases afterwards. The backscattering values on the vegetative stage were stable <19 dB, which shows the low scattering due to specular reflection [92].
4.2. Accuracy Assessment of Rice Growth Stages Model
Overall, the accuracy of the Sentinel-2-based model is higher than the combined MOD13Q1/Sentinel-1 model (Table 1). The former can reach a high overall accuracy of 90.6% for all rice growth stages (except ripening) from the producer’s accuracy. The ripening stage is often misclassified with vegetative and reproductive stages. On the other hand, the MOD13Q1/Sentinel-1 model has an overall accuracy of 78.3%, which is lower due to confusion between the ripening class and bare land stage. However, the MOD13Q1/Sentinel-1 model can predict the reproductive stage with a high accuracy (97.1% for PA and 84.6% for UA).
The Sentinel-2 rice growth stage model was able to capture the variation of different rice growth stages, as shown in Figure S2. The map of a particular area on the West Area shows that the vegetative stage was recorded on 1–26 June 2018, followed by the reproductive stage on 1 – 21 July 2018, and then the ripening stage until 10 August 2018. The models show that the land was harvested on 15 August 2018.
Figure S3 shows that the paddy area was surrounded by a non-paddy class as in the East Area, and the irrigation becomes scarce on second rice planting. The vegetative stage was captured on 3–23 June 2018. Then, the reproductive stage was 28 June–23 July 2018. The process of ripening happened between 2 and 17 August 2018. Afterwards, the paddy area becomes bare land or host to other crop types.
4.3. Temporal Changes
The temporal analysis of Sentinel-2 shows that the model was able to capture the change classes with correct ranges of 83.27–90.39% (Table 2) of the time. The temporal consistency decreases with the time lag increase. The gradual decrease of the non-change area from 5 to 16 days also confirms that the model can classify the change of rice growth, as shown in Figure 7 and Figure 8. The West Area has a better consistency result since the irrigation schedule is stable, unlike the East Area where irrigation depends on rain.
On the other hand, the temporal analysis of Sentinel-2/MOD13Q1/Sentinel-1 proves that consistency is high (84.15%), which is comparable with the Sentinel-2 consistency on the 15-day period (87.91). Table 2 also indicates that the highest accuracy comes from the East Area, which suggests that the detection of the non-rice planting class is working well. Moreover, Figure 7 and Figure 8 show that final classification images from the integration of the Sentinel-2/MOD13Q1/Sentinel-1 model can deliver a rice growth stage map without cloud and shadows at a 10-m resolution. The Sentinel-2 model was able to fill in >80% of the study area except on 26 June, 28 July, and 14 September 2018 in the West Area (Figure 7). Moreover, the map of the East Area shows that the area of the rice field is dominantly on bare land and non-rice planting except on the area on the north of the East Area, where rice planting can be irrigated with pumping water from the near river (Figure 8).
5. Discussion
5.1. The Performance of Integrating Sentinel-2/MOD13Q1/Sentinel-1 for Mapping Rice Growth Stages
High-resolution multitemporal images of rice growth stages are essential to improve rice production. Although optical images are less favourable for providing continuous images in tropical regions, this research developed a new method to generate multitemporal maps for rice growth stages across Java Island using multisource remote sensing data (Sentinel-2/MOD13Q1/Sentinel-1). This method outperforms a single source of remote sensing information in a real-world application level on rice growth stage mapping as confirmed with previous results, especially to fill the missing data, with an increased overall accuracy 2–5% [64,93,94]. Previous research reported that using single-date data for creating rice maps is not the right strategy for mapping in the long term. These rice growth stage maps showed temporal consistency even without use of the extended temporal composite vegetation index or backscattering profile series. This workflow works best on the paddy area, which is often flat and water covered. The water cover can be picked up by the Sentinel-1 VH signal and improve the temporal frequency of observation, and thus the overall rice growth stage models. Other advantages of this model are that it can be rolled out to cover larger areas (i.e., entire Southeast Asia) with the same farm practices and without depending on a clean time series of the vegetation index.
Increasing the temporal frequency of Sentinel-1 images helped to improve the accuracy of rice growth stages since the backscattering profiles were more accurately determined, particularly on the vegetative stage on the dry season, such as the East Area, where water is scarce. Moreover, Ndikumana et al. [95] reported that Sentinel-1 with a 6-day revisit cycle can produce a rice map with an 88% accuracy in a France rice field, which better compares with the 12-day revisit cycle from the study from Clauss, Ottinger, and Kuenzer [49] with an 83% accuracy.
It is not surprising that the Sentinel-2 based model produced a higher accuracy than the fusion of MOD13Q1 and Sentinel-1 due to the red edge bands with high spatial resolution. These bands can enhance the detection of capturing canopy chlorophyll content better, as suggested by Frampton et al. [96,97], which enables separation of the rice growth stages. The accuracy from the fusion of MOD13Q1 and Sentinel-1 was higher than MODIS information alone, which indicates the complementary information from Sentinel-1. This high accuracy is prominent for the flooding class due to the coarse resolution of the MOD13Q1 data, allowing a more significant portion of mixed pixels to occur compared to the 10-m Sentinel-2 data. The prediction maps from MOD13Q1 and Sentinel-1 were used to fill the missing data or for cloudy pixels of the Sentinel-2.
The consistency of Sentinel-2 has a negative correlation between lag days. The 5-day lag showed the highest consistency percentage and slowly became lower to the 30-day lag-time due to the unchanged stages and correct changes became lower and also became less consistent over the lag days. This result demonstrated that the models could detect the changes in growth stages over time and still have the right consistency (83.27%) over 30 consecutive days. Moreover, the integration of Sentinel-2/MOD13Q1/Sentinel-1 shows an increase the consistency (2.54%) compared with the 5-day lag consistency percentage.
5.2. The Implication of Satellite-Based Monitoring in Tropical Countries
The tropical regions are challenging for optical satellite-based remote sensing due to persistent cloud coverage. Most of the study relies on high-frequency temporal remote sensing data to capture rice growth changes, which are hampered by observation gaps, which can be filled using radar-based imagery (e.g., S1). However, radar-based data also has some limitations, such as geometric distortion and a different response to bare land [98]. Our finding shows that by merging three sensors, we can deliver information on rice growth stages for near real-time monitoring for the Sentinel-2 model, and better accurate information in periodical data for Sentinel-2/MOD13Q1/Sentinel-1 data. Furthermore, this study successfully filled the missing data up to 28.2% in the West Area and 8.8% in the East Area (Figure 9).
Our methodology focuses on developing a groundwork for mapping rice growth stages for near-real-time monitoring, which is challenging to accomplish in an operational level due to the short duration of rice cultivation. Moreover, we focused on making it feasible to incorporate with cloud-based computing or stand-alone workstations and less human interference. Our study is in line with the works of Rudiyanto, et al. [21]. They have succeeded in mapping rice growth stages from multitemporal Sentinel-1 images with unsupervised classification with one month with the same area with cropping pattern information. Significantly, our methodology outperformed the established rice monitoring systems in Indonesia based on MODIS [99] or Landsat rice monitoring [100], as our results created rice growth stage maps at a 10-m spatial resolution with a 16-day period, less cloudy data, and providing present cultivation crop.
Future research should be focused on fusing the multitemporal and multiresolution of satellite observation to increase data availability and accuracy with <100-m spatial resolution. Our study shows that it is possible to fuse the information of the feature level using machine learning. Our experiment showed that the desktop personal computer with 16 GB RAM could classify the image with high speed (17,770 ha/min). This method can also be employed for other remote sensing data, such as PROBA-V or WorldView missions.
This technology implies that the users can compile the information interactively with the user’s interface on a dedicated website or mobile application with less waiting time before the information becomes obsolete. Moreover, the classifier of machine learning can be chosen easily with other open-source machine learning packages, such as TensorFlow [101] with Keras [102] or Scikit-learn [103]. Importantly, the integration of rice growth stages with other information, such as the recommended cropping calendar, climate predictions, weather reports, price trend maps, insurance risk maps, and other in situ knowledge, will improve the rice productivity supporting regional food security.
5.3. Limitation of Satellite Remote Sensing for Rice Mapping
This research has some limitations, such as the availability of cloud-free images. These limitations are further hampered by the fact that there is a bias between the seasons. For example, most optical observations concentrate on the dry season, while only coarser resolution data are available (MODIS and RADAR) on the wet season. Accordingly, some areas with different surface reflectance may be misclassified if it is applied to images in the wet season, such as a wetland. Some traditional farmers switch flooding and drying again, and conduct final flooding before transplanting. This study’s accuracy is highly dependent on the existing rice field map, and, over time, some area could be converted into an urban or industrial area. Long-term rice mapping using multitemporal satellite images has been explored in some studies [33,104,105] with high accuracy, which can be used to correct existing rice field maps every five years.
The East Area has high fragmentation of different crops on the rice field area especially on the north area of Nganjuk regency, which caused problems in classifying between the vegetative stage of rice and shallot cultivation since both crop types need a significant time of water cover to keep the soil moisture. Hence, they are spectrally similar at a specific time, mostly in the early second planting season. Moreover, the fragmented rice field paddocks with an area <0.5 ha made it difficult to be separated automatically in a large area. Thus, additional information is required, including the use of unmanned aerial vehicles (UAVs) [97], ongoing and planned hyperspectral satellite missions (e.g., PRISMA [106] and ENMAP [107]), and webcams [108], for a more apparent separation between such classes.
6. Conclusions
There is a need for timely and accurate spatial information on rice growth stages. This paper has provided an automatic process to build multitemporal maps for rice growth stages with a 10-m spatial resolution, which is sufficient for crop monitoring at the local and national level in developing countries. This information is useful for stakeholders using a combination of Sentinel-2, MOD13Q1, and Sentinel-1 for a near real-time and high accuracy, consistency and temporal frequency (16-day period). Furthermore, the Sentinel-2 model in this study was implemented in Indonesia, with 7.4 million hectares of autonomous data retrieving, analysis, and dissemination, which are available on a website (
The research presented here is based on open-access satellite data and software, improving its accessibility and uptake by end-users in developing countries. Improving the speed of the mapping process can provide an effective tool for stakeholders and decision-makers to coordinate the distribution of machinery, fertiliser, and water more efficiently. The estimation of rice production can be predicted with better accuracy to control the export and import of rice trade. In addition to the climate change issue, the result is applied as an input of prediction of production and food security in a fragile area, such as drought and flooding, using the temporal analysis or becoming a part of the ASIA Rice project (asia-rice.org) to support the dissemination among its members.
Supplementary Materials
The following are available online at
Author Contributions
Conceptualization (F.R. and R.P.), investigation (F.R. and R.P.), data curation (F.R.), formal analysis (F.R.), methodology (F.R. and R.P.), software (F.R.), visualization (F.R. and G.K.), and writing—original draft preparation (F.R.), writing—review and editing (R.P., G.K., and J.P.), and supervision (R.P., G.K., and J.P.). All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Sustainable Management of Agricultural Research and Technology Dissemination scholarship from the Indonesian Ministry of Agriculture.
Acknowledgments
Indonesia Ministry of Agriculture for providing high resolution of paddy fields area, researchers from Indonesia Agency for Agroclimate and Hydrology Research Institute (IAHRI) and Muhamad Ronal Sabhana Koswara for ground truth campaign. The source codes and scripts used in this study are freely available from
Conflicts of Interest
The authors declare no conflict of interest.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures and Tables
Figure 1. The example of rice growth stages and other rice field conditions in the paddy field area. The geolocation and date stamped on the photos.
Figure 2. Map of the study area for West Area and East Area and overlay with the false colour (NIR/SWIR/Green) of Sentinel-2 image, which has been masked to show only the paddy rice fields for the West Area on 11 and 14 June 2018 and for the East Area on 13 June 2018.
Figure 4. The flowchart of rice detection and the integration of Sentinel-2/MOD13Q1/Sentinel-1.
Figure 5. The example of rice detection workflow for each pixel with vertical-horizontal backscattering (VH) values. The blue line is the example of a non-rice cultivation class (−7° 10′ 34.9″, 111° 52′ 55.42″), and the orange line is the example of a rice cultivation class (−6° 27′ 57.27″, 108° 8′ 56.64″).
Figure 6. (a) The means of spectral bands from Sentinel-2 values, (b) the means of NDVI and EVI values from MOD13Q1, and (c) the means of Sentinel-1 VH values regarding bare land, flooding, and rice growth stages. The bands are offset for visualization and error bars represent standard error.
Figure 7. Temporal map of rice growth stages based on Sentinel-2/MOD13Q1/Sentinel-1 on the West Area.
Figure 8. Temporal map of rice growth stages based on Sentinel-2/MOD13Q1/Sentinel-1 on the East Area.
Figure 9. Pixel-based model compositions of temporal rice growth stages map of Sentinel-2/MOD13Q1/Sentinel-1 on final paddy condition map on 10 June–30 September 2018.
The confusion matrix performance on the test dataset on Sentinel-2 and MOD13Q1/Sentinel-1.
Reference Data | ||||||||
---|---|---|---|---|---|---|---|---|
Classes | Bare Land | Flooding | Vegetative | Reproductive | Ripening | Sum | UA (%) | |
(a) Sentinel-2 | ||||||||
Predicted | Bare land | 32 | 0 | 0 | 0 | 1 | 33 | 97.0 |
data | Flooding | 0 | 21 | 0 | 0 | 0 | 21 | 100.0 |
Vegetative | 1 | 0 | 16 | 0 | 4 | 21 | 76.2 | |
Reproductive | 0 | 0 | 0 | 35 | 3 | 40 | 87.5 | |
Ripening | 0 | 0 | 0 | 1 | 11 | 12 | 91.7 | |
Sum | 33 | 21 | 18 | 36 | 19 | 127 | ||
PA (%) | 97.0 | 100.0 | 88.9 | 97.2 | 57.9 | |||
OA (%) | 90.6 | |||||||
(b) MOD13Q1/Sentinel-1 | ||||||||
Predicted | Bare land | 31 | 1 | 5 | 0 | 5 | 42 | 73.8 |
data | Flooding | 0 | 18 | 3 | 0 | 0 | 21 | 85.7 |
Vegetative | 2 | 2 | 15 | 0 | 0 | 19 | 78.9 | |
Reproductive | 0 | 0 | 3 | 33 | 3 | 39 | 84.6 | |
Ripening | 4 | 0 | 1 | 1 | 11 | 17 | 64.7 | |
Sum | 37 | 21 | 27 | 34 | 19 | 138 | ||
PA (%) | 83.8 | 85.7 | 55.6 | 97.1 | 57.9 | |||
OA (%) | 78.3 |
Note: OA= Overall accuracy, UA = User’s accuracy, and PA = Producer’s accuracy.
Table 2List of the average of performance from temporal changes analysis of (a) Sentinel-2 and (b) Sentinel-2/MOD13Q1/Sentinel-1.
Time Lag | N | No Change (%) | Change Correctly (%) | Consistency (%) | Inconsistency (%) | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
(a) Sentinel-2 | |||||||||||||
WA | EA | Avg. | WA | EA | Avg. | WA | EA | Avg. | WA | EA | Avg. | ||
5 | 23 | 58.72 | 69.22 | 63.90 | 34.00 | 18.82 | 26.48 | 92.72 | 88.04 | 90.39 | 7.28 | 11.96 | 9.61 |
10 | 22 | 50.49 | 68.33 | 59.35 | 40.80 | 18.97 | 29.96 | 91.29 | 87.30 | 89.31 | 8.71 | 12.70 | 10.69 |
15 | 21 | 44.06 | 63.41 | 53.68 | 46.47 | 21.86 | 34.23 | 90.53 | 85.27 | 87.91 | 9.47 | 14.73 | 12.09 |
20 | 20 | 40.57 | 62.24 | 51.40 | 48.79 | 22.26 | 35.53 | 89.36 | 84.50 | 86.93 | 10.64 | 15.50 | 13.07 |
25 | 19 | 39.02 | 55.41 | 47.22 | 48.36 | 26.05 | 37.21 | 87.39 | 81.47 | 84.43 | 12.61 | 18.53 | 15.57 |
30 | 18 | 35.09 | 55.77 | 45.43 | 50.74 | 24.92 | 37.83 | 85.83 | 80.70 | 83.27 | 14.17 | 19.30 | 16.73 |
(b) Sentinel-2/MOD13Q1/Sentinel-1 | |||||||||||||
16 | 6 | 60.27 | 65.1 | 62.64 | 24.26 | 18.75 | 21.50 | 84.52 | 83.77 | 84.15 | 15.48 | 16.23 | 15.85 |
Note: WA= West Area, EA = East Area, and Avg.=Average.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2020 by the authors.
Abstract
Rice (Oryza sativa L.) is a staple food crop for more than half of the world’s population. Rice production is facing a myriad of problems, including water shortage, climate, and land-use change. Accurate maps of rice growth stages are critical for monitoring rice production and assessing its impacts on national and global food security. Rice growth stages are typically monitored by coarse-resolution satellite imagery. However, it is difficult to accurately map due to the occurrence of mixed pixels in fragmented and patchy rice fields, as well as cloud cover, particularly in tropical countries. To solve these problems, we developed an automated mapping workflow to produce near real-time multi-temporal maps of rice growth stages at a 10-m spatial resolution using multisource remote sensing data (Sentinel-2, MOD13Q1, and Sentinel-1). This study was investigated between 1 June and 29 September 2018 in two (wet and dry) areas of Java Island in Indonesia. First, we built prediction models based on Sentinel-2, and fusion of MOD13Q1/Sentinel-1 using the ground truth information. Second, we applied the prediction models on all images in area and time and separation between the non-rice planting class and rice planting class over the cropping pattern. Moreover, the model’s consistency on the multitemporal map with a 5–30-day lag was investigated. The result indicates that the Sentinel-2 based model classification gives a high overall accuracy of 90.6% and the fusion model MOD13Q1/Sentinel-1 shows 78.3%. The performance of multitemporal maps was consistent between time lags with an accuracy of 83.27–90.39% for Sentinel-2 and 84.15% for the integration of Sentinel-2/MOD13Q1/Sentinel-1. The results from this study show that it is possible to integrate multisource remote sensing for regular monitoring of rice phenology, thereby generating spatial information to support local-, national-, and regional-scale food security applications.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details



1 Geosciences, School of Agriculture and Environment, Massey University, Palmerston North 4410, New Zealand;
2 AgriFood Digital Lab, School of Food and Advanced Technology, Palmerston North 4410, New Zealand;
3 Geosciences, School of Agriculture and Environment, Massey University, Palmerston North 4410, New Zealand;