Wetlands are the ecosystems that are identified by the presence of anoxic soil, temporary water presence and, vegetation. The hydrology of wetlands is variable due to tidal and rainy season changes. According to Tiner et al. (2015), between 3% and 8% of the Earth's land surface has been covered by wetlands (Tiner et al., 2015). Wetlands are highly productive and provide significant ecosystems’ services at regional and global scales, including facilitating water storage and purification, weather regulation, storm protection, flood mitigation, and shoreline stabilization (Mitsch et al., 2013). Moreover, the prolonged presence of water favors the growth of some endangered terrestrial and aquatic vegetation species (Dahl, 2011). Due to the inherent wet conditions of these lands, they also are considered ideal regions for sequestering and storing atmospheric carbon (Bridgham et al., 2006).
Despite the intrinsic importance of wetlands, they are being degraded due to man-made and natural threats. One of the main issues in recent years is the transformation of the wetlands into agricultural fields due to the demand for intensive farming. Another triggering factor for the permanent loss of wetlands is the overexploitation of underground aquifers. Dahl (2011) reported that the rate of declining marine and estuarine intertidal wetlands is 1.4 percent in the USA between 2004 and 2009. That percentage would be equal to 84,100 acres (34,050 ha). In particular, for coastal wetlands of Atlantic, Pacific, and Gulf of Mexico coasts, the rate for wetland loss is 19,000 acres per year for 1922–1954 and 46,000 acres per year for 1954–1974 (Stedman & Dahl, 2004). This extensive loss of wetlands may hinder future economy and tourism. Technical advances in remote sensing, such as the use of high temporal and spatial resolution imagery have increased the focus on wetland monitoring (Brisco et al., 2015; Mitsch et al., 2013; Wohlfart et al., 2018).
Over the past two decades, remote sensing data have significantly facilitated wetland mapping and monitoring (Tiner et al., 2015; Tsyganskaya et al., 2018; Wohlfart et al., 2018). In particular, the ability of Synthetic Aperture Radar (SAR) sensors to collect data day and night in all-weather conditions makes this a highly valued data source for wetland monitoring. By providing medium to high spatial resolution imagery with a high temporal resolution, newer SAR datasets have proved to be a valuable tool for wetland monitoring (Wohlfart et al., 2018). Compared to other conventional methods for wetland monitoring such as optical imagery, SAR operates at longer wavelengths of the electromagnetic spectrum. This portion of the electromagnetic spectrum allows for deeper penetration of transmitted signals in vegetation cover, which enhances efficient delineation of different wetland classes. Moreover, the sensitivity of the SAR signal to the roughness and dielectric properties of the surface supports retrieval of information related to the shape, size, orientation, and moisture content of the target (Amani et al., 2018; Tsyganskaya et al., 2018).
SAR sensors operate at different frequencies including L (24 cm wavelength), C (5.66 cm wavelength) and, X (3 cm wavelength) bands. The wavelength in which the sensor operates is an influential factor in the penetration depth and signal attenuation. Due to the longer wavelength, L-band has a deeper penetration depth and weaker attenuation through vegetation canopy compared to other frequencies, such as C-band (Hong & Wdowinski, 2014). This allows L-band to penetrate through dense wetland vegetation structures and reach the water surface. Differences in canopy scattering interactions between C-band and L-band signals are mediated by total vegetation water content and the geometric distribution of vegetation water content. In a study conducted in the Amazonian basin, Hess et al. (2015) classified wetlands using L-band JERS-1 mosaics with 100-m resolution. They used dual season backscattering values for estimating the extent of wetland and inundation state and found relatively high producer's accuracy (better than 85%) for wetland extent (Hess et al., 2015). Several studies have also reported an increase in the overall accuracy of wetlands classification by integrating different SAR frequency bands (Evans & Costa, 2013; Mahdianpari et al., 2017; Mohammadimanesh et al., 2018).
Another factor influencing SAR sensor capability is polarization. Given the sensitivity of SAR signal to different backscattering mechanisms, full-polarimetric SAR data can facilitate distinguishing similar wetlands classes (Brisco et al., 2015). Fully polarimetric systems acquire intensity and phase information at four linear polarizations thus allowing effective decompositions of coherency and covariance matrices. To this end, researchers have developed several techniques to decompose polarimetric SAR images into different classes based on scattering signatures (Cloude & Pottier, 1996; Freeman & Durden, 1998; Touzi, 2007). Polarimetric decompositions can categorize ground targets using three different main scattering mechanisms: odd/single bounce, even/double bounce, and volume scattering. In wetlands, odd/single bounce can be attributed to direct scattering from open water. An example of even/double bounce is the scattering between a tree trunk and open water, which is prevalent in flooded vegetated areas. Volume scattering in wetlands mostly occurs as multiple scattering in dense canopies. Adeli et al. (2020) provide a comprehensive review of studies focused on wetland monitoring using SAR data.
A joint partnership between the National Aeronautics and Space Administration (NASA) and the Indian Space Research Organization (ISRO) has led to the development of the space-borne NASA-ISRO SAR (NISAR) program (Hoffman et al., 2015). NISAR will be instrumented with multipolarimetric, dual-frequency L (24 cm wavelength) and S (10 cm wavelengths) band SARs for imaging the Earth. Notably, NISAR is equipped to receive 12 independent channels, enabling a 12-day global revisit cycle (Chuang et al., 2016). However, while the L-band SAR has the ability to collect all data while over land, the duty cycle of the S-band SAR is limited, and will be restricted to a planned subset of the Earth's surface. Considering both ascending and descending orbits, the mission plans to image at L-band the Earth's global landmass twice every 12 days. The full resolution of the L-band SAR data while in its most common operating mode will be 7 m across its entire 240 km swath width. A few studies have explored the use of simulated NISAR for environmental monitoring. For example, Yu and Saatchi (2016) predict that NISAR will be able to generate a global biomass map in a short time frame, given the deeper penetration depth of L-band compared to C-band, and short revisit cycle (Yu & Saatchi, 2016). Duncanson et al. (2020) used simulated NISAR, simulated ICESat-2 and GEDI data to estimate above-ground biomass in Sonoma County, California. Their achieved RMSE for each of the missions was 57%, 75%, and 89% for GEDI, NISAR, and ICESat-2, respectively (Duncanson et al., 2020).
Albinet et al. (2019) report that NISAR will produce 40 PB of data per year. Although this will provide unprecedented global coverage in a short time frame, the high volume raises several challenging issues related to data processing in order to exploit, visualize, and discover the full potential of NISAR data. There are also issues that apply not only to the use of the NISAR data but are broader challenges in the classification of remote sensing data, e.g. the inherent complexity of land cover within wetlands and the limitation of training data. Fortunately, the development of advanced machine learning such as random forest (RF) and support vector machines (SVM) and deep learning techniques such as Convolutional Neural Network (CNN) provides a significant contribution in terms of handling large-volume multitemporal SAR data (Banks et al., 2019; Mahdianpari et al., 2017; Thanh Noi & Kappas, 2018). RF classifies an image using many decision trees that are trained based on subtle variations of the same training data set, hence, the group of trees is less affected by overfitting compared to the entire decision tree (Banks et al., 2019). SVM converts input data to high-dimensional feature space and divides the feature space using optimal hyperplanes. SVM is more resistant to noise and an unequal number of samples within each class (Mountrakis et al., 2011). According to Sheykhmousa and Mahdianpari (2020), although deep learning techniques are powerful in reconstructing complex image patterns, they suffer from hidden layer effects that can result in interpretability issues. Moreover, unlike SVM and RF classifiers, deep learning techniques are more dependent on the presence of high density and high-quality ground reference data. Another issue with deep learning techniques is their high computational complexity. Hence, RF and SVM are still attracting attention from the remote sensing community since they have provided efficient solutions with results that are competitive relative to the more complex deep learning techniques (Sheykhmousa & Mahdianpari, 2020).
The USA National Wetland Inventory (NWI) adopted the Cowardin system for generating wetland inventory maps within the USA which includes five major systems, 11 classes and, 28 subclasses (Cowardin et al., 1979). The classes for this system are defined based on various factors including chemical, hydrological, and geomorphological attributes. NWI updated the national wetland inventory map of the USA in May 2016. One of the reasons behind this update was the demand for having surface waters and wetlands as polygons in a single geospatial data set. This second NWI version included more detailed data of the wetlands and water bodies. There are also a range of studies in the literature that use the Cowardin classification scheme. For instance, Pistolesi et al. (2015) classified Hudson Highlands ecoregion wetlands in New York using the Cowardin classification system. They unified the following classes palustrine emergent, palustrine scrub/shrub, and palustrine forested as emergent, scrub/shrub, and forested wetlands, respectively. The class of open water includes palustrine aquatic bed, palustrine unconsolidated bottom, and palustrine unconsolidated shore based on the definition in Cowardin classification systems (Pistolesi et al., 2015). In another study, implemented in Minnesota, Corcoran et al. (2013) used a two-level RF classifier to classify wetlands in six major classes of forested uplands, open water, forested, scrub/shrub, and emergent wetlands. The classification was based on a modified version of Cowardin classification systems (Corcoran et al., 2013).
The fully polarimetric SAR data allows for the recreation of covariance and coherency matrices that can be led to attaining polarimetric decompositions elements. Hence the implementation of polarimetric decomposition pertains to the separation of the received signal to different scattering mechanisms that are established to be advantageous for wetland separation analysis (Brisco et al., 2013; Koch et al., 2012; Mohammadimanesh et al., 2019). While the upcoming NISAR mission is expected to acquire SAR imagery in only dual-polarization (HH and HV) rather than full polarization for most of the Earth's landmass, NISAR does have the capability of acquiring fully polarimetric L-band SAR data, and an extended mission scenario could include a collection of fully polarimetric data over more extended areas. Therefore, the primary objective of this study was to assess the ability of fully polarimetric L-band simulated NISAR data for delineating wetlands complexes using two machine learning classifiers. Simulated NISAR data are acquired by the UAVSAR platform. In particular, this study aims to (a) compare the efficiency of object-based SVM and RF techniques for classifying L-band prototype science products, (b) evaluate the capability of recent polarimetric decomposition techniques for classifying wetlands complexes, (c) explore the relative importance of polarimetric features in RF models, and (d) test the impact of SVM parameter selection on overall accuracy. To this end, once the raw L-band simulated NISAR data are preprocessed, 84 polarimetric features from more than 10 polarimetric decompositions are extracted.
Study Area, Classification System, and NISAR DataMost parts of northeastern Louisiana are covered by rivers, lakes, and forested areas. This study focuses on Yucatan Lake in an unincorporated community covered by inundated willows and Cyprus trees, and in some parts by crops. The aquatic environment of this area contains several ponds. The lake lies 8.36 kilometers from Newellton and 16.10 kilometers from Saint Joseph in Tensas Parish in Tenses County. The extent of Yucatan Lake is estimated to be around 10 square kilometers. The elevation around the lake varies from 20 to 66 feet. From the climate perspective, the temperature in the area changes from 37 °F in December to 93 Fahrenheit in July. The highest average monthly precipitation varies from 3.06 inches in September to 6.31 inches in January and March (Yucatan Lake Topo Map in Tensas Parish, 2021). Figure 1 shows the geographic location of the study area (left) and the simulated NISAR scene (right).
Figure 1. Left: Geographic location of the Yucatan Lake is shown by a red rectangle in the boundary between northeastern Louisiana and Mississippi. Right: The simulated NISAR image. This image is dated September 30, 2019, and it was acquired in the morning at 9:17 a.m. local time (02:17 p.m. UTC time).
Full polarimetric L-band UAVSAR data were collected over the Yucatan lake area 13 times between June and October 2019. The UAVSAR instrument flies on a NASA Gulfstream 3 jet and is equipped with a multipolarimetry SAR sensor operating at L-band (23.5 cm wavelength). The collected data is part of the NISAR UAVSAR AM/PM campaign to collect L-band SAR data with a similar observation cadence as NISAR for algorithm development and calibration (Chapman et al., 2019). The data was specially processed to mimic NISAR noise and resolution characteristics by the UAVSAR project. In particular, L-band simulated NISAR imitates the polarizations, incidence angles, and signal-to-noise level of upcoming real NISAR data configurations (Huang et al., 2021). The duration of the flight to acquire a full image was approximately 12 min. The used imagery for this study was acquired on a low-flood date. The UAVSAR data sampling was reduced from the standard UAVSAR slant range pixels of 0.8 × 1.7 m to 10 × 10 m on the ground, corresponding to the smallest possible NISAR pixel spacing being considered for its products. The UAVSAR swath width is 16 km, but since incidence angles are limited on NISAR to 34° in near range and 48° in far range (Kraatz et al., 2020), the corresponding incidence angle is restricted swath width for UAVSAR is about 5 km. Compared to planned NISAR performance, UAVSAR has a higher signal-to-noise ratio and much higher resolution. Hence, the NASA Jet Propulsion Laboratory (JPL) reduced the resolution and added Gaussian noise to the UAVSAR data. Moreover, the simulated NISAR data is coregistered on the same grids. This characteristic eases time-series analysis applications. Further, given the need for radiometric terrain corrected backscatter, the simulated NISAR products come with a radiometric terrain correction (RTC) calibration file (Simulated NISAR Products, 2020). For comparing the characteristics of simulated NISAR and real NISAR data, the characteristics of real NISAR data are provided in Table 1.
Table 1 Characteristics of the Upcoming NISAR Mission
Characteristics | Descriptions |
Operating frequency | L-band (24 cm wavelength) and S-band (10 cm wavelength) |
Full Spatial resolution | 7 m in azimuth over a swath width of ∼242 km, variable in range depending on mode |
Repeat orbit | 12-day |
Altitude | 740 km |
Polarization | L: Single-pol through quad-pol, including compact-pol and split-band dual-pol |
Incidence Angle Range | 34–48° |
Range resolution | 3–10 m |
Azimuth resolution | 7 m |
The NWI map of the area reveals that the area is covered by lakes, rivers, freshwater ponds, forested scrub/shrub wetlands, and emergent wetlands. National Agriculture Imagery Program (NAIP) is responsible for acquiring aerial imagery (with 1-meter resolution) during agricultural growing season within the USA (USDA-FSA-APFO Aerial Photography Field Office, 2015). In this study, a mosaic of NAIP imagery over our study site was overlaid onto the NWI wetland map of the Louisiana state. The series of NAIP imagery was captured in 2019 during the leaf-on season. The wetland map of Louisiana state was downloaded from the NWI website in the format of .shp. Once the .shp file loaded in the ArcMap, the attribute was set to the wetland type on its unique values. The scene of the simulated NISAR imagery was also overlaid to assure the consistency of borders and coregistration among these three layers. Next, the type of dominant wetland in each region was determined using NWI map. Ultimately, the borders of digitized reference polygons were drawn by relying on the NAIP imagery on the leaf-on season.
The classes that were assigned to digitized polygons followed a modified version of the Cowardin classification scheme. This study used six major classes: emergent, forested scrub/shrub wetlands, open water, freshwater pond, forested upland and, cultivated/planted land. The class of forested scrub/shrub wetland is defined as forested swamp or wetland shrub bog or wetland that parallels to palustrine forested and/or palustrine shrub in the Cowardin system. Table 2 provides a summary of the classes considered, their abbreviations, and brief descriptions of each class. The total reference data employed were divided with 70% used as training data for the classifier and the rest used for testing (Table 3).
Table 2 Description of Wetland Classes Used in This Study and Their Corresponding NWI and Cowardin Class Names and Code
Table 3 Number of Training and Testing Polygons
Class | Training polygons | Testing polygons |
Open water | 42 | 12 |
Cultivated/planted land | 36 | 10 |
Forested upland | 24 | 7 |
Forested/shrub wetland | 52 | 15 |
Emergent wetland | 27 | 8 |
Freshwater pond | 25 | 7 |
L-band simulated NISAR data were geo-referenced based on NAD1983 UTM zone 15. After the simulated NISAR images were georeferenced, we implemented a speckle reduction filter. An enhanced Lee filter with a 7 × 7 window size was applied for reducing speckle (Lee et al., 2009). Unlike nonadaptive speckle filters, the enhanced Lee filter is adaptive, meaning it does not smooth the entire image to the same degree. Depending on the spatial location of each pixel, the enhanced Lee adaptive filter preserves edges, shapes, and texture of the image by lowering the standard deviation of neighboring pixels. The ultimate result is an image with reduced noise but has edges and image quality preserved (Choi & Jeong, 2019; Lee et al., 2009).
Polarimetric DecompositionThe main goal of polarimetric decomposition is to separate the backscattering signal based on different scattering mechanisms. Depending on the various sensor and target factors, including the roughness and dielectric properties of the surface, different backscattering mechanisms are expected from a specific land cover (Furtado et al., 2016). Generally, there are two types of decompositions: coherent and incoherent. While coherent decomposition is impractical for separating different natural targets due to a high degree of noise, incoherent decomposition is more applicable. Table 4 shows a number of different incoherent decomposition techniques and their corresponding polarimetric features.
Table 4 Incoherent Decomposition Techniques and Corresponding Polarimetric Features
Decompositions | Extracted polarimetric features |
Pauli | Pauli_a, Pauli_b, Pauli_c |
Krogager | Krogager_Ks, Krogager_Kd, Krogager_Kh |
Freeman-Durden | Freeman_Vol, Freeman_Odd, Freeman_Dbl |
H/A/Alpha | Entropy, Anisotropy, Shannon Entropy, H/A/A, T11, H/A/A_T22, H/A/A_T33, DERD, Polarization Asymmetry, Polarization Fraction, SERD, Radar Vegetation Index, Anisotropy12, Pedestal Height, Alpha, Anisotropy_Lueneburg, Pseudo Probabilities (p1, p2, p3), Lambda |
Yamaguchi | Yamaguchi_Vol, Yamaguchi_Odd, Yamaguchi_Dbl, Yamaguchi_Hlx |
An_Yang | An_Yang Vol, An_Yang_Odd, An_Yang_Dbl, An_Yang_Hlx |
Touzi | TSVM_alpha_s, TSVM_alpha_s1, TSVM_alpha_s2, TSVM_alpha_s3, TSVM_tau_m, TSVM_tau_m1, TSVM_tau_m2, TSVM_phi_s2, TSVM_psi1 TSVM_psi, TSVM_tau_m3, TSVM_phi_s3 TSVM_psi2, TSVM_phi_s1, TSVM_phi_s, TSVM_psi3 |
Singh | Singh _Vol, Singh _Odd, Singh _Dbl, Singh _Hlx |
Huynen | Huynen_T11, Huynen_T22, Huynen_T33 |
VanZyl | VanZyl_Vol, VanZyl_Odd, VanZyl_Dbl |
Aghababaee | Aghababaee_Alphap_mean, Aghababaee_Alphap_SM1, Aghababaee_Alphap_SM2, Aghababaee_Alphap_SM3, Aghababaee_M_SM1, Aghababaee_M_SM2, Aghababaee_M_SM3, Aghababaee_Orientation_max_mean, Aghababaee_Orientation_max_SM1, Aghababaee_Orientation_max_SM2, Aghababaee_Orientation_max_SM3, Aghababaee_Tawp_mean, Aghababaee_Tawp_SM1, Aghababaee_Tawp_SM2, Aghababaee_Tawp_SM3 |
As mentioned previously, polarimetric decomposition techniques can facilitate the physical interpretation of land cover types by decomposing the received signal into the scattering responses. The results of 11 decompositions applied to L-band UAVSAR data as a prototype for planned NISAR of the study area are illustrated in Figures 2a–2k. NAIP imagery of the study area is also presented to provide a visual assessment of the study site (Figure 2l). The map tool was adjusted to maximize the contrast in the RGB format. Color coding of the decomposition images in Figure 2 is as follows: odd scattering is blue, even scattering is red, and volume scattering is green. Double bounce scattering occurs in the presence of inundated vegetation. Most of the dark areas correspond to open water since the backscattering signal from calm water is usually weak (Qi et al., 2012).
Figure 2. Implemented decompositions: (a) Pauli, (b) Krogager, (c) Freeman-Durden, (d) H/A/Alpha, (e) Yamaguchi, (f) An_Yang, (g) Touzi, (h) Singh, (i) Huynen, (j) VanZyl, (k) Aghababaee, and (l) normal color NAIP image of the study area shown in Figure 1. The UAVSAR imagery was dated September 30, 2019, and it was acquired in the morning at 9:17 a.m. local time.
Pauli is based on the decomposition of the scattering matrix in the form of complex addition of the Pauli basis. Pauli decomposition uses the scattering matrix elements to produce three elements: odd scattering, even scattering, and volume scattering (Cloude & Pottier, 1996) Figure 2a. Krogager decomposes the scattering matrix in form of factorization of a sphere, a diplane, and a helix. For interpretation of the Krogager decomposition, the phase values are usually ignored and three parameters that correspond to the weight coefficient of a sphere, diplane, and helix are considered. Although Krogager is a coherent decomposition and generally is more applicable for man-made structures such as urban areas, it has provided a well-balanced interpretation of the targets such as vegetation and water area in Figure 2b. The combination of powers scattered by a sphere, diplane-like and the helix-like component of the Krogager generated the color code for visualization (Krogager, 1990).
The three-component Freeman-Durden decomposition has proven to have a good ability to discriminate between flooded and nonflooded forests especially in tropical regions. The ability of Freeman-Durden in discriminating different vegetation covers can be attributed to the scattering model of this decomposition, which contains randomly oriented dipoles and double bounce scatterer that can result from a corner-reflector (L-shape) target. This decomposition categorizes the scene by extracting the different scattering mechanisms from the covariance matrix (Freeman & Durden, 1998). The decomposition results in Figure 2c show that emergent wetlands and urban areas are identified using double bounce. We expect that open water, freshwater pond areas, and some part of cultivated/planted land will be detected using the odd-bounce mechanism and finally, forested scrub-shrub wetlands will be detected by volume scattering. The fourth decomposition implemented (Figure 2d) is H/A/ALPHA, named for its three main features: Entropy (H), Anisotropy (A), and Alpha angle. Entropy represents the heterogeneity of a single scatterer; the higher the entropy the larger the number of detected scattering mechanisms and low entropy meaning one scattering mechanism is detected. Anisotropy is a normalized ratio of eigenvalues and is defined as the dominancy of second scattering mechanisms. The last parameter is alpha, which is an angle that indicates the type of dominant backscattering mechanisms. A zero alpha angle illustrates that the surface scattering mechanism is prevailing, where 45° and 90° incidence angles represent the dominance of double bounce and volume scattering (Cloude & Pottier, 1997).
Yamaguchi decomposition extends Freeman-Durden decomposition by adding an element of helix scattering, which helps to distinguish co-pol and cross-pol ratios. The presence of helix scattering makes this decomposition perform better in urban areas (Yamaguchi et al., 2005), but the visualization of the decomposition is quite similar to that of the Freeman-Durden (Figure 2e). An & Yang decomposition is similar to that of the Yamaguchi decomposition in terms of decomposition features (Figure 2f) (Wang et al., 2020).
Touzi decomposition uses eigenvalue and eigenvector decomposition similar to H/A/ALPHA but employs a roll-invariant coherent scattering model for decomposing eigenvectors of the coherency matrix. The parameter that is useful for vegetation structure mapping is the phase of symmetric scattering (Touzi, 2007). Although the Touzi decomposition image seems to be noisy (Figure 2g), Touzi (2010) found it to be a powerful decomposition approach for delineating different wetlands types. Touzi decomposition produces 15 different polarimetric features including the symmetric scattering-type magnitude, phase, and target helicity. Target helicity generated from this decomposition is better than H/A/ALPHA for forest characterization. The other component of this decomposition that can discriminate different herbaceous wetlands is the phase. Despite the efficiency of Touzi for wetland monitoring, the optimal integration of its features, including dominant, medium, and low single scatterings, is still debatable. The coherency matrix can also be decomposed into four different elements to create Singh decomposition (Singh et al., 2013). This decomposition allows for full utilization of polarimetric decomposition, due to its ability to distinguish the difference between the dihedral and dipole scattering in volume scattering. This decomposition is also better in identifying urban areas due to its sensitivity to HV polarization (Figure 2h).
Hyunen decomposes the coherency matrix into three different scattering mechanisms which are three eigenvalues of the coherency matrix (Huynen, 1970). Although a theoretically powerful technique, there are major drawbacks of this decomposition; Li and Zhang (2016) found this decomposition provided little insight into the physics of scattering. Irregularity and asymmetry of the scattering elements and instability are other drawbacks of this decomposition. However, Li and Zhang (2016) introduced a unified and improved version of this decomposition that has less irregularity and asymmetry (Figure 2i). Van Zyl is a decomposition of the nonnegative eigenvalue of the covariance matrix (van Zyl, 1993). To estimate all scattering components of the polarimetric data, this decomposition combines the eigenvector decomposition of the covariance matrix to produce three components of odd, double, and volume scatterings (Figure 2j).
Similar to Freeman-Durden, Aghababaee decomposition is a model-based decomposition that employs multipolarimetric SAR data as the sum of Kronecker products (SKP; Aghababaee & Sahebi, 2018). Aghababaee decomposition decomposes the target to direct, double bounce, and random-volume scattering mechanisms. In particular, it can detect multiple scatterers in forested areas by using the SKPs of the covariance matrix. The results of Aghababaee decomposition seem a little noisy. The interesting aspect about the RGB color code of this decomposition is that the open water has different colors in the Yucatan lake area and the river part (Figure 2k). Based on our knowledge from the area, this decomposition has a weakness in discriminating different herbaceous wetlands. However, this figure only shows one of the combinations of the three features that presumably have better visualization of the area. The NWI map of the area is shown in Figure 3.
Object-Based Machine Learning ClassifiersThis study implemented two object-based machine-learning algorithms (SVM and RF) to perform object-based classification of the simulated NISAR imagery. Object-based image analysis (OBIA) clusters pixels in order to create a segmented image that contains a grouped vector and defined geometry. The segmentation integrates contextual and spectral information to consider geographic information, color, and shape of each ground feature. As a result, the created objects bear more resemblance to real-world features than pixel-based classifiers. Additionally, the salt and pepper noise that exists in the pixel-based image classification is eliminated in OBIA classifiers (Frohn et al., 2011; Salehi et al., 2018).
SVM defines decision boundaries called hyperplanes to separate different classes. The iterative learning process of SVM occurs by searching for an optimal hyperplane decision boundary to minimize misclassification (Zhu & Blumberg, 2002). Unlike conventional classification techniques (e.g. maximum likelihood) that assume a normal distribution of training data, SVM is a nonparametric classification technique that holds no initial assumption about training data distribution (Mountrakis et al., 2011). Another appealing characteristic of SVM for geospatial data analysis is its capability to train and minimize the classification error using a small number of training samples. However, the choice of SVM kernel and parameters is not yet defined (Martins et al., 2016), hence in this paper, two parameters of C and gamma for the radial basis function (RBF) kernel were examined.
RF classifiers use an integration of tree predictors in which each tree uses values from independently sampled random vectors (Pal, 2005). RF is an attractive approach because it is also independent of assumptions about the normality of input data (Tian et al., 2016), and as the trees grow, best splits of a random subset of input features are chosen, which reduces the correlation between separate trees. Another advantage of RF is that fewer variables need to be set for training the classifier. The number of trees trained in the RF classifier for this study was 200 and the number of seeds was equal to the square root of the number of samples (Mahdianpari et al., 2017).
Variable importance measures the prediction strength of each variable generated by each tree and can be considered as a postaccuracy assessment for RF classifiers. The relative importance of each feature can be obtained using variable importance analysis (Rodriguez-Galiano et al., 2012). Two methods are common for variable importance analysis: permutation importance or mean decrease accuracy (MDA), which is based on out-of-bag (OOB) error, and Gini importance or mean decrease impurity (MDI). In MDA, the average of each tree accuracies sort in decreasing order as a result of permutation. In this study, we used the MDI procedure for variable importance analysis since we aimed at testing the consistency of our results with other investigations (Amani et al., 2019). MDI measures the importance of each feature in terms of the total number of samples divided by the number of tree splits. After calculating the Gini index for each of the polarimetric features, they were sorted in decreasing order.
The flowchart of the classification is shown in Figure 4. We applied the Enhanced Lee adaptive filter with a window size of 7 × 7 to reduce the speckle noise. All data were georeferenced and terrain corrected using SRTM DEM with a 30-meter resolution. Ultimately, 84 polarimetric features from the L-band simulated NISAR were extracted using PolSARpro™. To investigate the complimentary between different polarimetric features, we integrated the polarimetric features in one single raster composite using ArcGIS Pro. The mean-shift segmentation was implemented using red, green, blue, and near-infrared (NIR) bands of NAIP imagery in ArcGIS pro (Tao et al., 2007). We segmented the area using NAIP imagery, then superimposed the segments (polygons) on UAVSAR imagery and extracted polarimetric stack for each segment. The minimum processing units are segments instead of pixels in the image classification. Following, each segment was then classified into one wetland type. The computational complexity of this approach is low. We adjusted different spectral and spatial detail parameters to achieve the optimum. There are three parameters that need to be adjusted before segmentation: spectral detail, spatial detail, and minimum segment size. The higher spectral detail merges the sudden changes in the color of the image such as shadow and unshadowed areas (was set to 15). The lower the spatial detail, the smoother the output image (was set to 4). Finally, we adjust the minimum segment size. The higher the spatial resolution of the imagery, the higher parameter should be (was set to 9). Once the objects were generated, we trained the classifier using predefined reference polygons. We used the compactness, mean and standard deviation of generated segments as the segmentation attributes. The segmented image was then exported. Once the segments of the image were generated, we applied the classifier on the composite raster file containing polarimetric features. The generated segmented image was then imported to the classifier.
In the next step, the optimized parameters for achieving the highest accuracy of the SVM classifier were evaluated. Once the classifiers were trained, the stacked vector containing polarimetric features was imported into SVM and RF classifiers. The accuracy assessment of the classification results was implemented using the validation samples in form of a confusion matrix. Ultimately, the variable importance analysis of the RF classifier was performed.
Results and DiscussionThematic classification maps produced from the SVM and RF classifiers are shown in Figure 5 with six classes: open water, freshwater pond, forested scrub/shrub, forested upland, emergent wetland, and cultivated/planted land. An initial visual assessment of thematic maps suggests that forested scrub/shrub wetlands are dominant in the study area. The cultivated planted land and open water class are also well delineated from the surrounding neighborhood. Moreover, the RF classifier tends to discriminate between forested upland and forested scrub/shrub wetland in the northeast of the study area and in the center island. Forested shrub wetlands around Yucatan Lake are predominantly classified as emergent wetlands.
Figure 5. Object-based classification results: (a) Support vector machine map and (b) random forest map for the study area shown in Figure 1.
Selecting the parameters of the SVM kernel can considerably affect the overall classification accuracy. Hence, the optimal selection of a different combination of C and Gamma were examined. The resultant overall accuracy for different combinations of parameters are shown in the form of a heatmap in Figure 6. As can be seen, Gamma values higher than 10 and C values lower than 0.01 are better to be avoided in the classification, as they do not result in meaningful results. Overall, the combination of gamma values in the range of 0.001–10 and C values in the range of 0.1–1,000 produced the results with accepted overall accuracies. Among the various tested combination of gamma and C, gamma equal to 0.1, and C equal to 10 provided the highest overall accuracy in our study.
Figure 6. Variation in support vector machine overall accuracy based on different values of Gamma and C in the radial basis function kernel.
To provide a quantitative assessment of the classification results, confusion matrices were calculated for both SVM and RF classifiers (Figure 7). The confusion matrix gives insight into the performance of the classification by providing the number of correctly or incorrectly classified data that is predicted by the classifier. The vertical axis provides the reference data or true label and the horizontal axis denotes the classifier predictions. The overall accuracy of the model can be obtained by dividing the total number of correctly classified sample data by the total number of reference data. We showed the producer's accuracy by dividing the number of correctly classified samples by the number of that specific class reference samples. The diagonal elements represent the producer's accuracy for each class. Overall accuracies of 74.33% and 81.93% were obtained for SVM and RF, respectively. The nondiagonal elements show the confusion between different classes. The nondiagonal elements corresponding to herbaceous vegetation for both classifiers in the confusion matrix are more noticeable compared to the non-vegetated area such as open water and cultivated/planted land. Open water and cultivated/planted classes have distinct backscattering responses (i.e. surface scattering) compared to other classes (dominated by volume and dihedral scattering) in the study area in almost all decompositions, as seen in Figure 2. This makes it easy for the two classifiers to differentiate them from other classes. However, differentiating between open water and cultivated/planted land by the two classifiers is problematic in some areas. This could be explained by the fact that these two classes have backscattering similarities (i.e. mostly dominated by surface scattering and minimum dihedral or volume scattering responses) and thus the classifiers, relying on backscattering response, have a problem distinguishing them. Similar results have been achieved by Chen et al. (2014).
Figure 7. Confusion matrices of classification results: (a) Support vector machine confusion matrix and (b) random forest confusion matrix.
Another interesting finding is the disagreement between forested/shrub and emergent wetlands between RF and SVM classifiers. As it is evident in Figure 5, most classes that SVM classified as forested/shrub wetlands are classified as emergent wetland by RF. Looking at the NAIP imagery of the area (Figure 2l), acquired in September 2019— -UAVSAR was also acquired in September 2019, it appears that wetlands in the disagreement areas are shrub-dominated wetlands. The NWI, which was used as training data in this study, considers these shrub-dominated wetlands as forested/shrub. However, in reality, it is exhibiting scattering features intermediate to an emergent wetland and forested wetland, which the two respective classifiers are interpreting differently, but are forced to decide between.
The relative importance of each polarimetric feature for the RF classifier was assessed using the Gini index. Figure 8 shows 84 polarimetric features sorted in decreasing order of importance, some of the H/A/ALPHA and Aghababaee features dominating in the highest levels. The features that correspond to the same decomposition have the same color. The definition of each of the features on the right is represented in Table 4. As shown in Figure 8, the first five of the polarimetric features have a high impact on the overall accuracy. After the fifteenth feature, the importance of polarimetric features decreases with a gradual slope. Hence, for future studies, presenting the first 15 important features can be sufficient since including the rest of the features would not bring significant enhancement in the accuracy. Notably, the area is mostly covered with a forested wetland that resulted in the importance of volume scattering in the classification results.
Moreover, many polarimetric features correlate with each other, meaning they do not produce distinctive and meaningful results. The top parameters are all parameters that would be useful in identifying forest volume scattering. It is far down before it hits a double bounce parameter, which should be an indicator of inundation in a forested area (Mohammadimanesh et al., 2019). These features are preferred to be eliminated from evaluations since they increase the processing time significantly. Ultimately, the one challenging issue that still needs to be resolved is the influence of the combination of polarimetric features on each other. For instance, the performance of Freeman-Durden features for delineating wetlands can vary in the presence of Yamaguchi features. As a result, if the Freeman-Durden decomposition features are being applied solo or together with Yamaguchi features, the importance of each feature of the former may affect the latter decomposition. The double bounce feature of the Freeman-Durden can delineate wetlands. However, when the Yamaguchi double bounce feature is available, it might perform better than the Freeman-Durden component. Hence, the inclusion of multiple decompositions may impact the importance of each feature.
ConclusionsEfficient extraction of information from geospatial datasets can facilitate the management of complex wetland environments. As we move forward, more diverse data of higher quality are being acquired with satellites on a subweekly basis. In this study, we used object-based SVM and RF classifications to classify L-band UAVSAR data, as a proxy for planned NISAR imagery, using 84 polarimetric features and achieved an overall accuracy of 74.33% and 81.93%, respectively. The choice of parameters of the RBF kernel was the influential factor in the SVM's overall accuracy. The confusion matrix of SVM demonstrates that SVM is a powerful classifier for delineating between different upland classes. Moreover, variable importance analysis of RF classifier demonstrated that among 11 different polarimetric decompositions, H/A/ALPHA, Freeman-Durden, and Aghababaee have the superior ability for discriminating different wetlands types. As expected, among different land-cover classes nonwetland classes including planted/cultivated land and open water had higher accuracies in both classifiers. The used imagery for this study was acquired in during a low flood date, which means that there will be the least contribution of double bounce scattering. For further studies, the inclusion of high flood imagery may increase the overall accuracies. Ultimately, this study confirmed the ability of simulated NISAR configuration for the discrimination of wetland classes using object-based machine learning classifiers.
The implemented ML classification scheme shall provide some initial insight into the application of L-band multipolarization NISAR for wetland mapping and monitoring. The L-band NISAR data are planned to acquire data in a dense time series with global coverage. Large aperture reflectors and real-time digital beamforming are expected to bring a significant improvement in SAR capability for biomass remote sensing and solid earth surface observations. Ultimately, accurate and meaningful wetland maps may leverage multifrequency and multipolarization satellite data with high temporal resolution such as planned NISAR.
AcknowledgmentThis research was partially carried out at the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and Space Administration (80NM0018D0004).
Data Availability StatementThe data used in this study are available online at
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Given the key role wetlands play in climate regulation and shoreline stabilization, identifying their spatial distribution is essential for the management, restoration, and protection of these invaluable ecosystems. The increasing availability of high spatial and temporal resolution optical and synthetic aperture radar (SAR) remote sensing data coupled with advanced machine learning techniques have provided an unprecedented opportunity for mapping complex wetlands’ ecosystems. A recent partnership between the National Aeronautics and Space Administration (NASA) and the Indian Space Research Organization (ISRO) resulted in the design of the NASA‐ISRO SAR (NISAR) mission. In this study, the capability of L‐band simulated NISAR data for wetland mapping in Yucatan Lake, Louisiana, is investigated using two object‐based machine learning approaches: Support vector machine (SVM) and random forest (RF). L‐band Unmanned Aerial Vehicle SAR (UAVSAR) data are exploited as a proxy for NISAR data. Specifically, we evaluated the synergistic use of different polarimetric features for efficient delineation of wetland types, extracting 84 polarimetric features from more than 10 polarimetric decompositions. High spatial resolution National Agriculture Imagery Program imagery is applied for image segmentation using the mean‐shift algorithm. Overall accuracies of 74.33% and 81.93% obtained by SVM and RF, respectively, demonstrate the great possibility of L‐band prototype NISAR data for wetland mapping and monitoring. In addition, variable importance analysis using the Gini index for RF classifier suggests that H/A/ALPHA, Freeman‐Durden, and Aghababaee features have the highest contribution to the overall accuracy.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 Department of Environmental Resources Engineering, State University of New York College of Environmental Science and Forestry (ESF), Syracuse, NY, USA
2 Department of Electrical Engineering, C‐CORE, Memorial University of Newfoundland, St. John's, NL, Canada
3 NASA Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA