1. Introduction
Tropical fruits contain many biologically active compounds such as polysaccharides, polyphenols, vitamins, dietary fiber, and carotenoids [1]. Cherimoya (Annona cherimola) fruit is rich in fermentable carbohydrates, especially glucose and fructose [2]. This species is probably native to the area between Ecuador and Peru and has been cultivated since 1200 BC during the Inca Empire period [3]. Similarly, soursop (Anonna muricata L.) is native to tropical America and Africa [4]; its pulp constitutes more than 80% of the fruit and is composed mainly of water, non-reducing sugars, and carbohydrates [5]. Pineapple (Ananas comosus L. Merr) is one of the most popular fruits in the world, widely distributed in tropical regions, and highly dependent on its attractive aroma and sweet taste [6]. Pineapple is an economically important crop in tropical and subtropical regions, and its sweetness determines its quality [7]. Moreover, its soluble sugars such as sucrose, glucose, and fructose are responsible for the sweet taste of fresh-cut pineapple [6]. However, they also have other phytochemical components that can cause differences among these three fruits.
Traditionally, the most commonly used techniques for quantifying sugars in food are liquid and gas chromatography, nuclear magnetic resonance, infrared spectroscopy, spectrophotometry, polarization measurement, and hyperspectral imaging [8,9], which are limited by their invasiveness, intensive use of resources, and negative environmental impact [10,11]. Therefore, in recent years, there has been an increasing need to explore non-destructive and in situ methods that have been reported to work as well as high-performance liquid chromatography [12]. One such non-destructive method is Raman spectroscopy (RS), that enables to detect the internal quality of fruits and vegetables without destroying them [11,12,13]. In addition, the use of RS is increasing, especially in food safety. This technology makes it possible to detect peak signals by exposing the sample to a laser light [14]. This new technology is based on the detection of frequency changes and variations in the intensity of scattered light when a sample interacts with a laser light source [13], obtaining more selective spectra because it provides narrower spectral bands with abundant and well-resolved chemical information that is easy to interpret [12,13].
Andersen et al. [15] mention that RS has become a robust method to assess the content of individual sugars; for example, previous studies have been reported on the application of RS in the quantification of sugars from aqueous solutions, allowing the discrimination of fructose, glucose, maltose, and sucrose with laser excitation at 785 nm [9,12]. This technique is promising because it does not require pretreatment of fruit samples such as berries [9,15]. In addition, RS provides spectroscopic fingerprints that provide information related to specific chemical bonds or functional groups that are useful for analyzing samples qualitatively and quantitatively [11], which is used with chemometric methods to discriminate foods according to their composition [16,17]. In addition, the combination of RS and chemometric methods, was used for the identification of agricultural product varieties in grains, edible oils, and honey [18]. Also, they were used for the discrimination of foreign fats and oils in milk cream, and yogurt [19]; for the rapid quantification of starch in agricultural products [20]; and for the differentiation of conventional and transgenic cotton seed genotypes [21].
Regarding chemometric tools for food discrimination based on spectral data [16,22], principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) are considered robust linear classification methods [23]. PCA is an unsupervised method that reduces the data to find similarities between samples [24]. On the other hand, PLS-DA is a supervised method that classifies samples into predefined categories by establishing an internal relationship between spectra (X-block) and a classifier (y-variable) [24,25]. These two methods, known for their speed and effectiveness in providing clear sample separation, can predict a sample class by a model-based regression [22,26]. Therefore, in this research, the effect of data pre-processing methods on model performance was evaluated, and the applicability of RS combined with PCA and PLS-DA to identify the types of sugar from three tropical fruits was studied.
2. Materials and Methods
2.1. Reagents
Glucose, fructose, and sucrose standards (≥99.5% purity) were bought from Sigma-Aldrich (St. Louis, MO, USA).
2.2. Fruit Sampling
Four kilograms each of soursop (Anonna muricata L.), cherimoya (Annona cherimola), and pineapple (Ananas comosus L. Merr) were obtained from a local market in the province of Bagua, in the Amazonas region of Peru (5° 38′ 21″ S, 78° 31′ 54″ W, 408 m.a.s.l.).
The samples were taken to the Laboratorio de Investigación en Ingeniería de los Alimentos y Poscosecha of the UNTRM, where five fruits (without bruises or marks) of each variety were selected. Then, the fruits were manually separated from the peels, and the pulp alone was placed in 50 mL Falcon tubes and stored in deep freezing conditions (Eppendorf, Premium 0410, Hamburg, Germany). The samples were then lyophilized (Labconco, 710402010 model, Kansas City, MO, USA) at 0.008 bar and −84°C for 18 h [27].
2.3. Raman Spectroscopy (RS)
A Horiba Raman spectrometer system (Horiba Scientific, XploRA plus, Montpellier, France) equipped with a 532 nm and 785 nm laser and an Olympus BX 41 microscope was used in this study [28]. The spectrometer was equipped with a 1200 line/mm grating during the spectral recording, and a 532 nm laser was focused on the sample on the microscope stage through a 50 LWD (long working distance) objective (Olympus, Tokyo, Japan). The Raman scattering signals were detected by a charge-coupled device (CCD) detector, with a detection range from 200 to 1700 cm−1 in extended mode. The measurement was performed with an integration time of 5 s, with 10 spectral accumulations and 50% filter laser power. The spectral data were collected using the LabSpec 6.7.1 (Horiba, France) software (Supplementary Material S1). According to Huang et al. [29], the spectral resolution was 3 cm−1; and the calibration was checked by 520.47 cm−1 line of silicon [10]. Following Kolašinac et al. [10], five replicates of each fruit variety were used in this study to overcome sample inhomogeneity. Ten spectra were recorded for each replicate (50 spectra for each fruit variety), resulting in 150 spectra. The Raman peaks were assigned according to the published literature.
2.4. Chemometric Analysis
Raman spectral data were processed according to Ditta et al. and Gersony et al. [30,31]. The data were then smoothed, and a baseline correction, vector normalization, and substrate removal were performed using the Solo+MIA software 8.1 (Eigenvector, Research, Inc. Wenatchee, WA, USA). Dimensionality reduction of spectral intensities was performed using principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA).
PCA was performed to form clusters per type of fruit. Hotelling’s T2 and Q residual statistics were used to identify and remove the anomalous spectra that were considered outliers [25]. Hotelling’s T2 and Q residual statistics were calculated at 95% confidence. To perform the partial least squares discriminant analysis (PLS-DA), all data were divided into the calibration and test sets (75% and 25% of the samples, respectively) using the Kennard–Stone approach [10]. The highest values of sensitivity (SE), specificity (SP), and precision (P) were used to evaluate the goodness of the models based on the training and prediction data sets. Precision (P) indicates what fraction of predictions are positive as a positive class; sensitivity (SE), or true positive rate, refers to the percentage of samples of a given class that the model correctly recognizes as belonging to that class; specificity (SP), or true negative rate, refers to samples not belonging to a given class that the model correctly rejects [32].
3. Results and Discussion
3.1. Characterization of the Raman Spectra of Fruits
Diverse peaks were identified in these fruits with different intensities (Figure 1). The core of the pineapple contains significantly higher amounts of fructose and glucose [33]. Meanwhile, the soursop contains carbohydrates, proteins, folate, calcium, phosphorus, iron, and vitamin C, and [34,35,36] reported that the spectrum of sucrose presents the characteristic bands of α−glucose (847 cm−1) and β−fructose (868 cm−1). In this work, glucose was present with the highest intensity in cherimoya and pineapple, at 855.15 cm−1 and 851.93 cm−1, respectively, and with the lowest intensity in soursop (854.08 cm−1). Schulz and Baranska [36] identified other peaks related to sugars in carrots (1462, 1126, and 840 cm−1).
The assignments of molecule vibration associated with the sugars in tropical fruits were made according to the published literature (Table 1). Glucose is one of the most common hexoses with aldehydes in its structure [37], containing a carbonyl group (C-O stretch) [38]. This sugar was found at 1131.22 cm−1, 1134.44 cm−1, and 1133.37 cm−1 in cherimoya, soursop, and pineapple, respectively. Fructose, also known as ‘fruit sugar’, occurs in fruits and plants in a free state and as a building block of sucrose [37]. The Raman bands at 1472–1454 cm−1, corresponding to CH2 scissor vibrations [39], contain peaks at 1464.22 cm−1, 1467.44 cm−1, and 1464.22 cm−1 in cherimoya, soursop, and pineapple, respectively; nevertheless, the peak corresponding to fructose’s β(CCO) vibration was absent in soursop. Although RS allowed the identification of peaks associated with sugars in the three tropical fruits, it was not possible to quantify them. Therefore, it is also suggested that other techniques be used to complement the study. One technique that could be useful to validate the results is gas chromatography–mass spectrometry (GC-MS) analysis and profiling with the HS-SPME technique, since it is an effective tool due to its ability to recover and characterize compounds as mentioned by Mihaylova et al. [40].
3.2. Multivariate Analysis
Before performing any chemometric analysis, the raw spectra needed to be corrected because they contained fluorescence, as in Figure 2a. For this purpose, the spectra underwent a baseline correction with a seventh-order polynomial fitting [41,42], resulting in the spectra of Figure 2b, where the characteristic peaks of the, soursop, and pineapple chemical composition are distinguished. Raw spectra are available in the Supplementary Data file.
3.2.1. Principal Component Analysis
PCA is a chemometric method that allows us to explore patterns [43] and is almost always the first analysis performed on a multivariate data set [44]. The acquired spectral data set may be disturbed by random noise during acquisition; therefore, pre-processing methods are necessary to improve the PCA results. Pre-processing methods can reduce the random noise and improve the spectral characteristics of interest [45]. Figure 3 shows the results of the four pre-processing methods applied to the cherimoya, soursop, and pineapple spectral data sets. The mean centering method (Figure 3a) produced the least visible changes in the corrected spectra compared to the Savitzky–Golay first (Figure 3b), second (Figure 3c) and third (Figure 3d) derivative method. These effects can be observed in the intensities and positions of the peaks.
The changes in the model performance indicators are presented to highlight the effect of the pre-processing methods on the model [45]. Table 2 reports the root mean square error for cross-validation (RMSECV), which is an internal validation error estimated according to De Almeida et al. [46]. Table 2 shows that when the SG first derivative pre-processing method was used, the PCA model was composed of four principal components that captured 92.59% of the total data variance and 7.41% of the Q residuals (Table 3). Mata et al. [21] used a plot of Hotelling’s T2 vs. Q residuals to eliminate four spectra identified as outliers for the discrimination between transgenic and conventional cotton seeds. Figure 4 was obtained after removing two outliers and shows that cherimoya was separated from the other fruits using the four pre-processing methods. However, none became an outlier, since it was at most 95%.
The mean centering pre-processing method produced a three-component PCA model with a captured variance of 76.05%, which is the lowest captured variance among the models (Table 2). In the score plot, a better sample separation can be observed due to the formation of three well-defined clusters (Figure 5). Cherimoya and soursop, which belong to the Annonaceae family [47], form clusters closer to each other because they share many Raman peaks (including sugars), such that their chemical compositions are similar, unlike pineapple, which belongs to another family (Bromeliaceae) and has a different chemical composition. The separation of the clusters may be caused by the differences in the fruits’ phytochemical compound content, such as polyphenols, vitamins, fiber, and carotenoids. The loading plot of the first principal component produced by the four pre-processing methods (Figure 5) shows more pronounced peaks in the 1000–1200 cm−1 band, where the characteristic glucose peak (common to the three tropical fruits) is found.
3.2.2. Partial Least Square Discriminant Analysis
Since the discriminative power of a classification model is unknown, it is necessary to analyze it using different pre-processing methods [48]. Table 4 shows the results of the PLS-DA model applied to tropical fruits under the four pre-processing methods studied (all results are in Table 5). The mean centering pre-processing method and the first derivative of the Savitzky–Golay (SG) pre-processing method show the best prediction results, with a precision of 100% in all tropical fruits. On the training data set, the precision for mean centering and the first derivative of the SG pre-processing method was 93.06–100% and 98.68–100%, respectively. With the second and third derivatives of the SG pre-processing method, the precision was 80–100% and 84.44–100% on the prediction and training data sets, respectively. These values are higher than those reported by Kolašinac et al. [10], whose precision was 19.35–100% on the training data for classifying five varieties of Balkan paprika. These results varied because of the pre-processing method applied. Moreover, in the study of meat carcass classification using Raman spectroscopy, Logan et al. [32] found that the best PLS-DA model correctly classified 94% of grass-fed versus grain-fed carcasses. According to Table 4, the first derivative of the SG pre-processing method produced the best results in the PLS-DA classification model for the training data set, as it correctly classifies the samples in 98.68–100% of cases (P), the samples belonging to its class in 97.43–100% of cases (SE), and the samples not belonging to its class in 98.66–100% of cases (SP). All indicators for the prediction data set were 100%.
We also analyzed the classification power of the PLS-DA model through the receiver operating characteristic (ROC) curves (Figure 6). The ROC curves of cherimoya, soursop, and pineapple were located closer to the upper left corner of the ROC space, which indicates that the first derivative of the SG pre-processing method produces a PLS-DA model with optimal performance, since it can correctly classify the tropical fruits in the classes to which they belong. In all three fruits, the value of the area under the curve (AUC) was one. These results confirm the ROC curves.
4. Conclusions
Sucrose was detected in the three fruits in a spectral region between 1131 and 1134 cm−1. Glucose was detected with 855.15 cm−1 (cherimoya) and 851.93 cm−1 (pineapple) peaks. Finally, fructose appeared with peaks between 1464 and 1467 cm−1 in the three fruits. These results show the high sensitivity of RS for detecting sugars. Therefore, RS is an emerging technology that can provide robustness to data analysis obtained from conventional technologies such as chromatography or spectrophotometry. However, access to chromatography equipment is expensive and not sustainable.
Likewise, RS complemented by chemometric tools such as PCA and PLS-DA, as used in this study, made it possible to establish a high-precision classification model to discriminate among three fruits, thus being a method that can be used to discriminate fruits based on spectral fingerprints using non-destructive technology. Finally, since the fruits have sugar types in common, it is assumed that the discriminatory efficiency of PLS-DA is based not only on the sugar type but also on the content of polyphenols, vitamins, dietary fiber, carotenoids, and other chemical compounds. However, we recommend further studies to corroborate these findings.
Conceptualization, E.M.C.-A. and G.S.-L.; methodology, E.M.C.-A. and L.T.-V.; software, C.R.B.-Z. and E.M.C.-A.; validation, E.M.C.-A., I.S.C.-C. and M.B.; formal analysis, G.S.-L.; investigation, E.M.C.-A. and I.S.C.-C.; resources, J.L.M.-Q.; data curation, E.M.C.-A.; writing—original draft preparation, E.M.C.-A. and C.R.B.-Z.; writing—review and editing, I.S.C.-C. and C.R.B.-Z.; visualization, M.B. and J.L.M.-Q.; supervision, E.M.C.-A. and J.L.M.-Q.; project administration, L.D.M.-A.; funding acquisition, E.M.C.-A., G.S.-L. and C.R.B.-Z. All authors have read and agreed to the published version of the manuscript.
No applicable.
No applicable.
All the
The authors thank Project CUI N° 2343049—Creación de los Servicios de Investigación en Ingeniería de Alimentos y Poscosecha of the Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas.
All authors declare no conflicts of interest.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Figure 1. Raman spectra of glucose, fructose, and sucrose in cherimoya, soursop, and pineapple.
Figure 2. Chemometric analysis of raw (a) and corrected (b) Raman spectra of cherimoya, soursop, and pineapple.
Figure 2. Chemometric analysis of raw (a) and corrected (b) Raman spectra of cherimoya, soursop, and pineapple.
Figure 3. Spectrum pre-processed with (a) mean centering, (b) the first derivative of the Savitzky–Golay method, (c) the second derivative of the Savitzky–Golay method, and (d) the third derivative of the Savitzky–Golay method.
Figure 4. Q vs Hotelling’s T2 in the PCA model. Both parameters are below 95%. Pre-processing methods: (a) mean centering method, (b) first derivative of the Savitzky–Golay method, (c) second derivative of the Savitzky–Golay method, and (d) third derivative of the Savitzky–Golay method.
Figure 5. PCA scores and loadings with different pre-processing methods: (a,b) mean centering, (c,d) first derivative of Savitzky–Golay, (e,f) second derivative of Savitzky–Golay, and (g,h) third derivative of Savitzky–Golay method.
Figure 5. PCA scores and loadings with different pre-processing methods: (a,b) mean centering, (c,d) first derivative of Savitzky–Golay, (e,f) second derivative of Savitzky–Golay, and (g,h) third derivative of Savitzky–Golay method.
Figure 6. ROC curves for the first derivative of the Savitzky–Golay pre-processing method.
Assignments of Raman peaks for sugar standards and cherimoya, soursop, and pineapple.
Assignment 1 | Literature | Fructose | Glucose | Sucrose | Cherimoya | Soursop | Pineapple |
---|---|---|---|---|---|---|---|
δ (CH2) | 1460 [ | 1473.93 | 1471.74 | 1472.81 | 1464.22 | 1467.44 | 1464.22 |
ρ (CH2) | 1346 [ | 1356.80 | 1359.73 | 1355.73 | 1355.73 | 1352.50 | 1353.58 |
ν (CC), ν (CO), β (COH) | 1142 [ | 1133.37 | 1133.37 | 1133.37 | 1131.22 | 1134.44 | 1133.37 |
ν (CC), ν (CO) | 1074 [ | 1080.73 | 1081.81 | 1080.73 | 1078.58 | 1085.03 | 1085.03 |
CH, COH bending | 924 [ | 932.49 | 929.27 | 933.57 | 928.20 | 928.20 | 918.53 |
ν (CH) | 867 [ | 859.45 | 853.00 | 860.52 | 855.15 | 854.08 | 851.93 |
β (CCO) | 596 [ | 594.12 | 595.20 | 601.64 | 595.12 | ||
β (CCO), β (CCC) | 527 [ | 536.12 | 537.19 | 536.12 | 535.04 | 535.04 | |
β (CCC) | 465 [ | 456.63 | 455.55 | 457.70 | 453.40 | 455.55 | 455.55 |
β (CCC), β (CCO), β (OCO) | 424 [ | 416.88 | 415.81 | 416.88 | 417.95 |
1 ν–stretching, β–in-plane bending.
Effect of four data pre-processing methods on the PCA model for cherimoya, soursop, and pineapple.
Pre-Processing Method | PC | RMSECV | % Variance Captured Total | Q Residual | Hotelling’s T2 |
---|---|---|---|---|---|
Mean centering | 3 | 254.18 | 76.05 | 23.95 | 76.05 |
First derivative of Savitzky–Golay | 4 | 44.06 | 92.59 | 7.41 | 92.59 |
Second derivative of Savitzky–Golay | 5 | 25.86 | 91.10 | 8.90 | 91.10 |
Third derivative of Savitzky–Golay | 5 | 39.32 | 81.06 | 18.94 | 81.06 |
Results of the PCA model with different pre-processing methods.
Pre-Processing Methods | RMSEC | RMSECV | Principal Component Number | Eigenvalue of Cov (X) | % Variance Captured This PC | % Variance Captured Total |
---|---|---|---|---|---|---|
Mean centering | 180.47 | 254.18 | 1 | 1.29 × 107 | 39.29 | 39.29 |
2 | 6.36 × 106 | 19.34 | 58.64 | |||
3 | 5.72 × 106 | 17.42 | 76.05 | |||
First derivative of Savitzky–Golay | 31.30 | 44.06 | 1 | 2.17 × 106 | 68.52 | 68.52 |
2 | 3.81 × 105 | 12.02 | 80.54 | |||
3 | 2.75 × 105 | 8.66 | 89.20 | |||
4 | 1.08 × 105 | 3.39 | 92.59 | |||
Second derivative of Savitzky–Golay | 19.20 | 25.86 | 1 | 5.47 × 105 | 54.96 | 54.96 |
2 | 1.90 × 105 | 19.06 | 74.01 | |||
3 | 8.61 × 104 | 8.65 | 82.67 | |||
4 | 4.62 × 104 | 4.65 | 87.31 | |||
5 | 3.77 × 104 | 3.79 | 91.10 | |||
Third derivative of Savitzky–Golay | 29.15 | 39.31 | 1 | 5.12 × 105 | 47.57 | 47.57 |
2 | 1.72 × 105 | 15.96 | 63.53 | |||
3 | 8.88 × 104 | 8.24 | 71.77 | |||
4 | 6.25 × 104 | 5.80 | 77.57 | |||
5 | 3.75 × 104 | 3.48 | 81.06 |
Classification results of the PLS-DA model.
Pre-Processing Method | Sweet Fruit | Training Data Set | Prediction Data Set | ||||
---|---|---|---|---|---|---|---|
P 1 (%) | SE 2 (%) | SP 3 (%) | P (%) | SE (%) | SP (%) | ||
Mean centering | Cherimoya | 93.06 | 93.87 | 97.98 | 100.00 | 100.00 | 100.00 |
Soursop | 97.02 | 97.95 | 96.97 | 100.00 | 100.00 | 100.00 | |
Pineapple | 100.00 | 98.00 | 100.00 | 100.00 | 100.00 | 100.00 | |
First derivative of Savitzky–Golay | Cherimoya | 100.00 | 97.43 | 100.00 | 100.00 | 100.00 | 100.00 |
Soursop | 98.68 | 100.00 | 98.66 | 100.00 | 100.00 | 100.00 | |
Pineapple | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | |
Second derivative of Savitzky–Golay | Cherimoya | 96.68 | 54.83 | 98.11 | 100.00 | 12.50 | 100.00 |
Soursop | 84.44 | 92.00 | 83.05 | 80.00 | 100.00 | 75.00 | |
Pineapple | 100.00 | 89.28 | 100.00 | 100.00 | 100.00 | 100.00 | |
Third derivative of Savitzky–Golay | Cherimoya | 98.13 | 71.79 | 98.61 | 100.00 | 20.00 | 100.00 |
Soursop | 87.94 | 97.22 | 86.67 | 80.59 | 69.23 | 83.33 | |
Pineapple | 98.68 | 100.00 | 98.67 | 100.00 | 100.00 | 100.00 |
1 Precision (P), 2 Sensitivity (SE) or true positive rate, 3 Specificity (SP) or true negative rate.
Result of the PLS-DA model with different pre-processing methods.
Pre-Processing Method | Modeled Class | Cherimoya | Soursop | Pineapple |
---|---|---|---|---|
Mean centering | Sensitivity (Cal) | 0.959 | 1.000 | 1.000 |
Specificity (Cal) | 0.929 | 0.949 | 1.000 | |
Sensitivity (CV) | 0.939 | 1.000 | 1.000 | |
Specificity (CV) | 0.909 | 0.949 | 1.000 | |
Class. Err (Cal) | 0.0557617 | 0.0252525 | 0 | |
Class. Err (CV) | 0.0760668 | 0.0252525 | 0 | |
RMSEC | 0.31426 | 0.273395 | 0.217022 | |
RMSECV | 0.333692 | 0.289513 | 0.229203 | |
Bias | 1.66533 × 10 −16 | 1.66533 × 10 −16 | 1.11022 × 10−16 | |
CV Bias | 0.00127205 | −0.00184987 | 0.000577824 | |
R2 Cal | 0.554066 | 0.662499 | 0.78946 | |
R2 CV | 0.501205 | 0.622938 | 0.766066 | |
First derivative of Savitzky–Golay | Sensitivity (Cal) | 0.974 | 1.000 | 1.000 |
Specificity (Cal) | 0.972 | 0.987 | 1.000 | |
Sensitivity (CV) | 0.897 | 0.972 | 1.000 | |
Specificity (CV) | 0.958 | 0.987 | 1.000 | |
Class. Err (Cal) | 0.0267094 | 0.00666667 | 0 | |
Class. Err (CV) | 0.0721154 | 0.0205556 | 0 | |
RMSEC | 0.26207 | 0.197014 | 0.198749 | |
RMSECV | 0.311281 | 0.252941 | 0.209272 | |
Bias | −0.00915475 | 0.00408662 | 0.00506812 | |
CV Bias | 0.00775709 | −0.0145767 | 0.00681962 | |
R2 Cal | 0.699008 | 0.822952 | 0.819861 | |
R2 CV | 0.595265 | 0.717618 | 0.800737 | |
Second derivative of Savitzky–Golay | Sensitivity (Cal) | 0.548 | 1.000 | 0.964 |
Specificity (Cal) | 0.906 | 0.831 | 1.000 | |
Sensitivity (CV) | 0.548 | 0.960 | 0.929 | |
Specificity (CV) | 0.887 | 0.831 | 1.000 | |
Class. Err (Cal) | 0.272976 | 0.0847458 | 0.0178571 | |
Class. Err (CV) | 0.28241 | 0.104746 | 0.0357143 | |
RMSEC | 0.397685 | 0.343363 | 0.242586 | |
RMSECV | 0.400828 | 0.351006 | 0.244279 | |
Bias | 0.0964188 | −0.131557 | 0.0351384 | |
CV Bias | 0.0958097 | −0.13477 | 0.0389599 | |
R2 Cal | 0.366397 | 0.538207 | 0.740784 | |
R2 CV | 0.356427 | 0.518102 | 0.740114 | |
Third derivative of Savitzky–Golay | Sensitivity (Cal) | 0.821 | 0.944 | 1.000 |
Specificity (Cal) | 0.889 | 0.813 | 0.987 | |
Sensitivity (CV) | 0.821 | 0.917 | 1.000 | |
Specificity (CV) | 0.875 | 0.813 | 0.973 | |
Class. Err (Cal) | 0.145299 | 0.121111 | 0.00666667 | |
Class. Err (CV) | 0.152244 | 0.135 | 0.0133333 | |
RMSEC | 0.348922 | 0.32159 | 0.208805 | |
RMSECV | 0.361874 | 0.336387 | 0.234444 | |
Bias | −0.00584702 | 0.000421702 | 0.00542532 | |
CV Bias | −0.00528552 | −0.00200115 | 0.00728667 | |
R2 Cal | 0.465947 | 0.528059 | 0.801175 | |
R2 CV | 0.425925 | 0.486149 | 0.749947 |
Cal: calibration; CV: cross-validation; RMSEC: root mean square error calibration, RMSECV: root mean square error calibration of cross-validation.
Supplementary Materials
The following supporting information can be downloaded at:
References
1. Mohan, K.; Muralisankar, T.; Uthayakumar, V.; Chandirasekar, R.; Revathi, N.; Ramu Ganesan, A.; Velmurugan, K.; Sathishkumar, P.; Jayakumar, R.; Seedevi, P. Trends in the Extraction, Purification, Characterisation and Biological Activities of Polysaccharides from Tropical and Sub-Tropical Fruits—A Comprehensive Review. Carbohydr. Polym.; 2020; 238, 116185. [DOI: https://dx.doi.org/10.1016/j.carbpol.2020.116185] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32299552]
2. Isas, A.S.; Mariotti Celis, M.S.; Pérez Correa, J.R.; Fuentes, E.; Rodríguez, L.; Palomo, I.; Mozzi, F.; Van Nieuwenhove, C. Functional Fermented Cherimoya (Annona Cherimola Mill.) Juice Using Autochthonous Lactic Acid Bacteria. Food Res. Int.; 2020; 138, 109729. [DOI: https://dx.doi.org/10.1016/j.foodres.2020.109729] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33292965]
3. Perrone, A.; Yousefi, S.; Salami, A.; Papini, A.; Martinelli, F. Botanical, Genetic, Phytochemical and Pharmaceutical Aspects of Annona Cherimola Mill. Sci. Hortic.; 2022; 296, 110896. [DOI: https://dx.doi.org/10.1016/j.scienta.2022.110896]
4. Monarres-Cuevas, O.; Alia-Tejacal, I.; Pérez-Arias, G.A.; López-Martínez, V.; Juarez-Lopez, P.; Valle-Guadarrama, S.; Rodríguez-Verastegui, L.L. Physical, Chemical, and Physiological Characterization of Chilling Injury in Soursop Fruit (Anonna Muricata L.). Postharvest Biol. Technol.; 2022; 193, 112052. [DOI: https://dx.doi.org/10.1016/j.postharvbio.2022.112052]
5. Berumen-Varela, G.; Hernández-Oñate, M.A.; Tiznado-Hernández, M.E. Utilization of Biotechnological Tools in Soursop (Annona Muricata L.). Sci. Hort.; 2019; 245, pp. 269-273. [DOI: https://dx.doi.org/10.1016/j.scienta.2018.10.028]
6. Zhou, D.; Liu, Q.; Wu, C.; Li, T.; Tu, K. Characterization of Soluble Sugars, Glycosidically Bound and Free Volatiles in Fresh-Cut Pineapple Stored at Different Temperature. Food Biosci.; 2021; 43, 101329. [DOI: https://dx.doi.org/10.1016/j.fbio.2021.101329]
7. Dorey, E.; Fournier, P.; Léchaudel, M.; Tixier, P. Modeling Sugar Content of Pineapple under Agro-Climatic Conditions on Reunion Island. Eur. J. Agron.; 2016; 73, pp. 64-72. [DOI: https://dx.doi.org/10.1016/j.eja.2015.10.010]
8. Batsoulis, A.N.; Siatis, N.G.; Kimbaris, A.C.; Alissandrakis, E.K.; Pappas, C.S.; Tarantilis, P.A.; Harizanis, P.C.; Polissiou, M.G. FT-Raman Spectroscopic Simultaneous Determination of Fructose and Glucose in Honey. J. Agric. Food Chem.; 2005; 53, pp. 207-210. [DOI: https://dx.doi.org/10.1021/jf048793m]
9. González-Viveros, N.; Gómez-Gil, P.; Castro-Ramos, J.; Cerecedo-Núñez, H.H. On the Estimation of Sugars Concentrations Using Raman Spectroscopy and Artificial Neural Networks. Food Chem.; 2021; 352, 129375. [DOI: https://dx.doi.org/10.1016/j.foodchem.2021.129375]
10. Kolašinac, S.; Pećinar, I.; Danojević, D.; Stevanović, Z.D. Raman Spectroscopy Coupled with Chemometric Modeling Approaches for Authentication of Different Paprika Varieties at Physiological Maturity. LWT; 2022; 162, 113402. [DOI: https://dx.doi.org/10.1016/j.lwt.2022.113402]
11. Wang, K.; Li, Z.; Li, J.; Lin, H. Raman Spectroscopic Techniques for Nondestructive Analysis of Agri-Foods: A State-of-the-Art Review. Trends Food Sci. Technol.; 2021; 118, pp. 490-504. [DOI: https://dx.doi.org/10.1016/j.tifs.2021.10.010]
12. Monago-Maraña, O.; Afseth, N.K.; Knutsen, S.H.; Wubshet, S.G.; Wold, J.P. Quantification of Soluble Solids and Individual Sugars in Apples by Raman Spectroscopy: A Feasibility Study. Postharvest Biol. Technol.; 2021; 180, 111620. [DOI: https://dx.doi.org/10.1016/j.postharvbio.2021.111620]
13. Xu, S.; Huang, X.; Lu, H. Advancements and Applications of Raman Spectroscopy in Rapid Quality and Safety Detection of Fruits and Vegetables. Horticulturae; 2023; 9, 843. [DOI: https://dx.doi.org/10.3390/horticulturae9070843]
14. Wu, L.; Tang, X.; Wu, T.; Zeng, W.; Zhu, X.; Hu, B.; Zhang, S. A Review on Current Progress of Raman-Based Techniques in Food Safety: From Normal Raman Spectroscopy to SESORS. Food Res. Int.; 2023; 169, 112944. [DOI: https://dx.doi.org/10.1016/j.foodres.2023.112944] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37254368]
15. Andersen, P.V.; Afseth, N.K.; Aaby, K.; Gaarder, M.Ø.; Remberg, S.F.; Wold, J.P. Prediction of Chemical and Sensory Properties in Strawberries Using Raman Spectroscopy. Postharvest Biol. Technol.; 2023; 201, 112370. [DOI: https://dx.doi.org/10.1016/j.postharvbio.2023.112370]
16. Dehelean, A.; Cristea, G.; Feher, I.; Hategan, A.R.; Magdas, D.A. Differentiation of Transylvanian Fruit Distillates Using Supervised Statistical Tools Based on Isotopic and Elemental Fingerprint. J. Sci. Food Agric.; 2023; 103, pp. 1454-1463. [DOI: https://dx.doi.org/10.1002/jsfa.12241]
17. Fernández, J.A.; Abbas, O.; Dardenne, P.; Baeten, V. Discrimination of Corsican Honey by FT-Raman Spectroscopy and Chemometrics. Biotechnol. Agron. Soc. Environ.; 2011; 15, pp. 75-83.
18. Li, W.; Tan, F.; Cui, J.; Ma, B. Fast Identification of Soybean Varieties Using Raman Spectroscopy. Vib. Spectrosc.; 2022; 123, 103447. [DOI: https://dx.doi.org/10.1016/j.vibspec.2022.103447]
19. Yazgan Karacaglar, N.N.; Bulat, T.; Boyaci, I.H.; Topcu, A. Raman Spectroscopy Coupled with Chemometric Methods for the Discrimination of Foreign Fats and Oils in Cream and Yogurt. J. Food Drug Anal.; 2019; 27, pp. 101-110. [DOI: https://dx.doi.org/10.1016/j.jfda.2018.06.008]
20. Nakajima, S.; Kuroki, S.; Ikehata, A. Selective Detection of Starch in Banana Fruit with Raman Spectroscopy. Food Chem.; 2023; 401, 134166. [DOI: https://dx.doi.org/10.1016/j.foodchem.2022.134166]
21. da Mata, M.M.D.; Rocha, P.D.; Farias, I.K.T.D.; Silva, J.L.B.D.; Medeiros:, E.P.; Silva, C.S.; Simões, S.D.S. Distinguishing Cotton Seed Genotypes by Means of Vibrational Spectroscopic Methods (NIR and Raman) and Chemometrics. Spectrochim. Acta Part A Mol. Biomol. Spectrosc.; 2022; 266, 120399. [DOI: https://dx.doi.org/10.1016/j.saa.2021.120399] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34597869]
22. Alamar, P.D.; Caramês, E.T.S.; Poppi, R.J.; Pallone, J.A.L. Detection of Fruit Pulp Adulteration Using Multivariate Analysis: Comparison of NIR, MIR and Data Fusion Performance. Food Anal. Methods; 2020; 13, pp. 1357-1365. [DOI: https://dx.doi.org/10.1007/s12161-020-01755-x]
23. Sun, M.; Jiang, H.; Yuan, W.; Jin, S.; Zhou, H.; Zhou, Y.; Zhang, C. Discrimination of Maturity of Camellia Oleifera Fruit On-Site Based on Generative Adversarial Network and Hyperspectral Imaging Technique. Food Meas.; 2024; 18, pp. 10-25. [DOI: https://dx.doi.org/10.1007/s11694-023-02145-7]
24. Balcázar-Zumaeta, C.R.; Castro-Alayo, E.M.; Cayo-Colca, I.S.; Idrogo-Vásquez, G.; Muñoz-Astecker, L.D. Metabolomics during the Spontaneous Fermentation in Cocoa (Theobroma Cacao L.): An Exploraty Review. Food Res. Int.; 2023; 163, 112190. [DOI: https://dx.doi.org/10.1016/j.foodres.2022.112190]
25. Kniese, J.; Race, A.M.; Schmidt, H. Classification of Cereal Flour Species Using Raman Spectroscopy in Combination with Spectra Quality Control and Multivariate Statistical Analysis. J. Cereal Sci.; 2021; 101, 103299. [DOI: https://dx.doi.org/10.1016/j.jcs.2021.103299]
26. Deng, H.; He, R.; Long, M.; Li, Y.; Zheng, Y.; Lin, L.; Liang, D.; Zhang, X.; Liao, M.; Lv, X. et al. Comparison of the Fruit Volatile Profiles of Five Muscadine Grape Cultivars (Vitis Rotundifolia Michx.) Using HS-SPME-GC/MS Combined With Multivariate Statistical Analysis. Front. Plant Sci.; 2021; 12, 728891. [DOI: https://dx.doi.org/10.3389/fpls.2021.728891]
27. Balcázar-Zumaeta, C.R.; Pajuelo-Muñoz, A.J.; Trigoso-Rojas, D.F.; Iliquin-Chavez, A.F.; Fernández-Romero, E.; Yoplac, I.; Muñoz-Astecker, L.D.; Rodríguez-Hamamura, N.; Maza Mejía, I.M.; Cayo-Colca, I.S. et al. Reduction in the Cocoa Spontaneous and Starter Culture Fermentation Time Based on the Antioxidant Profile Characterization. Foods; 2023; 12, 3291. [DOI: https://dx.doi.org/10.3390/foods12173291]
28. Castro-Alayo, E.M.; Torrejón-Valqui, L.; Cayo-Colca, I.S.; Cárdenas-Toro, F.P. Evaluation of the Miscibility of Novel Cocoa Butter Equivalents by Raman Mapping and Multivariate Curve Resolution–Alternating Least Squares. Foods; 2021; 10, 3101. [DOI: https://dx.doi.org/10.3390/foods10123101]
29. Huang, Y.; Karashima, T.; Yamamoto, M.; Hamaguchi, H. Molecular-level Pursuit of Yeast Mitosis by Time- and Space-resolved Raman Spectroscopy. J Raman Spectrosc.; 2003; 34, pp. 1-3. [DOI: https://dx.doi.org/10.1002/jrs.960]
30. Ditta, A.; Nawaz, H.; Mahmood, T.; Majeed, M.I.; Tahir, M.; Rashid, N.; Muddassar, M.; Al-Saadi, A.A.; Byrne, H.J. Principal Components Analysis of Raman Spectral Data for Screening of Hepatitis C Infection. Spectrochim. Acta Part A Mol. Biomol. Spectrosc.; 2019; 221, 117173. [DOI: https://dx.doi.org/10.1016/j.saa.2019.117173]
31. Gersony, J.T.; McClelland, A.; Holbrook, N.M. Raman Spectroscopy Reveals High Phloem Sugar Content in Leaves of Canopy Red Oak Trees. New Phytol.; 2021; 232, pp. 418-424. [DOI: https://dx.doi.org/10.1111/nph.17465] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33991343]
32. Logan, B.G.; Hopkins, D.L.; Schmidtke, L.M.; Fowler, S.M. Assessing Chemometric Models Developed Using Raman Spectroscopy and Fatty Acid Data for Northern and Southern Australian Beef Production Systems. Meat Sci.; 2022; 187, 108753. [DOI: https://dx.doi.org/10.1016/j.meatsci.2022.108753] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35168178]
33. Banerjee, S.; Ranganathan, V.; Patti, A.; Arora, A. Valorisation of Pineapple Wastes for Food and Therapeutic Applications. Trends Food Sci. Technol.; 2018; 82, pp. 60-70. [DOI: https://dx.doi.org/10.1016/j.tifs.2018.09.024]
34. Agu, K.C.; Okolie, P.N. Proximate Composition, Phytochemical Analysis, and in Vitro Antioxidant Potentials of Extracts of Annona Muricata (Soursop). Food Sci. Nutr.; 2017; 5, pp. 1029-1036. [DOI: https://dx.doi.org/10.1002/fsn3.498]
35. Leivas, C.; Cantu-Jungles, T.; Barbosa, B.; De Paula Werner, M.; Iacomini, M.; Cordeiro, L.M.C. Investigation of the Chemical Structure and Analgesic and Anti-Inflammatory Properties of Polysaccharides That Constitute the Dietary Fibers of Soursop (Annona Muricata) Fruit. Food Res. Int.; 2023; 166, 112588. [DOI: https://dx.doi.org/10.1016/j.foodres.2023.112588]
36. Schulz, H.; Baranska, M. Identification and Quantification of Valuable Plant Substances by IR and Raman Spectroscopy. Vib. Spectrosc.; 2007; 43, pp. 13-25. [DOI: https://dx.doi.org/10.1016/j.vibspec.2006.06.001]
37. Wiercigroch, E.; Szafraniec, E.; Czamara, K.; Pacia, M.Z.; Majzner, K.; Kochan, K.; Kaczor, A.; Baranska, M.; Malek, K. Raman and Infrared Spectroscopy of Carbohydrates: A Review. Spectrochim. Acta Part A Mol. Biomol. Spectrosc.; 2017; 185, pp. 317-335. [DOI: https://dx.doi.org/10.1016/j.saa.2017.05.045]
38. Zheng, Z.; Yao, X.; Zhang, S.; Fang, L.; Jiang, Z. In-Situ Raman Study of α-D-Glucose under Different Pressure and Temperature. J. Mol. Struct.; 2023; 1274, 134539. [DOI: https://dx.doi.org/10.1016/j.molstruc.2022.134539]
39. Yao, X.; Fu, C.; Zhang, S.; Cheng, L.; Jiang, Z. Structure Investigation of β-D-Fructose Crystal under High Pressure: Raman Scattering, IR Absorption, and Synchrotron X-Ray Diffraction. J. Mol. Struct.; 2020; 1220, 128746. [DOI: https://dx.doi.org/10.1016/j.molstruc.2020.128746]
40. Mihaylova, D.; Popova, A.; Dincheva, I.; Pandova, S. HS-SPME-GC–MS Profiling of Volatile Organic Compounds and Polar and Lipid Metabolites of the “Stendesto” Plum–Apricot Kernel with Reference to Its Parents. Horticulturae; 2024; 10, 257. [DOI: https://dx.doi.org/10.3390/horticulturae10030257]
41. Guo, Y.; Jin, W.; Wang, W.; He, Y.; Qiu, S. Baseline Correction for Raman Spectra Using a Spectral Estimation-Based Asymmetrically Reweighted Penalized Least Squares Method. Appl. Opt. AO; 2023; 62, pp. 4766-4776. [DOI: https://dx.doi.org/10.1364/AO.489478] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37707250]
42. Hu, H.; Bai, J.; Xia, G.; Zhang, W.; Ma, Y. Improved Baseline Correction Method Based on Polynomial Fitting for Raman Spectroscopy. Photonic Sens; 2018; 8, pp. 332-340. [DOI: https://dx.doi.org/10.1007/s13320-018-0512-y]
43. Brereton, R.G. Applied Chemometrics for Scientists; 1st ed. Wiley: Hoboken, NJ, USA, 2007; ISBN 978-0-470-01686-2
44. Esbensen, K.H.; Geladi, P. 2.13-Principal Component Analysis: Concept, Geometrical Interpretation, Mathematical Background, Algorithms, History, Practice. Comprehensive Chemometrics; Elsevier: Oxford, UK, 2009; pp. 211-226. ISBN 978-0-444-52701-1
45. Roger, J.-M.; Boulet, J.-C.; Zeaiter, M.; Rutledge, D.N. 3. 01-Pre-Processing Methods. Comprehensive Chemometrics; Elsevier: Oxford, UK, 2020; pp. 1-75. ISBN 978-0-444-64166-3
46. De Almeida, M.R.; Correa, D.N.; Rocha, W.F.C.; Scafi, F.J.O.; Poppi, R.J. Discrimination between Authentic and Counterfeit Banknotes Using Raman Spectroscopy and PLS-DA with Uncertainty Estimation. Microchem. J.; 2013; 109, pp. 170-177. [DOI: https://dx.doi.org/10.1016/j.microc.2012.03.006]
47. Padmanabhan, P.; Paliyath, G. Annonaceous Fruits. Encyclopedia of Food and Health; Caballero, B.; Finglas, P.M.; Toldrá, F. Academic Press: Oxford, UK, 2016; pp. 169-173. ISBN 978-0-12-384953-3
48. Devos, O.; Downey, G.; Duponchel, L. Simultaneous Data Pre-Processing and SVM Classification Model Selection Based on a Parallel Genetic Algorithm Applied to Spectroscopic Data of Olive Oils. Food Chem.; 2014; 148, pp. 124-130. [DOI: https://dx.doi.org/10.1016/j.foodchem.2013.10.020] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24262536]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Tropical fruits such as cherimoya, soursop, and pineapple share sugars (glucose, fructose, and sucrose) in common but may differ in the content of other phytochemicals. In the present work, confocal Raman spectroscopy and partial least squares discriminant analysis (PLS-DA) were used to establish a classification model among the three fruits and to evaluate the effect of pre-processing methods on the model’s performance. The Raman spectra showed that glucose was present in the fruits in the 800–900 cm−1 band and the 1100–1200 cm−1 band. While sucrose was present in the bands of 1131.22 cm−1, 1134.44 cm−1, and 1133.37 cm−1 in the three fruits, fructose was present in the bands of 1464.22 cm−1, 1467.44 cm−1, and 1464.22 cm−1 in cherimoya, soursop, and pineapple. The accuracy of the PLS-DA model varied according to the pre-processing methods used. The Savitzky–Golay first derivative method produced a model with 98.69–100% and 100% precision on the training and prediction data, respectively.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details





1 Instituto de Investigación, Innovación y Desarrollo para el Sector Agrario y Agroindustrial (IIDAA), Facultad de Ingeniería y Ciencias Agrarias, Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas, Calle Higos Urco 342-350-356, Chachapoyas 01000, Peru;
2 Facultad de Ingeniería Zootecnista, Agronegocios y Biotecnología, Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas, Calle Higos Urco 342-350-356, Chachapoyas 01000, Peru;
3 Facultad de Ingeniería Civil y Ambiental, Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas, Calle Higos Urco 342-350-356, Chachapoyas 01000, Peru;