Introduction
Metabolic syndrome (MetS) is the leading cause of cardiometabolic diseases and has become a significant health concern worldwide [1,2]. According to the National Cholesterol Education Program (NCEP), MetS is diagnosed based on elevated glucose levels, reduced HDL cholesterol, elevated triglycerides (TG), obesity, and hypertension [3]. Beyond low-density lipoprotein cholesterol (LDL-C), TG-rich lipoproteins, such as very-low-density lipoprotein (VLDL), have been shown to cause cardiac remodeling [4–6], particularly by inducing atrial dilation, which is a hallmark of atrial myopathy and represents a risk for atrial fibrillation (AF). Other hazardous lipids, such as diacylglycerols and ceramides, can promote atrial myopathy by promoting mitochondrial dysfunction and apoptosis [7]. Some saturated free fatty acids can affect cardiac ion channels by activating macrophages and IL-6 [8], and postprandial VLDL is correlated with atrial remodelling [5], suggesting that dietary fat and postprandial circulatory lipids can affect arrhythmogenicity [9]. Moreover, VLDL is toxic to the cardiovascular system and can induce vulnerability to AF in patients with MetS but not in healthy subjects [10,11].
Circulatory lipids, except for some albumin-binding fatty acids, are carried by lipoproteins. VLDL lipotoxicity is primarily affected by the lipid core, which consists of a variable abundance of lipid species. Quantification of lipid species in lipoproteins can be performed using nuclear magnetic resonance (NMR), mass spectrometry (MS), gas chromatography (GC), and Fourier transform-ion cyclotron resonance (FT-ICR) [12]. However, these costly and time-consuming methods are not feasible for daily applications in clinical scenarios. Moreover, none of them can be used as a monitoring assay when repeated testing is required, for instance, when assessing blood lipid changes after a nutritional intervention or exercise workout.
Attenuated total reflectance Fourier transform infrared (ATR-FTIR) spectroscopy is a technique for analyzing the chemical composition of materials by detecting their characteristic absorption of mid-infrared light. In ATR-FTIR, molecules absorb mid-infrared light at specific wavenumbers (or frequencies), depending on their chemical bonds and structure. Each type of bond (e.g., C = O and C–H) has a unique absorption pattern, creating a molecular fingerprint. ATR-FTIR can quickly analyze the molecular fingerprint of an organic sample and reveal its principal biomolecular composition and interactions and has been validated in a broad spectrum of biomedical applications, such as SARS-CoV-2 detection [13,14], urinalysis for assessing renal hyperfiltration [15], and the detection of HIV infection [16]. The advantages of FTIR, with its quick assaying time and demand for tiny blood samples (0.5 µL, equivalent to a finger-prick sampling amount) prove that it is an ideal method for blood lipid analysis.
In ATR-FTIR, lipids in the blood samples produce clustered signals representing IR reflections from structures with a range of molecules conjugated to various proteins. Conventional algorithms for spectral analysis cannot categorize lipid species. Algorithms are used to directly estimate and quantify the amount of specific lipids to generate raw reflection data with complex interactions. Therefore, an effective analytical method for ATR-FTIR is required to facilitate its application in lipid analyses.
Data-driven processing enables rapid development and application of artificial intelligence (AI). The introduction of the gradient boosting tree model (GBT), linear regression model (LR), and histogram-based gradient boosting tree model (HGBT), an enhanced variant of the GBT model, makes machine learning useful in complex biomedical prediction. Some examples include the prediction of progression-free survival in prostate cancer [17], estimation of delivery date in pregnancy [18], and evaluation of one-year mortality in patients receiving coronary stenting [19]. AI-assisted ATR-FTIR spectroscopy has great potential as a rapid, economical, and resilient diagnostic platform for detecting and monitoring metabolic disorders. Thus, it can be widely used in the clinical setting. In this study, we combined machine learning and ATR-FTIR data to develop a valuable method for distinguishing between patients with and without MetS and for predicting conventional laboratory test results.
Methods
Setting and participants
The study protocol was approved by the Kaohsiung Medical University Hospital Institutional Review Board (KMUHIRB-F(II)-20210091) and was implemented in accordance with the Good Clinical Practice Guidelines, the principles of the Declaration of Helsinki, and all applicable legislation and regulations of the Kaohsiung Medical University Hospital. This retrospective study accessed data from 2021/05/25, and all authors had access to information that could identify individual information after data collection. All study participants provided their informed written consent. Two blood samples were collected from each subject and centrifuged to keep the serum frozen for the ATR-FTIR experiments. Clinical and laboratory data were obtained. The study participants were divided into two groups based on the presence or absence of MetS. The clinical diagnosis of MetS is defined by the existence of at least three criteria as follows: waist circumference (greater than or equal to 90 cm in men and greater than or equal to 80 cm in women), blood pressure (≥ 130/85 mmHg), blood glucose level (fasting state greater than or equal to 100 mg/dl), TG (greater than 150 mg/dL), and high-density lipoprotein cholesterol (HDL-C level, for male less than or equal to 40 mg/dL, for female less than or equal to 50 mg/dL).
ATR-FTIR spectral data acquisition
All ATR-FTIR spectral data were obtained at Taiwan Light Source (TLS) BL14A1 of the National Synchrotron Radiation Research Center (NSRRC) in Taiwan. This study applied a small sample volume (0.5 µL to a Ge ATR crystal, as shown in Fig 1(A). A modulated mid-infrared beam was directed into the Ge crystal, generating an evanescent wave at the crystal surface, allowing the light to interact with the sample. The sample absorbs light at characteristic wavenumbers, and the remaining light is detected using a liquid nitrogen-cooled mercury cadmium telluride (MCT) infrared detector. The resulting spectrum shows peaks corresponding to specific molecular bonds, providing a unique chemical profile of the sample. ATR-FTIR samples must be stored in a −80 °C freezer to ensure sample stability before measurement and brought to room temperature naturally before measurement. The samples were placed on the ATR crystal surface to allow moisture to evaporate naturally before the signal acquisition, ensuring that the moisture did not interfere with the spectra. If the moisture in the samples was not completely removed, interference appeared at 3500 cm−1, 2100 cm−1, and 1640 cm−1. A measurement time of approximately 5 min (~300 s) did not cause any degradation of the components.
[Figure omitted. See PDF.]
(A). The schematic depicts the attenuated total reflection for acquiring the FTIR spectra of a blood sample. The mid-infrared light is modulated using the Michaelson interferometer of an FTIR spectrometer. (B). Representative ATR-FTIR spectrometry. Raw data are obtained from plasma samples from one MetS and one non-MetS individual. The absorption regions of the major biomolecules are indicated.
Each ATR-FTIR spectral dataset of a blood sample contained 1738 data points in the spectral range 4000–650 cm−1 (Fig 1(B)). Three spectral ranges, 3000–2800 cm−1, 1800–1650 cm−1, and 1500–1350 cm−1, of characteristic lipid absorption were selected as inputs. The characteristic absorption bands of the blood samples for subsequent analysis included absorption bands (1) in the spectral range of 3000–2800 cm−1 and 1500–1350 cm−1 assigned to the methyl and methylene groups of lipid molecules, (2) at ~ 1740 cm−1 for the carbonyl group of lipids, (3) at ~ 1650 cm−1 assigned to C = O stretching of the peptide bond, and (4) in the spectral range of 1500–1000 cm−1 for the phosphodiester groups of nucleic acids, phospholipids, and side chains of peptides, and glycans [20–23] as shown in Fig 1(B).
ATR-FTIR spectral analysis
All analyses and model calculations were performed using Python 3.9, NumPy, scikit-learn, imbalanced-learn, and matplotlib packages. The ATR-FTIR spectral data were used mainly to (1) determine the presence of MetS (classification task) and (2) predict the blood levels of several lipids (regression task) as follows: TG, total cholesterol (CHOL), HDL-C, low-density lipoprotein-cholesterol (LDL-C), and very-low-density lipoprotein cholesterol (VLDL-C). In conventional data analysis, data points that deviate significantly from the distribution are typically identified as outliers and eliminated to optimize the performance of the models. However, in this study, data outliers were not removed because all blood lipid levels were obtained from a trustworthy and standardized laboratory at the medical center.
Models used for data preparation
Based on the previously reported prevalence of MetS, approximately 33% [23] oversampling was performed to generate a balanced dataset. An adaptive synthetic approach (ADASYN) [24] was used to generate calibration data. The performance and credibility of the model were assessed using 10-fold cross-validation.
Models used for data analysis
The models used in the regression task include basic linear regression (LR), gradient boosting regression tree (GBTR), random forest regression tree, and histogram-based gradient boosting tree classifier (HGBTR), which have the advantage of starting the estimation quickly without loss of accuracy [25]. For the classification task, that is, the determination of MetS, logistic regression, random forest tree classifier (RF), gradient boosting tree classifier (GBT), and histogram-based gradient boosting tree classifier (HGBT) [25]. The model for each task was individually tested to determine the best target prediction results.
Metrics for model explanation
The relationships between the actual laboratory values and model-predicted values for each prediction target were presented in the residual plot to evaluate the regression task results, along with the calculated values of R squared (R2), mean absolute error (MAE), and mean absolute percentage error (MAPE), which were calculated by taking the average of the absolute differences between the predicted and actual values, divided by the actual values, and then expressed as a percentage. This can provide a clear and interpretable measure of prediction accuracy by expressing errors as a percentage of actual values. Subsequently, 10-fold cross-validation was used to calculate the mean and standard deviation (SD) of the above metrics across 10 models. A smaller standard deviation suggests that the performance of the model is relatively consistent across different folds.
The receiver operating characteristic curve (ROC) and confidence intervals (CI) were presented for 10-fold cross-validation to evaluate the classification task results. Other evaluation metrics included area under the curve (AUC), F1, precision, and recall. The mean and SD were also calculated.
The results of the regression models were plotted as the weights of the best-performing GBT model over a 10-fold range. The scaled weighting was from −1 to 1, greater than 0.5, or smaller than −0.5 for the selected absorption wavenumbers for further evaluation.
To determine MetS, t-distributed stochastic neighbor embedding (TSNE) [26] was used to verify the data distribution in a high-dimensional 2D plane. Regarding the model explanation for the classification task, we applied the SHAP (SHapley Additive exPlanations) values with a tree-based explainer [27] to determine the nine wavenumbers with the greatest impact. This method enabled us to assess the consistency between our findings and those in the current literature.
Results
Study subjects
This study included 25 non-MetS and 25 MetS patients (Table 1). The two groups were age- and sex-matched. The MetS group had higher incidences of hypertension, diabetes mellitus, body weight, waist circumference, and hip circumference. In the MetS group, 56% of the subjects were taking lipid-lowering drugs. The mean MetS score was 3.96 for the MetS group and 0.32 for the non-MetS group. In the laboratory data, the MetS group had greater TG, VLDL-C, non-HDL-C, uric acid, alanine aminotransferase (AST), and aspartate aminotransferase (ALT). HDL-C was significantly lower in the MetS group (Table 1). Upon examining the distribution of blood lipid values, we discovered that some patients with MetS exhibited extreme TG, VLDL, and LDL values. Prior to running the model, our data analysis indicated significant differences between the MetS and non-MetS groups for all parameters except CHOL (more information in S1 Appendix).
[Figure omitted. See PDF.]
ATR-FTIR spectral data characteristics
Spectral data were collected following a standard calibration procedure, using 150 blood samples from 50 individuals. Each blood sample was repeatedly used to collect FTIR spectral data to generate 491 FTIR data (300 from MetS and 191 from non-MetS subjects). The ADASYN method was used to match the prevalence balance with 33% of MetS cases, and the dataset numbers were 300 for MetS and 609 for non-MetS cases. We categorized lipid values into five equally divided groups from maximum to minimum to examine potential spectral differences among varying lipid levels (S2 Appendix). The mean ATR-FTIR spectrum for each group is plotted as a line, with the shaded regions representing plus one standard deviation. This visualization highlights the trends in spectral differences across lipid groups. However, although these trends suggest potential relationships, they do not allow for direct quantitative conclusions or inferences. Overlapping or minimal differences in FTIR spectral absorption across lipid ranges further suggest that nonlinear associations may exist which are not directly observable in raw spectral data. All the figures are listed in S2 Appendix. Grouped Spectrum with Specified Lipid Value Ranges.
Regression task results
The correlation between the predicted values derived from the different regression models and actual laboratory values is shown in Fig 2 and Table 2. Prediction errors (residuals = actual values – predicted values) were also examined for TG, CHOL, HDL-C, LDL-C, and VLDL-C (Fig 2A–2E). The best model for predicting all targets based on R2 was HGBT, whereas the model with the best MAE value was RF (Table 2). In our analysis, we observed that deviance decreased across most of the folds during the cross-validation training and testing phases. However, a few folds exhibited irregular patterns, possibly due to outliers in the data distribution (S3 Appendix for more information).
[Figure omitted. See PDF.]
Correlation between the predictive values of FTIR spectrometry and the reference data (left panel). Residual plots (right panels) of IR absorbance versus the blood concentrations of (A)TG, (B) total cholesterol, (C) HDL-C, (D) LDL-C, and (E) VLDL-C.
[Figure omitted. See PDF.]
In the residual evaluation (right panels of Fig 2A–2E), the performance of LR (shown by green dots) was poor (R2 < 0.3) with large MAE values, whereas the performance of RF/GBT/HGBT was similar. Among the different lipid parameters, the prediction performance was the best for TG and the worst for LDL-C, particularly with the LR model. The widely used Friedewald formula for calculating LDL-C is known to have inherent errors when compared with direct blood measurements, although it has been reported to be the most accurate way to calculate fasting LDL-C in Chinese people [28]. For example, the MAPE and MAE for LDL-C, calculated using the Friedewald formula, were 77.5% and 92.3 mg/dL, respectively. In this study, our best model (Random Forest) achieves a significantly lower MAPE of 7.3% and an MAE of 9.7 mg/dL for LDL-C prediction.
Next, coefficient modeling was performed. The wavenumbers that were strongly correlated in predicting specific lipid parameters in the GBT model are shown in the heatmap (Fig 3). To predict TG, the data points were 2991, 1724, and 1722 cm−1. For LDL-C, the data included points at 2933, 2931, 2906, and 1766 cm−1. For HDL-C, the data included points at 2906, 2904, 2902, and 2898 cm−1. For Chol were at 2931, 2916, 2906, and 1755 cm−1. Furthermore, the data points were 2991 and 1722 cm−1 for VLDL-C, respectively.
[Figure omitted. See PDF.]
The blue line represents the mean value of the FTIR spectra of total MetS, the mean value of the FTIR spectra of total MetS, and the orange dotted line represents the mean value of the FTIR spectra of total non-MetS. The degree of darkness in the vertical line indicates the magnitude of the correction. The unselected frequency bands are blank. The frequencies with an absolute coefficient larger than 0.5 are specifically labeled.
We further investigated the effect of MetS on the lipid value predictions of the model. The error distribution for our predictions showed that MetS subjects had more prediction errors than non-MetS subjects, indicating that the prediction accuracy was more accurate in the absence of MetS. To explore this further, we tested models trained and validated separately for MetS and non-MetS groups. The results indicated that the models trained on non-MetS data performed consistently well on non-MetS test data, whereas those trained on MetS data exhibited significant variability (S3 Appendix for more information).
These findings suggest that, while our model is adequate for general rapid scans, additional data and validation are necessary with a sufficient population size for patients with MetS. This also highlights the substantial variability in lipid disorders among patients with MetS. Moreover, FTIR provides an efficient and visualized screen to demonstrate lipid abnormalities in MetS (S4 Appendix for detailed information).
Classification task results
These models were used to examine their ability to classify MetS and non-MetS groups, as shown in Fig 3 and Table 2. The best model was RF, with an AUC of 0.978, followed by HGBT (AUC, 0.972), GBT (AUC, 0.967), and LR (AUC, 0.851) (Fig 4A). The F1 score was similar, approximately around 0.88 for RF, HBGT, and GBT, but 0.69 for LR. The results suggest that RF, HBGT, and GBT can distinguish between MetS and non-MetS using ATR-FTIR data.
[Figure omitted. See PDF.]
(A) ROCs curves of the different models in the MetS prediction task. The color bands represent the CI ranges. (B) The t-SNE method shows separated clustering between MetS and non-MetS spectrometry data. Light-blue dots represent non-MetS and red dots represent MetS. (C) The SHAP values for the GBT model in the MetS prediction task are presented in descending order, with features ranked from top to bottom based on their influence on prediction. (D) SHAP waterfall plots of the METS predictions. (E) SHAP partial-dependence plots for the absorption at 1660 cm−1. (F) SHAP partial-dependence plots for the absorption at 1653 cm−1. (G) SHAP partial-dependence plots for the absorption at 1655 cm−1.
Projection verification of all the samples was performed using the t-SNE method (Fig 4B). Each projection point indicates high-dimensional data on the 2D plane; the color marks MetS, and the point density reflects the similarity of the data. In the 2D projection, the distinction between most MetS and non-MetS samples was well illustrated; however, a few data points remained relatively similar and overlapped. The excellent AUC and F1 ratings of all three models (Table 2) indicate that MetS and non-MetS could be correctly classified using an appropriately constructed nonlinear model with FTIR spectral data. Concurrently, as shown in Fig 4C, the SHAP value analysis revealed that the top nine wavenumbers were 2843, 1496, 1653, 1699, 1655, 2891, 2935, 1410, and 2902 cm−1. Notably, four wavenumbers were within the 2800–3000 cm−1 range, two wavenumbers were within the 1350–1500 cm−1 range, and three were within the 1600–1700 cm−1 range. This finding indicates that when the task requires a comprehensive assessment rather than a single numerical value, the model considers all relevant lipid-associated frequency bands. SHAP waterfall plots identified bands that significantly contributed to MetS predictions (Fig 4D). For instance, wavenumbers of 1660, 1655, and 1657 cm−1 were found to be the most significant contributors to the model’s prediction of MetS (Fig 4E–G). These plots revealed that wavenumbers of 1653, 1655, and 1660 cm−1 exhibited a stronger association with MetS predictions once they exceeded a certain threshold, further supporting the idea that FTIR detects long-chain lipid structures in the blood, which are crucial features for our model’s predictions.
Discussion
This study has proven our hypothesis that ATR-FTIR spectral data and appropriate machine learning methods can be used to accurately predict lipid levels in blood samples and distinguish between the absence and presence of MetS.
With the emergence of new lipid-lowering drugs based on various pharmaceutical targets, stratification of patients with MetS has become increasingly important. The more comprehensive the lipid disorders are determined, the more precisely the treatment will be chosen to optimize the efficacy and outcome benefits. Lipids are a broad group of organic molecules with structures that produce different infrared absorptions or emissions, which can be detected using FTIR spectrometry. FTIR spectrometry has been used to quantify the oil and fat content of food [29–31]. For instance, it precisely detects different lipid fractions and free fatty acids in oleaginous yeasts [30]. Bel’skaya et al. published their work on using model solutions in applying FTIR spectroscopy for the quantitative analysis of blood serum [32]. Portaccio et al. published their study results on ATR-FTIR spectra of commercial lipid samples and demonstrated specific fingerprint regions in ATR-FTIR spectra for phospholipids (phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine, and phosphatidylinositol) from 3500 to 2800 cm−1 and 1800 to 600 cm−1, sphingolipids (ceramide, sphingosine 1-phosphate, ceramide 1-phosphate, and sphingomyelin) from 3500 to 2800 cm−1 and 1800 to 600 cm−1, and CHOL from 3150–2800 cm−1.[33] Their study proved the usefulness of FTIR spectra for analyzing complex lipids, and this application can ultimately contribute to lipid research and its implications for human health. Similarly, FTIR could precisely predict TG, CHOL, and different-density lipoproteins in this study. The best spectral region(s) for predicting total TG were 2927, 2925, 1718, and 1500 cm−1, 1718 cm−1 for CHOL, and 2937 cm−1 for HDL-C. Those were 2895, 2806, 2800, 1745, 1743, and 1716 cm−1 for LDL-C, and were 2931 and 1500 cm−1 for VLDL-C. The regions are close to those in a previous study, which evaluated 31 healthy individuals and suggested spectral ranges for lipids as 1741, 1747, 2854, and 2929 cm−1 [34]. Jessen et al. used FTIR spectrometry to analyze blood plasma without any sample preparation to achieve stable and accurate measurements for concentrations of clinically relevant chemical constituents, including TG and CHOL [35]. Similarly, Li et al. used FTIR spectrometry to determine glucose and CHOL in whole blood and a partial least squares (PLS) regression fusion modeling method was used [36].
The choice of machine learning method affects the accuracy of the prediction of blood lipids, and this phenomenon was clearly demonstrated in this study. Linear and logistic regression models perform poorly with high-dimensional data. It can be inferred that our model did not initially extract the features using PCA. However, RF, GPT, and HGPT continue to perform well for TG/CHOL/HDL-C, where a specific algorithm alone is sufficient to extract information from the original input and map the results.
The prediction performance also varied among different blood lipids. For LDL-C, which had a relatively wide concentration range, the R2 value was relatively low. Nevertheless, the MAE value was between 9 and 12, and our model achieved a considerably low prediction error (as low as 7.3%) (Table 2). For instance, with an actual LDL level of 100 mg/dL, our model’s prediction ranged from 93.7 to 107.3 mg/dL. Previous studies suggested a desirable bias of ± 6.8% and a desirable total allowable error (TEa) of ± 13.7% for LDL laboratory data and calculated errors by analyzing the European Federation of Clinical Chemistry and Laboratory Medicine database [37]. Therefore, the prediction error for LDL-C in this study would not affect the medication decisions or clinical judgment. In the case of TG, all the machine learning models exhibited superior prediction accuracy, with R2 values of 0.801–0.851 (Table 2).
Consistent with the existing knowledge, the model’s interpretation revealed that the focus of HGBT lies between 3000 and 2800 cm−1. We also attempted to restrict the input variables to the theoretical spectrum and entire spectral range in wavenumbers; however, the performance did not increase considerably, indicating that the interval chosen for this experiment was adequate. Using the high-dimensional projection technique, t-SNE, FTIR spectral analysis can effectively identify MetS samples. The SHAP values represent the contribution of each feature (in our case, each wavenumber) to the model’s output relative to the average prediction (Fig 4). Essentially, SHAP values explain how a particular feature impacts the model’s prediction, serving as a simplified, interpretable model to elucidate the workings of a more complex machine learning model. However, interpretability challenges can occasionally be introduced because the SHAP uses a linear approach to explain nonlinear models. Literature indicates that the range of 1645–1675 cm−1 primarily corresponds to non-cyclic carbon-carbon double bonds (C = C), which may be related to the structure of long-chain lipids in the blood. Notably, the combined contribution of the other 252 parameters was greater than that of the single wavenumber at 1660 cm−1, indicating the advantage of using SHAP in the FTIR spectrometry analysis.
This study has several limitations. Firstly, the limited sample size hindered our ability to evaluate the effectiveness of FTIR spectrometry in determining the severity of MetS. Secondly, although differences between plasma and serum FTIR spectra have been reported, this aspect was not investigated here. Lastly, the selection and exclusion of absorption bands in FTIR spectra may have introduced some degree of bias. Although the number of data items was sufficient for machine learning calculations, evaluating the model will require additional data. As we employed the ADASYN approach to balance the prevalence of the data, since additional data were created to compensate, the impact on actual applications must be further investigated. Fourth, verification using liquid chromatography-mass spectrometry (LC-MS) standards is lacking. The Taiwan Accreditation Foundation approved the laboratory for the fulfillment of ISO 15189 (Medical Laboratories Requirements for Quality and Competence), which supports the reliability of blood lipid concentrations. Future studies should comprehensively compare the accuracy, sensitivity, and specificity of the results from the AI-assisted FTIR spectrometry method against established methods with lipid standards. And we may explore causation-focused models to investigate molecular associations further and validate these predictions. However, given the study’s current scope, our machine learning approach offers valuable insights into broad lipid level trends without requiring the extensive molecular characterization that lies beyond this study’s objectives.
The developed AI-assisted FTIR spectrometry method is a nondestructive, reagent-free, and robust technology capable of accurately predicting lipid concentrations and identifying MetS in a prepared plasma sample. Macroscopic assessment is an unmet clinical requirement for diseases associated with metabolic disorders. This method provides a complete scout view of all MetS-related biochemical constituents, including all lipid species. Therefore, residual lipid-related risks can be revealed entirely. This comprehensive feature provides the potential for establishing an assessment system to stratify individuals with MetS to guide medical decisions and monitor therapeutic effects. In conclusion, AI-assisted FTIR spectrometry blood sample analysis is worth future large population studies to be applicable in clinical settings.
Supporting information
S1 Appendix. Distribution of blood lipid values in MetS and non-MetS groups.
https://doi.org/10.1371/journal.pone.0316522.s001
S2 Appendix. Grouped spectrum with specified lipid value ranges.
https://doi.org/10.1371/journal.pone.0316522.s002
S3 Appendix. Training deviance of the GBT model on different targets.
https://doi.org/10.1371/journal.pone.0316522.s003
S4 Appendix. The effect of the presence of MetS on the model’s lipid values prediction.
https://doi.org/10.1371/journal.pone.0316522.s004
S5 File. The de-identified supporting data.
https://doi.org/10.1371/journal.pone.0316522.s005
(CSV)
Acknowledgments
The authors thank Miss Wun-Jyun Jhuang for her skillful handling of the blood sample preparation and on the ATR-FTIR platform. The authors also thank Dr. Yi-Hsiung Lin for his assistance with research coordination at the beginning.
References
1. 1. Saklayen MG. The global epidemic of the metabolic syndrome. Curr Hypertens Rep. 2018;20(2):12. Epub 2018/02/26. pmid:29480368; PMCID: PMC5866840
* View Article
* PubMed/NCBI
* Google Scholar
2. 2. Deng H, Guo P, Zheng M, Huang J, Xue Y, Zhan X, et al. Epidemiological characteristics of atrial fibrillation in Southern China: results from the Guangzhou heart study. Sci Rep. 2018;8(1):17829. Epub 2018/12/13. pmid:30546024; PMCID: PMC6292893
* View Article
* PubMed/NCBI
* Google Scholar
3. 3. Ahmed M, Kumari N, Mirgani Z, Saeed A, Ramadan A, Ahmed MH, et al. Metabolic syndrome; definition, pathogenesis, elements, and the effects of medicinal plants on it’s elements. J Diabetes Metab Disord. 2022;21(1):1011–22. Epub 2022/03/15. pmid:35673459; PMCID: PMC9167315
* View Article
* PubMed/NCBI
* Google Scholar
4. 4. Lee H-C, Lin Y-H. The pathogenic role of very low density lipoprotein on atrial remodeling in the metabolic syndrome. Int J Mol Sci. 2020;21(3):891. Epub 2020/01/30. pmid:32019138; PMCID: PMC7037013
* View Article
* PubMed/NCBI
* Google Scholar
5. 5. Lee H-C, Shin S-J, Huang J-K, Lin M-Y, Lin Y-H, Ke L-Y, et al. The role of postprandial very-low-density lipoprotein in the development of atrial remodeling in metabolic syndrome. Lipids Health Dis. 2020;19(1):210. Epub 2020/09/22. pmid:32962696; PMCID: PMC7507670
* View Article
* PubMed/NCBI
* Google Scholar
6. 6. Lee H-C, Akhmedov A, Chen C-H. Spotlight on very-low-density lipoprotein as a driver of cardiometabolic disorders: Implications for disease progression and mechanistic insights. Front Cardiovasc Med. 2022;9:993633. Epub 2022/10/04. pmid:36267630; PMCID: PMC9577298
* View Article
* PubMed/NCBI
* Google Scholar
7. 7. Robinson H, Molendijk J, Shah AK, Rahman T, Anderson GJ, Hill MM. Rapid assessment of lipidomics sample purity and quantity using fourier-transform infrared spectroscopy. Biomolecules. 2022;12(9):1265. Epub 2022/09/08. pmid:36139104; PMCID: PMC9496531
* View Article
* PubMed/NCBI
* Google Scholar
8. 8. Alí A, Boutjdir M, Aromolaran AS. Cardiolipotoxicity, inflammation, and arrhythmias: role for interleukin-6 molecular mechanisms. Front Physiol. 2019;9:1866. Epub 2019/01/07. pmid:30666212; PMCID: PMC6330352
* View Article
* PubMed/NCBI
* Google Scholar
9. 9. Sacks FM, Lichtenstein AH, Wu JHY, Appel LJ, Creager MA, Kris-Etherton PM, et al. Dietary fats and cardiovascular disease: a presidential advisory from the american heart association. Circulation. 2017;136(3):e1–23. Epub 2017/06/15. pmid:28620111;
* View Article
* PubMed/NCBI
* Google Scholar
10. 10. Lee H-C, Lin H-T, Ke L-Y, Wei C, Hsiao Y-L, Chu C-S, et al. VLDL from metabolic syndrome individuals enhanced lipid accumulation in atria with association of susceptibility to atrial fibrillation. Int J Mol Sci. 2016;17(1):134. pmid:26805814
* View Article
* PubMed/NCBI
* Google Scholar
11. 11. Lee H-C, Cheng W-C, Ma W-L, Lin Y-H, Shin S-J, Lin Y-H. Association of lipid composition and unsaturated fatty acids of VLDL with atrial remodeling in metabolic syndrome. Sci Rep. 2023;13(1):6575. Epub 2023/04/21. pmid:37085694; PMCID: PMC10121655
* View Article
* PubMed/NCBI
* Google Scholar
12. 12. Rosqvist F, Fridén M, Vessby J, Rorsman F, Lind L, Risérus U. Circulating fatty acids from high-throughput metabolomics platforms as potential biomarkers of dietary fatty acids. Clin Nutr. 2022;41(12):2637–43. Epub 2022/10/13. pmid:36308982;
* View Article
* PubMed/NCBI
* Google Scholar
13. 13. Heino H, Rieppo L, Männistö T, Sillanpää MJ, Mäntynen V, Saarakkala S. Diagnostic performance of attenuated total reflection Fourier-transform infrared spectroscopy for detecting COVID-19 from routine nasopharyngeal swab samples. Sci Rep. 2022;12(1):20358. Epub 2022/11/27. pmid:36437268; PMCID: PMC9701801
* View Article
* PubMed/NCBI
* Google Scholar
14. 14. Martinez-Cuazitl A, Vazquez-Zapien GJ, Sanchez-Brito M, Limon-Pacheco JH, Guerrero-Ruiz M, Garibay-Gonzalez F, et al. ATR-FTIR spectrum analysis of saliva samples from COVID-19 positive patients. Sci Rep. 2021;11(1):19980. Epub 2021/10/07. pmid:34620977; PMCID: PMC8497525
* View Article
* PubMed/NCBI
* Google Scholar
15. 15. Kurultak İ, Sarigul N, Kodal NS, Korkmaz F. Urinalysis of individuals with renal hyperfiltration using ATR-FTIR spectroscopy. Sci Rep. 2022;12(1):20887. Epub 2022/12/03. pmid:36463336; PMCID: PMC9719484
* View Article
* PubMed/NCBI
* Google Scholar
16. 16. Silva LG, Péres AFS, Freitas DLD, Morais CLM, Martin FL, Crispim JCO, et al. ATR-FTIR spectroscopy in blood plasma combined with multivariate analysis to detect HIV infection in pregnant women. Sci Rep. 2020;10(1):20156. Epub 2020/11/19. pmid:33214678; PMCID: PMC7677535
* View Article
* PubMed/NCBI
* Google Scholar
17. 17. Jia Y, Quan S, Ren J, Wu H, Liu A, Gao Y, et al. MRI radiomics predicts progression-free survival in prostate cancer. Front Oncol. 2022;12:974257. Epub 2022/08/30. pmid:36110963; PMCID: PMC9468743
* View Article
* PubMed/NCBI
* Google Scholar
18. 18. Wu Y, Zhang Y, Zou X, Yuan Z, Hu W, Lu S, et al. Estimated date of delivery with electronic medical records by a hybrid GBDT-GRU model. Sci Rep. 2022;12(1):4892. Epub 2022/03/22. pmid:35318360; PMCID: PMC8941136
* View Article
* PubMed/NCBI
* Google Scholar
19. 19. Lin H, Xue Y, Chen K, Zhong S, Chen L. Acute coronary syndrome risk prediction based on gradient boosted tree feature selection and recursive feature elimination: a dataset-specific modeling study. PLoS One. 2022;17(11):e0278217. Epub 2022/11/29. pmid:36445881; PMCID: PMC9707772
* View Article
* PubMed/NCBI
* Google Scholar
20. 20. Kacuráková M, Mathlouthi M. FTIR and laser-Raman spectra of oligosaccharides in water: characterization of the glycosidic bond. Carbohydr Res. 1996;284(2):145–57. pmid:8653717
* View Article
* PubMed/NCBI
* Google Scholar
21. 21. Banyay M, Sarkar M, Gräslund A. A library of IR bands of nucleic acids in solution. Biophys Chem. 2003;104(2):477–88. pmid:12878315
* View Article
* PubMed/NCBI
* Google Scholar
22. 22. Barth A. Infrared spectroscopy of proteins. Biochim Biophys Acta. 2007;1767(9):1073–101. Epub 2007/06/28. pmid:17692815
* View Article
* PubMed/NCBI
* Google Scholar
23. 23. Casal HL, Mantsch HH. Polymorphic phase behaviour of phospholipid membranes studied by infrared spectroscopy. Biochim Biophys Acta. 1984;779(4):381–401. pmid:6391546
* View Article
* PubMed/NCBI
* Google Scholar
24. 24. Haibo H, Yang B, Garcia EA, Shutao L. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence); 2008:1–8.
* View Article
* Google Scholar
25. 25. Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W. LightGBM: a highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems; 2017;30.
26. 26. van der Maaten LJP, Hinton GE. Visualizing data using t-SNE. J Mac Lear Res. 2008;9:2579–605.
* View Article
* Google Scholar
27. 27. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2(1):56–67. pmid:32607472
* View Article
* PubMed/NCBI
* Google Scholar
28. 28. Xu J, Du X, Zhang S, Xiang Q, Zhu L, Liu L. The accuracy of four formulas for LDL-C calculation at the fasting and postprandial states. Front Cardiovasc Med. 2022;9:944003. Epub 2022/08/18 pmid:36061569; PMCID: PMC9433804
* View Article
* PubMed/NCBI
* Google Scholar
29. 29. Ye Q, Meng X. Highly efficient authentication of edible oils by FTIR spectroscopy coupled with chemometrics. Food Chem. 2022;385:132661. Epub 2022/03/08. pmid:35299015;
* View Article
* PubMed/NCBI
* Google Scholar
30. 30. Shapaval V, Brandenburg J, Blomqvist J, Tafintseva V, Passoth V, Sandgren M, et al. Biochemical profiling, prediction of total lipid content and fatty acid profile in oleaginous yeasts by FTIR spectroscopy. Biotechnol Biofuels. 2019;12:140. Epub 2019/06/06. pmid:31178928; PMCID: PMC6551905
* View Article
* PubMed/NCBI
* Google Scholar
31. 31. Mousa MAA, Wang Y, Antora SA, Al-Qurashi AD, Ibrahim OHM, He H-J, et al. An overview of recent advances and applications of FT-IR spectroscopy for quality, authenticity, and adulteration detection in edible oils. Crit Rev Food Sci Nutr. 2022;62(29):8009–27. Epub 2021/05/12. pmid:33977844
* View Article
* PubMed/NCBI
* Google Scholar
32. 32. Bel’skaya LV, Sarf EA, Solomatin DV. Application of FTIR Spectroscopy for quantitative analysis of blood serum: a preliminary study. Diagnostics (Basel). 2021;11(12):2391. pmid:34943626
* View Article
* PubMed/NCBI
* Google Scholar
33. 33. Portaccio M, Faramarzi B, Lepore M. Probing biochemical differences in lipid components of human cells by means of ATR-FTIR spectroscopy. Biophysica. 2023;3(3):524–38.
* View Article
* Google Scholar
34. 34. Huber M, Kepesidis KV, Voronina L, Božić M, Trubetskov M, Harbeck N, et al. Stability of person-specific blood-based infrared molecular fingerprints opens up prospects for health monitoring. Nat Commun. 2021;12(1):1511. Epub 2021/03/08. pmid:33686065; PMCID: PMC7940620
* View Article
* PubMed/NCBI
* Google Scholar
35. 35. Jessen TE, Höskuldsson AT, Bjerrum PJ, Verder H, Sørensen L, Bratholm PS, et al. Simultaneous determination of glucose, triglycerides, urea, cholesterol, albumin and total protein in human plasma by Fourier transform infrared spectroscopy: direct clinical biochemistry without reagents. Clin Biochem. 2014;47(13–14):1306–12. Epub 2014/06/02. pmid:24943400;
* View Article
* PubMed/NCBI
* Google Scholar
36. 36. Li Z, Lv H, Li T, Si G, Wang Q, Lv J, et al. Reagent-free simultaneous determination of glucose and cholesterol in whole blood by FTIR-ATR. Spectrochim Acta A Mol Biomol Spectrosc. 2017;178:192–7. Epub 2017/02/03. pmid:28187317
* View Article
* PubMed/NCBI
* Google Scholar
37. 37. Steyn N, Muller Rossouw H, Pillay TS, Martins J. Comparability of calculated LDL-C with directly measured LDL-C in selected paediatric and adult cohorts. Clin Chim Acta. 2022;537:158–66. pmid:36265577
* View Article
* PubMed/NCBI
* Google Scholar
Citation: Gau T-P, Wen J-H, Lu I-W, Huang P-Y, Lee Y-C, Lee W-P, et al. (2025) Application of attenuated total reflection–Fourier transform infrared spectroscopy in semi-quantification of blood lipids and characterization of the metabolic syndrome. PLoS ONE 20(1): e0316522. https://doi.org/10.1371/journal.pone.0316522
About the Authors:
Tz-Ping Gau
Roles: Conceptualization, Data curation, Formal analysis, Methodology, Software, Validation, Visualization, Writing – original draft
Affiliations: Department of Anesthesiology, Kaohsiung Medical University Hospital, Kaohsiung, Taiwan, Center for Big Data Research, Kaohsiung Medical University, Kaohsiung, Taiwan, Department of Information Management, National Sun Yat-sen University, Kaohsiung, Taiwan
Jen-Hung Wen
Roles: Data curation, Formal analysis, Writing – original draft
Affiliation: School of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
I-Wei Lu
Roles: Formal analysis, Software
Affiliation: Center for General Education, Kaohsiung Medical University, Kaohsiung, Taiwan
Pei-Yu Huang
Roles: Data curation, Investigation, Methodology
Affiliation: Group of Life Science, National Synchrotron Radiation Research Center, Hsinchu, Taiwan
Yao-Chang Lee
Roles: Conceptualization, Investigation, Methodology, Validation, Visualization, Writing – review & editing
Affiliations: Group of Life Science, National Synchrotron Radiation Research Center, Hsinchu, Taiwan, Department of Optics and Photonics, National Central University, Chung-Li, Taiwan, Department of Chemistry, National Tsing Hua University, Hsinchu, Taiwan
ORICD: https://orcid.org/0000-0003-1761-3746
Wei-Po Lee
Roles: Formal analysis, Methodology, Software
Affiliation: Department of Information Management, National Sun Yat-sen University, Kaohsiung, Taiwan
Hsiang-Chun Lee
Roles: Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing
E-mail: [email protected]
Affiliations: Division of Cardiology, Department of Internal Medicine, Kaohsiung Medical University Gangshan Hospital, Kaohsiung, Taiwan, Lipid Science and Aging Research Center, Kaohsiung Medical University, Kaohsiung, Taiwan, Department of Internal Medicine, School of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
ORICD: https://orcid.org/0000-0002-5877-9059
[/RAW_REF_TEXT]
1. Saklayen MG. The global epidemic of the metabolic syndrome. Curr Hypertens Rep. 2018;20(2):12. Epub 2018/02/26. pmid:29480368; PMCID: PMC5866840
2. Deng H, Guo P, Zheng M, Huang J, Xue Y, Zhan X, et al. Epidemiological characteristics of atrial fibrillation in Southern China: results from the Guangzhou heart study. Sci Rep. 2018;8(1):17829. Epub 2018/12/13. pmid:30546024; PMCID: PMC6292893
3. Ahmed M, Kumari N, Mirgani Z, Saeed A, Ramadan A, Ahmed MH, et al. Metabolic syndrome; definition, pathogenesis, elements, and the effects of medicinal plants on it’s elements. J Diabetes Metab Disord. 2022;21(1):1011–22. Epub 2022/03/15. pmid:35673459; PMCID: PMC9167315
4. Lee H-C, Lin Y-H. The pathogenic role of very low density lipoprotein on atrial remodeling in the metabolic syndrome. Int J Mol Sci. 2020;21(3):891. Epub 2020/01/30. pmid:32019138; PMCID: PMC7037013
5. Lee H-C, Shin S-J, Huang J-K, Lin M-Y, Lin Y-H, Ke L-Y, et al. The role of postprandial very-low-density lipoprotein in the development of atrial remodeling in metabolic syndrome. Lipids Health Dis. 2020;19(1):210. Epub 2020/09/22. pmid:32962696; PMCID: PMC7507670
6. Lee H-C, Akhmedov A, Chen C-H. Spotlight on very-low-density lipoprotein as a driver of cardiometabolic disorders: Implications for disease progression and mechanistic insights. Front Cardiovasc Med. 2022;9:993633. Epub 2022/10/04. pmid:36267630; PMCID: PMC9577298
7. Robinson H, Molendijk J, Shah AK, Rahman T, Anderson GJ, Hill MM. Rapid assessment of lipidomics sample purity and quantity using fourier-transform infrared spectroscopy. Biomolecules. 2022;12(9):1265. Epub 2022/09/08. pmid:36139104; PMCID: PMC9496531
8. Alí A, Boutjdir M, Aromolaran AS. Cardiolipotoxicity, inflammation, and arrhythmias: role for interleukin-6 molecular mechanisms. Front Physiol. 2019;9:1866. Epub 2019/01/07. pmid:30666212; PMCID: PMC6330352
9. Sacks FM, Lichtenstein AH, Wu JHY, Appel LJ, Creager MA, Kris-Etherton PM, et al. Dietary fats and cardiovascular disease: a presidential advisory from the american heart association. Circulation. 2017;136(3):e1–23. Epub 2017/06/15. pmid:28620111;
10. Lee H-C, Lin H-T, Ke L-Y, Wei C, Hsiao Y-L, Chu C-S, et al. VLDL from metabolic syndrome individuals enhanced lipid accumulation in atria with association of susceptibility to atrial fibrillation. Int J Mol Sci. 2016;17(1):134. pmid:26805814
11. Lee H-C, Cheng W-C, Ma W-L, Lin Y-H, Shin S-J, Lin Y-H. Association of lipid composition and unsaturated fatty acids of VLDL with atrial remodeling in metabolic syndrome. Sci Rep. 2023;13(1):6575. Epub 2023/04/21. pmid:37085694; PMCID: PMC10121655
12. Rosqvist F, Fridén M, Vessby J, Rorsman F, Lind L, Risérus U. Circulating fatty acids from high-throughput metabolomics platforms as potential biomarkers of dietary fatty acids. Clin Nutr. 2022;41(12):2637–43. Epub 2022/10/13. pmid:36308982;
13. Heino H, Rieppo L, Männistö T, Sillanpää MJ, Mäntynen V, Saarakkala S. Diagnostic performance of attenuated total reflection Fourier-transform infrared spectroscopy for detecting COVID-19 from routine nasopharyngeal swab samples. Sci Rep. 2022;12(1):20358. Epub 2022/11/27. pmid:36437268; PMCID: PMC9701801
14. Martinez-Cuazitl A, Vazquez-Zapien GJ, Sanchez-Brito M, Limon-Pacheco JH, Guerrero-Ruiz M, Garibay-Gonzalez F, et al. ATR-FTIR spectrum analysis of saliva samples from COVID-19 positive patients. Sci Rep. 2021;11(1):19980. Epub 2021/10/07. pmid:34620977; PMCID: PMC8497525
15. Kurultak İ, Sarigul N, Kodal NS, Korkmaz F. Urinalysis of individuals with renal hyperfiltration using ATR-FTIR spectroscopy. Sci Rep. 2022;12(1):20887. Epub 2022/12/03. pmid:36463336; PMCID: PMC9719484
16. Silva LG, Péres AFS, Freitas DLD, Morais CLM, Martin FL, Crispim JCO, et al. ATR-FTIR spectroscopy in blood plasma combined with multivariate analysis to detect HIV infection in pregnant women. Sci Rep. 2020;10(1):20156. Epub 2020/11/19. pmid:33214678; PMCID: PMC7677535
17. Jia Y, Quan S, Ren J, Wu H, Liu A, Gao Y, et al. MRI radiomics predicts progression-free survival in prostate cancer. Front Oncol. 2022;12:974257. Epub 2022/08/30. pmid:36110963; PMCID: PMC9468743
18. Wu Y, Zhang Y, Zou X, Yuan Z, Hu W, Lu S, et al. Estimated date of delivery with electronic medical records by a hybrid GBDT-GRU model. Sci Rep. 2022;12(1):4892. Epub 2022/03/22. pmid:35318360; PMCID: PMC8941136
19. Lin H, Xue Y, Chen K, Zhong S, Chen L. Acute coronary syndrome risk prediction based on gradient boosted tree feature selection and recursive feature elimination: a dataset-specific modeling study. PLoS One. 2022;17(11):e0278217. Epub 2022/11/29. pmid:36445881; PMCID: PMC9707772
20. Kacuráková M, Mathlouthi M. FTIR and laser-Raman spectra of oligosaccharides in water: characterization of the glycosidic bond. Carbohydr Res. 1996;284(2):145–57. pmid:8653717
21. Banyay M, Sarkar M, Gräslund A. A library of IR bands of nucleic acids in solution. Biophys Chem. 2003;104(2):477–88. pmid:12878315
22. Barth A. Infrared spectroscopy of proteins. Biochim Biophys Acta. 2007;1767(9):1073–101. Epub 2007/06/28. pmid:17692815
23. Casal HL, Mantsch HH. Polymorphic phase behaviour of phospholipid membranes studied by infrared spectroscopy. Biochim Biophys Acta. 1984;779(4):381–401. pmid:6391546
24. Haibo H, Yang B, Garcia EA, Shutao L. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence); 2008:1–8.
25. Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W. LightGBM: a highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems; 2017;30.
26. van der Maaten LJP, Hinton GE. Visualizing data using t-SNE. J Mac Lear Res. 2008;9:2579–605.
27. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2(1):56–67. pmid:32607472
28. Xu J, Du X, Zhang S, Xiang Q, Zhu L, Liu L. The accuracy of four formulas for LDL-C calculation at the fasting and postprandial states. Front Cardiovasc Med. 2022;9:944003. Epub 2022/08/18 pmid:36061569; PMCID: PMC9433804
29. Ye Q, Meng X. Highly efficient authentication of edible oils by FTIR spectroscopy coupled with chemometrics. Food Chem. 2022;385:132661. Epub 2022/03/08. pmid:35299015;
30. Shapaval V, Brandenburg J, Blomqvist J, Tafintseva V, Passoth V, Sandgren M, et al. Biochemical profiling, prediction of total lipid content and fatty acid profile in oleaginous yeasts by FTIR spectroscopy. Biotechnol Biofuels. 2019;12:140. Epub 2019/06/06. pmid:31178928; PMCID: PMC6551905
31. Mousa MAA, Wang Y, Antora SA, Al-Qurashi AD, Ibrahim OHM, He H-J, et al. An overview of recent advances and applications of FT-IR spectroscopy for quality, authenticity, and adulteration detection in edible oils. Crit Rev Food Sci Nutr. 2022;62(29):8009–27. Epub 2021/05/12. pmid:33977844
32. Bel’skaya LV, Sarf EA, Solomatin DV. Application of FTIR Spectroscopy for quantitative analysis of blood serum: a preliminary study. Diagnostics (Basel). 2021;11(12):2391. pmid:34943626
33. Portaccio M, Faramarzi B, Lepore M. Probing biochemical differences in lipid components of human cells by means of ATR-FTIR spectroscopy. Biophysica. 2023;3(3):524–38.
34. Huber M, Kepesidis KV, Voronina L, Božić M, Trubetskov M, Harbeck N, et al. Stability of person-specific blood-based infrared molecular fingerprints opens up prospects for health monitoring. Nat Commun. 2021;12(1):1511. Epub 2021/03/08. pmid:33686065; PMCID: PMC7940620
35. Jessen TE, Höskuldsson AT, Bjerrum PJ, Verder H, Sørensen L, Bratholm PS, et al. Simultaneous determination of glucose, triglycerides, urea, cholesterol, albumin and total protein in human plasma by Fourier transform infrared spectroscopy: direct clinical biochemistry without reagents. Clin Biochem. 2014;47(13–14):1306–12. Epub 2014/06/02. pmid:24943400;
36. Li Z, Lv H, Li T, Si G, Wang Q, Lv J, et al. Reagent-free simultaneous determination of glucose and cholesterol in whole blood by FTIR-ATR. Spectrochim Acta A Mol Biomol Spectrosc. 2017;178:192–7. Epub 2017/02/03. pmid:28187317
37. Steyn N, Muller Rossouw H, Pillay TS, Martins J. Comparability of calculated LDL-C with directly measured LDL-C in selected paediatric and adult cohorts. Clin Chim Acta. 2022;537:158–66. pmid:36265577
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025 Gau et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Background/Purpose
Dyslipidemia, a hallmark of metabolic syndrome (MetS), contributes to atherosclerotic and cardiometabolic disorders. Due to days-long analysis, current clinical procedures for cardiotoxic blood lipid monitoring are unmet. This study used AI-assisted attenuated total reflectance Fourier transform infrared (ATR-FTIR) spectroscopy to identify MetS and precisely quantify multiple blood lipid levels with a blood sample of 0.5 µl and the assaying time is approximately 10 minutes.
Methods
ATR-FTIR spectroscopy with 1738 data points in the spectral range of 4000–650 cm−1 was used to analyze the blood samples. An adaptive synthetic technique was used to establish a prevalence-balanced dataset. LDL-C, HDL-C, TG, VLDL-C, and cholesterol levels were defined as the predicted targets of lipid absorption profiles. Linear regression (LR), gradient boosting regression tree (GBT), and histogram-based gradient boosting regression tree (HGBTR) were used to train the models. Lipid profile value prediction was evaluated using R2 and MAE, whereas MetS prediction was evaluated using area under the ROC curve.
Results
A total of 150 blood samples from 25 individuals without MetS and 25 with MetS yielded 491 spectral measurements. In the regression models, HGBT best predicted the targets of TG, CHOL, HDL-C, LDL-C, and VLDL-C with R2 values of 0.854 (0.12), 0.684 (0.08), 0.758 (0.10), and 0.419 (0.11), respectively. The classification model with the greatest AUC was RF (0.978), followed by HGBT (0.972) and GBT (0.967).
Conclusion
The results of this study revealed that predicting MetS and determining blood lipid levels with high R2 values and limited errors are feasible for monitoring during therapy and intervention.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer