Full text

Turn on search term navigation

© 2025 by the authors. Published by MDPI on behalf of the Lithuanian University of Health Sciences. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Background and Objectives: Liver cancer ranks among the leading causes of cancer-related mortality, necessitating the development of novel diagnostic methods. Deregulated lipid metabolism, a hallmark of hepatocarcinogenesis, offers compelling prospects for biomarker identification. This study aims to employ explainable artificial intelligence (XAI) to identify lipidomic biomarkers for liver cancer and to develop a robust predictive model for early diagnosis. Materials and Methods: This study included 219 patients diagnosed with liver cancer and 219 healthy controls. Serum samples underwent untargeted lipidomic analysis with LC-QTOF-MS. Lipidomic data underwent univariate and multivariate analyses, including fold change (FC), t-tests, PLS-DA, and Elastic Network feature selection, to identify significant biomarker candidate lipids. Machine learning models (AdaBoost, Random Forest, Gradient Boosting) were developed and evaluated utilizing these biomarkers to differentiate liver cancer. The AUC metric was employed to identify the optimal predictive model, whereas SHAP was utilized to achieve interpretability of the model’s predictive decisions. Results: Notable alterations in lipid profiles were observed: decreased sphingomyelins (SM d39:2, SM d41:2) and increased fatty acids (FA 14:1, FA 22:2) and phosphatidylcholines (PC 34:1, PC 32:1). AdaBoost exhibited a superior classification performance, achieving an AUC of 0.875. SHAP identified PC 40:4 as the most efficacious lipid for model predictions. The SM d41:2 and SM d36:3 lipids were specifically associated with an increased risk of low-onset cancer and elevated levels of the PC 40:4 lipid. Conclusions: This study demonstrates that untargeted lipidomics, in conjunction with explainable artificial intelligence (XAI) and machine learning, may effectively identify biomarkers for the early detection of liver cancer. The results suggest that alterations in lipid metabolism are crucial to the progression of liver cancer and provide valuable insights for incorporating lipidomics into precision oncology.

Details

Title
Untargeted Lipidomic Biomarkers for Liver Cancer Diagnosis: A Tree-Based Machine Learning Model Enhanced by Explainable Artificial Intelligence
Author
Colak, Cemil 1   VIAFID ORCID Logo  ; Fatma Hilal Yagin 1   VIAFID ORCID Logo  ; Algarni, Abdulmohsen 2   VIAFID ORCID Logo  ; Algarni, Ali 3   VIAFID ORCID Logo  ; Al-Hashem, Fahaid 4   VIAFID ORCID Logo  ; Ardigò, Luca Paolo 5   VIAFID ORCID Logo 

 Department of Biostatistics, and Medical Informatics, Faculty of Medicine, Inonu University, 44280 Malatya, Turkey; [email protected] 
 Department of Computer Science, King Khalid University, Abha 61421, Saudi Arabia; [email protected] 
 Department of Informatics and Computer Systems, College of Computer Science, King Khalid University, Abha 61421, Saudi Arabia; [email protected] 
 Department of Physiology, College of Medicine, King Khalid University, Abha 61421, Saudi Arabia; [email protected] 
 Department of Teacher Education, NLA University College, Linstows Gate 3, 0166 Oslo, Norway 
First page
405
Publication year
2025
Publication date
2025
Publisher
MDPI AG
ISSN
1010660X
e-ISSN
16489144
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3181605841
Copyright
© 2025 by the authors. Published by MDPI on behalf of the Lithuanian University of Health Sciences. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.