Abstract

Gut microbiome is gaining interest because of its links with several diseases, including colorectal cancer (CRC), as well as the possibility of being used to obtain non-intrusive predictive disease biomarkers. Here we performed a meta-analysis of 1042 fecal metagenomic samples from seven publicly available studies. We used an interpretable machine learning approach based on functional profiles, instead of the conventional taxonomic profiles, to produce a highly accurate predictor of CRC with better precision than those of previous proposals. Moreover, this approach is also able to discriminate samples with adenoma, which makes this approach very promising for CRC prevention by detecting early stages in which intervention is easier and more effective. In addition, interpretable machine learning methods allow extracting features relevant for the classification, which reveals basic molecular mechanisms accounting for the changes undergone by the microbiome functional landscape in the transition from healthy gut to adenoma and CRC conditions. Functional profiles have demonstrated superior accuracy in predicting CRC and adenoma conditions than taxonomic profiles and additionally, in a context of explainable machine learning, provide useful hints on the molecular mechanisms operating in the microbiota behind these conditions.

Details

Title
Towards a metagenomics machine learning interpretable model for understanding the transition from adenoma to colorectal cancer
Author
Casimiro-Soriguer, Carlos S 1 ; Loucera Carlos 2 ; Peña-Chilet María 3 ; Dopazo Joaquin 4 

 Hospital Virgen del Rocio, Clinical Bioinformatics Area, Fundación Progreso y Salud (FPS), Seville, Spain (GRID:grid.411109.c) (ISNI:0000 0000 9542 1158) 
 Hospital Virgen del Rocio, Clinical Bioinformatics Area, Fundación Progreso y Salud (FPS), Seville, Spain (GRID:grid.411109.c) (ISNI:0000 0000 9542 1158); Hospital Virgen del Rocio, Computational Systems Medicine, Institute of Biomedicine of Seville (IBIS), Seville, Spain (GRID:grid.411109.c) (ISNI:0000 0000 9542 1158) 
 Hospital Virgen del Rocio, Clinical Bioinformatics Area, Fundación Progreso y Salud (FPS), Seville, Spain (GRID:grid.411109.c) (ISNI:0000 0000 9542 1158); Hospital Virgen del Rocio, Computational Systems Medicine, Institute of Biomedicine of Seville (IBIS), Seville, Spain (GRID:grid.411109.c) (ISNI:0000 0000 9542 1158); FPS, Hospital Virgen del Rocio, Bioinformatics in Rare Diseases (BiER), Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), Seville, Spain (GRID:grid.411109.c) (ISNI:0000 0000 9542 1158) 
 Hospital Virgen del Rocio, Clinical Bioinformatics Area, Fundación Progreso y Salud (FPS), Seville, Spain (GRID:grid.411109.c) (ISNI:0000 0000 9542 1158); Hospital Virgen del Rocio, Computational Systems Medicine, Institute of Biomedicine of Seville (IBIS), Seville, Spain (GRID:grid.411109.c) (ISNI:0000 0000 9542 1158); FPS, Hospital Virgen del Rocio, Bioinformatics in Rare Diseases (BiER), Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), Seville, Spain (GRID:grid.411109.c) (ISNI:0000 0000 9542 1158); ELIXIR-ES (INB), FPS, Hospital Virgen del Rocio, Seville, Spain (GRID:grid.411109.c) (ISNI:0000 0000 9542 1158) 
Publication year
2022
Publication date
2022
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2618383529
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.