Abstract

Microbial interactions can lead to different colonization outcomes of exogenous species, be they pathogenic or beneficial in nature. Predicting the colonization of exogenous species in complex communities remains a fundamental challenge in microbial ecology, mainly due to our limited knowledge of the diverse mechanisms governing microbial dynamics. Here, we propose a data-driven approach independent of any dynamics model to predict colonization outcomes of exogenous species from the baseline compositions of microbial communities. We systematically validate this approach using synthetic data, finding that machine learning models can predict not only the binary colonization outcome but also the post-invasion steady-state abundance of the invading species. Then we conduct colonization experiments for commensal gut bacteria species Enterococcus faecium and Akkermansia muciniphila in hundreds of human stool-derived in vitro microbial communities, confirming that the data-driven approaches can predict the colonization outcomes in experiments. Furthermore, we find that while most resident species are predicted to have a weak negative impact on the colonization of exogenous species, strongly interacting species could significantly alter the colonization outcomes, e.g., Enterococcus faecalis inhibits the invasion of E. faecium invasion. The presented results suggest that the data-driven approaches are powerful tools to inform the ecology and management of microbial communities.

Predicting the colonization of exogenous species in complex communities is a challenge in ecology. Here, the authors propose a data-driven approach to predict colonization outcomes and perform validation experiments in human gut microbial communities.

Details

Title
Data-driven prediction of colonization outcomes for complex microbial communities
Author
Wu, Lu 1   VIAFID ORCID Logo  ; Wang, Xu-Wen 2   VIAFID ORCID Logo  ; Tao, Zining 3 ; Wang, Tong 2   VIAFID ORCID Logo  ; Zuo, Wenlong 1   VIAFID ORCID Logo  ; Zeng, Yu 1 ; Liu, Yang-Yu 4   VIAFID ORCID Logo  ; Dai, Lei 5   VIAFID ORCID Logo 

 Chinese Academy of Sciences, CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Shenzhen, China (GRID:grid.9227.e) (ISNI:0000000119573309) 
 Brigham and Women’s Hospital and Harvard Medical School, Channing Division of Network Medicine, Department of Medicine, Boston, USA (GRID:grid.62560.37) (ISNI:0000 0004 0378 8294) 
 Chinese Academy of Sciences, CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Shenzhen, China (GRID:grid.9227.e) (ISNI:0000000119573309); Shandong Agricultural University, Tai’an, China (GRID:grid.440622.6) (ISNI:0000 0000 9482 4676) 
 Brigham and Women’s Hospital and Harvard Medical School, Channing Division of Network Medicine, Department of Medicine, Boston, USA (GRID:grid.62560.37) (ISNI:0000 0004 0378 8294); University of Illinois at Urbana-Champaign, Center for Artificial Intelligence and Modeling, The Carl R. Woese Institute for Genomic Biology, Champaign, USA (GRID:grid.35403.31) (ISNI:0000 0004 1936 9991) 
 Chinese Academy of Sciences, CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Shenzhen, China (GRID:grid.9227.e) (ISNI:0000000119573309); University of Chinese Academy of Sciences, Beijing, China (GRID:grid.410726.6) (ISNI:0000 0004 1797 8419) 
Pages
2406
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2957802404
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.