Full Text

Turn on search term navigation

© 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Microbial strains of variable functional capacities coexist in microbiomes. Current bioinformatics methods of strain analysis cannot provide the direct linkage between strain composition and their gene contents from metagenomic data. Here we present Strain‐level Pangenome Decomposition Analysis (StrainPanDA), a novel method that uses the pangenome coverage profile of multiple metagenomic samples to simultaneously reconstruct the composition and gene content variation of coexisting strains in microbial communities. We systematically validate the accuracy and robustness of StrainPanDA using synthetic data sets. To demonstrate the power of gene‐centric strain profiling, we then apply StrainPanDA to analyze the gut microbiome samples of infants, as well as patients treated with fecal microbiota transplantation. We show that the linked reconstruction of strain composition and gene content profiles is critical for understanding the relationship between microbial adaptation and strain‐specific functions (e.g., nutrient utilization and pathogenicity). Finally, StrainPanDA has minimal requirements for computing resources and can be scaled to process multiple species in a community in parallel. In short, StrainPanDA can be applied to metagenomic data sets to detect the association between molecular functions and microbial/host phenotypes to formulate testable hypotheses and gain novel biological insights at the strain or subspecies level.

Details

Title
StrainPanDA: Linked reconstruction of strain composition and gene content profiles via pangenome‐based decomposition of metagenomic data
Author
Hu, Han 1 ; Tan, Yuxiang 2 ; Li, Chenhao 3 ; Chen, Junyu 2 ; Kou, Yan 1 ; Xu, Zhenjiang Zech 4 ; Liu, Yang‐Yu 5 ; Tan, Yan 1 ; Dai, Lei 2   VIAFID ORCID Logo 

 Bioinformatics Department, Xbiome, Scientific Research Building, Tsinghua High‐Tech Park, Shenzhen, China 
 CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China 
 Center for Computational and Integrative Biology, Massachusetts General Hospital and Harvard Medical School, Richard B. Simches Research Center, Boston, Massachusetts, USA 
 Department of Food Science and Technology, State Key Laboratory of Food Science and Technology, Nanchang University, Nanchang, China 
 Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA 
Section
RESEARCH ARTICLES
Publication year
2022
Publication date
Sep 1, 2022
Publisher
John Wiley & Sons, Inc.
ISSN
27705986
e-ISSN
2770596X
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3090606075
Copyright
© 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.