Abstract

The overwhelming complexity of microbiomes makes it difficult to decipher functional relationships between specific microbes and ecosystem properties. While machine learning analyses have demonstrated an impressive ability to correlate microbial community composition with macroscopic functions, mechanisms that dictate model predictions are often unknown, and predictions often lack an assigned metric of uncertainty. In this study, we apply Bayesian networks to build on prior feature selection analyses and construct easy-to-interpret probabilistic models, which accurately predict levels of dissolved organic carbon (DOC) from the relative abundance of soil bacteria (16S rRNA gene profiles). In addition to standard cross-validation, we show that a Bayesian network model trained using samples from a pine litter decomposition study accurately predicts DOC of samples from an independent oak litter decomposition study, suggesting that mechanisms driving variation in soil carbon storage may be conserved across different types of decomposing plant litter. Furthermore, the structure of the resulting Bayesian network model defines a minimal set of highly informative taxa, whose abundances directly constrain the probability of high or low DOC conditions. Significant accuracy of the Bayesian network model with independent data sets supports the validity of the identified relationships between taxa abundance and DOC.

Competing Interest Statement

The authors have declared no competing interest.

Details

Title
Probabilistic Ranking Of Microbiomes with Taxa Selection to discover and validate microbiome function models for multiple litter decomposition studies
Author
Thompson, Jaron; Lubbers, Nicholas E; Kroeger, Marie E; Devan, Rae; Johansen, Renee; Dunbar, John; Munsky, Brian
University/institution
Cold Spring Harbor Laboratory Press
Section
New Results
Publication year
2020
Publication date
Jul 17, 2020
Publisher
Cold Spring Harbor Laboratory Press
ISSN
2692-8205
Source type
Working Paper
Language of publication
English
ProQuest document ID
2424571530
Copyright
© 2020. This article is published under http://creativecommons.org/licenses/by-nd/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.