Content area

Abstract

Motivation

Nonparametric multivariate analysis has been widely used to identify variables associated with a dissimilarity matrix and to quantify their contribution. For very large studies (n5000) and many explanatory variables, existing software packages (e.g. adonis and adonis2 in vegan) are computationally intensive when conducting sequential multivariate analysis with permutations or bootstrapping. Moreover, for subjects from a complex sampling design, we need to adjust for sampling weights to derive an unbiased estimate.

Results

We implemented an R function fast.adonis to overcome these computational challenges in large-scale studies. fast.adonis generates results consistent with adonis/adonis2 but much faster. For complex sampling studies, fast.adonis integrates sampling weights algebraically to mimic the source population; thus, analysis can be completed very fast without requiring a large amount of memory.

Availability and implementation

fast.adonis is implemented using R and is publicly available at https://github.com/jennylsl/fast.adonis.

Supplementary information

Supplementary data are available at Bioinformatics Advances online.

Details

Title
fast.adonis: a computationally efficient non-parametric multivariate analysis of microbiome data for large-scale studies
Author
Li, Shilan 1   VIAFID ORCID Logo  ; Vogtmann, Emily 1 ; Graubard, Barry I 1 ; Gail, Mitchell H 1 ; Abnet, Christian C 1 ; Shi, Jianxin 1 

 Division of Cancer Epidemiology and Genetics, National Cancer Institute , Bethesda, MD 20892, USA 
Publication year
2022
Publication date
2022
Publisher
Oxford University Press
e-ISSN
26350041
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3192246544
Copyright
Published by Oxford University Press 2022.