A benchmark of batch-effect correction methods

Abstract

Background

Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge to batch-effect removal and data integration. With continued growth expected in scRNA-seq data, achieving effective batch integration with available computational resources is crucial. Here, we perform an in-depth benchmark study on available batch correction methods to determine the most suitable method for batch-effect removal.

Results

We compare 14 methods in terms of computational runtime, the ability to handle large datasets, and batch-effect correction efficacy while preserving cell type purity. Five scenarios are designed for the study: identical cell types with different technologies, non-identical cell types, multiple batches, big data, and simulated data. Performance is evaluated using four benchmarking metrics including kBET, LISI, ASW, and ARI. We also investigate the use of batch-corrected data to study differential gene expression.

Conclusion

Based on our results, Harmony, LIGER, and Seurat 3 are the recommended methods for batch integration. Due to its significantly shorter runtime, Harmony is recommended as the first method to try, with the other methods as viable alternatives.

Details

Title

A benchmark of batch-effect correction methods for single-cell RNA sequencing data

Author

Hoa Thi Nhu Tran; Ang, Kok Siong; Chevrier, Marion; Zhang, Xiaomeng; Shin Lee, Nicole Yee; Goh, Michelle; Chen, Jinmiao

Pages

1-32

Section

Research

Publication year

2020

Publication date

2020

Publisher

BioMed Central

ISSN

14747596

e-ISSN

1474760X

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1186/s13059-019-1850-9

ProQuest document ID

2341178198

© 2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

A benchmark of batch-effect correction methods for single-cell RNA sequencing data

Jump to:

Abstract

Details

Suggested sources