Abstract

The recently developed droplet-based single-cell transcriptome sequencing (scRNA-seq) technology makes it feasible to perform a population-scale scRNA-seq study, in which the transcriptome is measured for tens of thousands of single cells from multiple individuals. Despite the advances of many clustering methods, there are few tailored methods for population-scale scRNA-seq studies. Here, we develop a Bayesian mixture model for single-cell sequencing (BAMM-SC) method to cluster scRNA-seq data from multiple individuals simultaneously. BAMM-SC takes raw count data as input and accounts for data heterogeneity and batch effect among multiple individuals in a unified Bayesian hierarchical model framework. Results from extensive simulation studies and applications of BAMM-SC to in-house experimental scRNA-seq datasets using blood, lung and skin cells from humans or mice demonstrate that BAMM-SC outperformed existing clustering methods with considerable improved clustering accuracy, particularly in the presence of heterogeneity among individuals.

With the development of large scale single cell RNA-seq technology, population-scale scRNA-seq studies are emerging. Here, the authors develop BAMM-SC, a tool for clustering droplet-based scRNA-seq data from multiple individuals simultaneously.

Details

Title
A Bayesian mixture model for clustering droplet-based single-cell transcriptomic data from population studies
Author
Sun, Zhe 1 ; Chen, Li 2 ; Xin Hongyi 3 ; Jiang, Yale 4 ; Huang Qianhui 5 ; Cillo, Anthony R 6 ; Tabib Tracy 7 ; Kolls, Jay K 8 ; Bruno, Tullia C 9 ; Lafyatis, Robert 7 ; Vignali Dario A A 10   VIAFID ORCID Logo  ; Chen, Kong 11 ; Ding, Ying 1 ; Hu, Ming 12   VIAFID ORCID Logo  ; Chen, Wei 13   VIAFID ORCID Logo 

 University of Pittsburgh, Department of Biostatistics, Graduate School of Public Health, Pittsburgh, USA (GRID:grid.21925.3d) (ISNI:0000 0004 1936 9000) 
 Harrison School of Pharmacy, Auburn University, Department of Health Outcomes Research and Policy, Auburn, USA (GRID:grid.252546.2) (ISNI:0000 0001 2297 8753) 
 Children’s Hospital of Pittsburgh of UPMC, University of Pittsburgh, Division of Pulmonary Medicine, Department of Pediatrics, Pittsburgh, USA (GRID:grid.252546.2) 
 Children’s Hospital of Pittsburgh of UPMC, University of Pittsburgh, Division of Pulmonary Medicine, Department of Pediatrics, Pittsburgh, USA (GRID:grid.252546.2); Tsinghua University, School of Medicine, Beijing, China (GRID:grid.12527.33) (ISNI:0000 0001 0662 3178) 
 University of Michigan, Department of Biostatistics, School of Public Health, Ann Arbor, USA (GRID:grid.214458.e) (ISNI:0000000086837370) 
 University of Pittsburgh, Department of Immunology, School of Medicine, Pittsburgh, USA (GRID:grid.21925.3d) (ISNI:0000 0004 1936 9000) 
 University of Pittsburgh, Division of Rheumatology and Clinical Immunology, Department of Medicine, School of Medicine, Pittsburgh, USA (GRID:grid.21925.3d) (ISNI:0000 0004 1936 9000) 
 Tulane University, School of Medicine, New Orleans, USA (GRID:grid.265219.b) (ISNI:0000 0001 2217 8588) 
 University of Pittsburgh, Department of Immunology, School of Medicine, Pittsburgh, USA (GRID:grid.21925.3d) (ISNI:0000 0004 1936 9000); UPMC Hillman Cancer Center, Tumor Microenvironment Center, Pittsburgh, USA (GRID:grid.478063.e) (ISNI:0000 0004 0456 9819) 
10  University of Pittsburgh, Department of Immunology, School of Medicine, Pittsburgh, USA (GRID:grid.21925.3d) (ISNI:0000 0004 1936 9000); UPMC Hillman Cancer Center, Tumor Microenvironment Center, Pittsburgh, USA (GRID:grid.478063.e) (ISNI:0000 0004 0456 9819); UPMC Hillman Cancer Center, Cancer Immunology and Immunotherapy Program, Pittsburgh, USA (GRID:grid.478063.e) (ISNI:0000 0004 0456 9819) 
11  University of Pittsburgh, Division of Pulmonary, Allergy and Critical Care Medicine, Department of Medicine, School of Medicine, Pittsburgh, USA (GRID:grid.21925.3d) (ISNI:0000 0004 1936 9000) 
12  Lerner Research Institute, Cleveland Clinic Foundation, Department of Quantitative Health Sciences, Cleveland, USA (GRID:grid.239578.2) (ISNI:0000 0001 0675 4725) 
13  University of Pittsburgh, Department of Biostatistics, Graduate School of Public Health, Pittsburgh, USA (GRID:grid.21925.3d) (ISNI:0000 0004 1936 9000); Children’s Hospital of Pittsburgh of UPMC, University of Pittsburgh, Division of Pulmonary Medicine, Department of Pediatrics, Pittsburgh, USA (GRID:grid.21925.3d) 
Publication year
2019
Publication date
2019
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2206213052
Copyright
© The Author(s) 2019. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.