Abstract

Recent advances in single-cell technologies and integration algorithms make it possible to construct comprehensive reference atlases encompassing many donors, studies, disease states, and sequencing platforms. Much like mapping sequencing reads to a reference genome, it is essential to be able to map query cells onto complex, multimillion-cell reference atlases to rapidly identify relevant cell states and phenotypes. We present Symphony (https://github.com/immunogenomics/symphony), an algorithm for building large-scale, integrated reference atlases in a convenient, portable format that enables efficient query mapping within seconds. Symphony localizes query cells within a stable low-dimensional reference embedding, facilitating reproducible downstream transfer of reference-defined annotations to the query. We demonstrate the power of Symphony in multiple real-world datasets, including (1) mapping a multi-donor, multi-species query to predict pancreatic cell types, (2) localizing query cells along a developmental trajectory of fetal liver hematopoiesis, and (3) inferring surface protein expression with a multimodal CITE-seq atlas of memory T cells.

The number of single-cell RNA-seq datasets generated is increasing rapidly, making methods that map cell types to well-curated references increasingly important. Here, the authors propose an accurate method for mapping single cells onto a reference atlas in seconds.

Details

Title
Efficient and precise single-cell reference atlas mapping with Symphony
Author
Kang, Joyce B 1   VIAFID ORCID Logo  ; Nathan, Aparna 1   VIAFID ORCID Logo  ; Weinand, Kathryn 1 ; Zhang, Fan 1   VIAFID ORCID Logo  ; Millard Nghia 1 ; Rumker Laurie 1 ; Branch, Moody D 2 ; Korsunsky Ilya 1 ; Raychaudhuri Soumya 3   VIAFID ORCID Logo 

 Center for Data Sciences, Brigham and Women’s Hospital, Boston, USA (GRID:grid.62560.37) (ISNI:0000 0004 0378 8294); Brigham and Women’s Hospital and Harvard Medical School, Division of Genetics, Department of Medicine, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X); Brigham and Women’s Hospital and Harvard Medical School, Division of Rheumatology, Inflammation, and Immunity, Department of Medicine, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X); Harvard Medical School, Department of Biomedical Informatics, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X); Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, USA (GRID:grid.66859.34) 
 Brigham and Women’s Hospital and Harvard Medical School, Division of Rheumatology, Inflammation, and Immunity, Department of Medicine, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X) 
 Center for Data Sciences, Brigham and Women’s Hospital, Boston, USA (GRID:grid.62560.37) (ISNI:0000 0004 0378 8294); Brigham and Women’s Hospital and Harvard Medical School, Division of Genetics, Department of Medicine, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X); Brigham and Women’s Hospital and Harvard Medical School, Division of Rheumatology, Inflammation, and Immunity, Department of Medicine, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X); Harvard Medical School, Department of Biomedical Informatics, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X); Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, USA (GRID:grid.66859.34); Versus Arthritis Centre for Genetics and Genomics, Centre for Musculoskeletal Research, Manchester Academic Health Science Centre, The University of Manchester, Manchester, UK (GRID:grid.5379.8) (ISNI:0000000121662407) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2579868284
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.