Abstract

Genetic differences inferred from sequencing reads can be used for demultiplexing of pooled single-cell RNA-seq (scRNA-seq) data across multiple donors without WGS-based reference genotypes. However, such methods could not be directly applied to single-cell ATAC-seq (scATAC-seq) data owing to the lower read coverage for each variant compared to scRNA-seq. We propose a new software, scATAC-seq Variant-based EstimatioN for GEnotype ReSolving (scAVENGERS), which resolves this issue by calling more individual-specific germline variants and using an optimized mixture model for the scATAC-seq. The benchmark conducted with three synthetic multiplexed scATAC-seq datasets of peripheral blood mononuclear cells and prefrontal cortex tissues showed outstanding performance compared to existing methods in terms of accuracy, doublet detection, and a portion of donor-assigned cells. Furthermore, analyzing the effect of the improved sections provided insight into handling pooled single-cell data in the future. Our source code of the devised software is available at GitHub: https://github.com/kaistcbfg/scAVENGERS.

Details

Title
scAVENGERS: a genotype-based deconvolution of individuals in multiplexed single-cell ATAC-seq data without reference genotypes
Author
Han, Seungbeom 1 ; Kim, Kyukwang 1 ; Park, Seongwan 1 ; Lee, Andrew J 1 ; Chun, Hyonho 2 ; Jung, Inkyung 1   VIAFID ORCID Logo 

 Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST) , Daejeon  34141, Republic of Korea 
 Department of Mathematical Sciences, Korea Advanced Institute of Science and Technology (KAIST) , Daejeon  34141, Republic of Korea 
Publication year
2022
Publication date
Dec 2022
Publisher
Oxford University Press
e-ISSN
26319268
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3170909811
Copyright
© The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. This work is published under https://creativecommons.org/licenses/by-nc/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.