Abstract

Background

The widespread use of next-generation sequencing has identified an important role for somatic mosaicism in many diseases. However, detecting low-level mosaic variants from next-generation sequencing data remains challenging.

Results

Here, we present a method for Position-Based Variant Identification (PBVI) that uses empirically-derived distributions of alternate nucleotides from a control dataset. We modeled this approach on 11 segmental overgrowth genes. We show that this method improves detection of single nucleotide mosaic variants of 0.01–0.05 variant allele fraction compared to other low-level variant callers. At depths of 600 × and 1200 ×, we observed > 85% and > 95% sensitivity, respectively. In a cohort of 26 individuals with somatic overgrowth disorders PBVI showed improved signal to noise, identifying pathogenic variants in 17 individuals.

Conclusion

PBVI can facilitate identification of low-level mosaic variants thus increasing the utility of next-generation sequencing data for research and diagnostic purposes.

Details

Title
Low-level variant calling for non-matched samples using a position-based and nucleotide-specific approach
Author
Dudley, Jeffrey N; Hong, Celine S; Hawari, Marwan A; Shwetar, Jasmine; Sapp, Julie C; Lack, Justin; Shiferaw, Henoke; Johnston, Jennifer J; Biesecker, Leslie G
Pages
1-17
Section
Methodology article
Publication year
2021
Publication date
2021
Publisher
Springer Nature B.V.
e-ISSN
14712105
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2514262927
Copyright
© 2021. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.