Abstract

When interpreting sequencing data from multiple spatial or longitudinal biopsies, detecting sample mix-ups is essential yet more difficult than in studies of germline variation. In most genomic studies of tumors, genetic variation is detected through pairwise comparisons of the tumor and a matched normal tissue from the same donor, and in many cases, only somatic variants are reported. The resulting disjoint genotype information hinders the use of existing tools, which detect sample swaps solely from genotypes of germline variants. To address this problem, we have developed somalier, which operates directly on the alignments and therefore does not require jointly called germline variants. Instead, somalier extracts a small sketch of informative genetic variation for each sample. Sketches from hundreds of biopsies and normal tissues can then be compared in under a second. This speed also makes somalier useful for checking relatedness in large cohorts of germline samples. Somalier produces both text output and an interactive visual report that facilitates the detection and correction of sample swaps using multiple relatedness metrics. We introduce the tool and demonstrate its utility on a cohort of five glioma samples, each with a normal, tumor, and cell-free DNA sample. Applying somalier to high-coverage sequence data from the 1000 Genomes Project also identifies several related samples. Somalier can be applied to diverse sequencing data types and genome builds, and is freely available for academic use at github.com/brentp/somalier.
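
To illustrate the idea of comparing per-sample sketches, the following is a minimal sketch in Python and not somalier's actual implementation. It assumes each sketch is a list of genotypes at the same ordered set of sites, coded 0 (homozygous reference), 1 (heterozygous), and 2 (homozygous alternate), with -1 as a hypothetical code for a missing call. The function name `compare_sketches` and the relatedness formula (shared heterozygous sites minus twice the IBS0 count, divided by the smaller heterozygote count) are assumptions chosen for illustration.

```python
from typing import Dict, Sequence

MISSING = -1  # hypothetical code for "no confident genotype at this site"


def compare_sketches(a: Sequence[int], b: Sequence[int]) -> Dict[str, float]:
    """Compare two genotype sketches coded 0 (hom-ref), 1 (het), 2 (hom-alt).

    Returns counts of usable sites, IBS0 sites (opposite homozygotes),
    IBS2 sites (identical genotypes), shared heterozygous sites, and an
    illustrative relatedness value.
    """
    if len(a) != len(b):
        raise ValueError("sketches must cover the same sites")

    n = ibs0 = ibs2 = shared_hets = hets_a = hets_b = 0
    for ga, gb in zip(a, b):
        # Skip sites where either sample lacks a usable genotype.
        if ga == MISSING or gb == MISSING:
            continue
        n += 1
        hets_a += ga == 1
        hets_b += gb == 1
        if ga == gb:
            ibs2 += 1
            shared_hets += ga == 1
        elif abs(ga - gb) == 2:
            # One sample is hom-ref and the other hom-alt: no shared allele.
            ibs0 += 1

    # Assumed estimator: near 1 for identical samples, near or below 0
    # for unrelated samples; undefined when a sample has no het sites.
    denom = min(hets_a, hets_b)
    relatedness = (shared_hets - 2 * ibs0) / denom if denom else float("nan")

    return {
        "n": n,
        "ibs0": ibs0,
        "ibs2": ibs2,
        "shared_hets": shared_hets,
        "relatedness": relatedness,
    }


if __name__ == "__main__":
    tumor = [0, 1, 2, 1, 1, 0]
    normal = [0, 1, 2, 1, 1, 0]
    swapped = [2, 1, 0, 0, 2, 1]
    print(compare_sketches(tumor, normal))
    print(compare_sketches(tumor, swapped))
```

Under these assumptions, a tumor and its matched normal should share nearly all genotypes and yield a relatedness close to 1, whereas a swapped sample shows many IBS0 sites and a value near or below 0. Because each comparison is a single pass over a short list, comparing many pairs stays cheap.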

Footnotes

* Add Joe to Author list.

* https://github.com/brentp/somalier

Details

Title
Somalier: rapid relatedness estimation for cancer and germline studies using efficient genome sketches
Author
Pedersen, Brent Stacey; Bhetariya, Preeti J; Brown, Joe; Marth, Gabor; Jensen, Randy; Bronner, Mary P; Underhill, Hunter R; Quinlan, Aaron R
University/institution
Cold Spring Harbor Laboratory Press
Section
New Results
Publication year
2019
Publication date
Nov 13, 2019
Publisher
Cold Spring Harbor Laboratory Press
ISSN
2692-8205
Source type
Working Paper
Language of publication
English
ProQuest document ID
2313808498
Copyright
© 2019. This article is published under http://creativecommons.org/licenses/by-nd/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.