Abstract

Comprehensive and accurate identification of structural variations (SVs) from next generation sequencing data remains a major challenge. We develop FusorSV, which uses a data mining approach to assess performance and merge callsets from an ensemble of SV-calling algorithms. It includes a fusion model built using analysis of 27 deep-coverage human genomes from the 1000 Genomes Project. We identify 843 novel SV calls that were not reported by the 1000 Genomes Project for these 27 samples. Experimental validation of a subset of these calls yields a validation rate of 86.7%. FusorSV is available at https://github.com/TheJacksonLaboratory/SVE.

Details

Title
FusorSV: an algorithm for optimally combining data from multiple structural variation detection methods
Author
Becker, Timothy; Wan-Ping, Lee; Leone, Joseph; Zhu, Qihui; Zhang, Chengsheng; Liu, Silvia; Sargent, Jack; Shanker, Kritika; Adam Mil-homens; Cerveira, Eliza; Mallory, Ryan; Cha, Jane; Navarro, Fabio C P; Galeev, Timur; Gerstein, Mark; Mills, Ryan E
Publication year
2018
Publication date
2018
Publisher
BioMed Central
ISSN
14747596
e-ISSN
1474760X
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2207136158
Copyright
© 2018. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.