Abstract

We aimed to develop an efficient, flexible and scalable approach to diagnostic genome-wide sequence analysis of genetically heterogeneous clinical presentations. Here we present G2P (www.ebi.ac.uk/gene2phenotype) as an online system to establish, curate and distribute datasets for diagnostic variant filtering via association of allelic requirement and mutational consequence at a defined locus with phenotypic terms, confidence level and evidence links. An extension to Ensembl Variant Effect Predictor (VEP), VEP-G2P was used to filter both disease-associated and control whole exome sequence (WES) with Developmental Disorders G2P (G2PDD; 2044 entries). VEP-G2PDD shows a sensitivity/precision of 97.3%/33% for de novo and 81.6%/22.7% for inherited pathogenic genotypes respectively. Many of the missing genotypes are likely false-positive pathogenic assignments. The expected number and discriminative features of background genotypes are defined using control WES. Using only human genetic data VEP-G2P performs well compared to other freely-available diagnostic systems and future phenotypic matching capabilities should further enhance performance.

Diagnostic filtering is an important step to analyze the functional and clinical significance of the large number of genetic variants identified from next-generation genome sequencing data. Here, the authors develop a flexible and scalable system for diagnostic filtering of genetic variants using G2P with Ensembl VEP.

Details

Title
Flexible and scalable diagnostic filtering of genomic variants using G2P with Ensembl VEP
Author
Thormann Anja 1 ; Halachev Mihail 2 ; McLaren, William 3 ; Moore, David J 4 ; Svinti Victoria 5   VIAFID ORCID Logo  ; Campbell, Archie 6   VIAFID ORCID Logo  ; Kerr, Shona M 7 ; Tischkowitz Marc 8 ; Hunt, Sarah E 9   VIAFID ORCID Logo  ; Dunlop, Malcolm G 10   VIAFID ORCID Logo  ; Hurles, Matthew E 11 ; Wright, Caroline F 12   VIAFID ORCID Logo  ; Firth, Helen V 13 ; Cunningham, Fiona 9   VIAFID ORCID Logo  ; FitzPatrick, David R 14   VIAFID ORCID Logo 

 Wellcome Genome Campus, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK 
 MRC Institute of Genetics and Molecular Medicine at the University of Edinburgh, Edinburgh, UK (GRID:grid.4305.2) (ISNI:0000 0004 1936 7988); Western General Hospital, South East Scotland Regional Genetics Services, Edinburgh, UK (GRID:grid.417068.c) (ISNI:0000 0004 0624 9907) 
 Wellcome Genome Campus, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK (GRID:grid.417068.c) 
 Western General Hospital, South East Scotland Regional Genetics Services, Edinburgh, UK (GRID:grid.417068.c) (ISNI:0000 0004 0624 9907) 
 MRC Institute of Genetics and Molecular Medicine at the University of Edinburgh, Edinburgh, UK (GRID:grid.4305.2) (ISNI:0000 0004 1936 7988) 
 University of Edinburgh, Centre for Genomic and Experimental Medicine, Institute of Genetics & Molecular Medicine, Western General Hospital, Edinburgh, UK (GRID:grid.4305.2) (ISNI:0000 0004 1936 7988); The University of Edinburgh, Nine Edinburgh BioQuarter, Usher Institute for Population Health Sciences and Informatics, Edinburgh, UK (GRID:grid.4305.2) (ISNI:0000 0004 1936 7988) 
 University of Edinburgh, Centre for Genomic and Experimental Medicine, Institute of Genetics & Molecular Medicine, Western General Hospital, Edinburgh, UK (GRID:grid.4305.2) (ISNI:0000 0004 1936 7988) 
 Addenbrooke’s Hospital Cambridge University Hospitals, Clinical Genetic Department, Cambridge, UK (GRID:grid.24029.3d) (ISNI:0000 0004 0383 8386) 
 Wellcome Genome Campus, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK (GRID:grid.24029.3d) 
10  MRC Institute of Genetics and Molecular Medicine at the University of Edinburgh, Edinburgh, UK (GRID:grid.4305.2) (ISNI:0000 0004 1936 7988); University of Edinburgh, Edinburgh Cancer Research Centre, Institute of Genetics & Molecular Medicine, Western General Hospital, Edinburgh, UK (GRID:grid.4305.2) (ISNI:0000 0004 1936 7988) 
11  Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge, UK (GRID:grid.4305.2) 
12  University of Exeter Medical School, RILD Level 4, Royal Devon & Exeter Hospital, Exeter, UK (GRID:grid.4305.2) 
13  Addenbrooke’s Hospital Cambridge University Hospitals, Clinical Genetic Department, Cambridge, UK (GRID:grid.24029.3d) (ISNI:0000 0004 0383 8386); Wellcome Genome Campus, Wellcome Sanger Institute, Hinxton, Cambridge, UK (GRID:grid.24029.3d) 
14  MRC Institute of Genetics and Molecular Medicine at the University of Edinburgh, MRC Human Genetics Unit, Edinburgh, UK (GRID:grid.4305.2) (ISNI:0000 0004 1936 7988) 
Publication year
2019
Publication date
2019
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2232650750
Copyright
© The Author(s) 2019. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.