Abstract

Identifying pathogenic variants and underlying functional alterations is challenging. To this end, we introduce MutPred2, a tool that improves the prioritization of pathogenic amino acid substitutions over existing methods, generates molecular mechanisms potentially causative of disease, and returns interpretable pathogenicity score distributions on individual genomes. Whilst its prioritization performance is state-of-the-art, a distinguishing feature of MutPred2 is the probabilistic modeling of variant impact on specific aspects of protein structure and function that can serve to guide experimental studies of phenotype-altering variants. We demonstrate the utility of MutPred2 in the identification of the structural and functional mutational signatures relevant to Mendelian disorders and the prioritization of de novo mutations associated with complex neurodevelopmental disorders. We then experimentally validate the functional impact of several variants identified in patients with such disorders. We argue that mechanism-driven studies of human inherited disease have the potential to significantly accelerate the discovery of clinically actionable variants.

Identifying variants capable of causing genetic disease is challenging. The authors use semisupervised learning to predict pathogenic missense variants and their impacts on protein structure and function, enabling a molecular mechanism-driven approach to studying different types of human disease.

Details

Title
Inferring the molecular and phenotypic impact of amino acid variants with MutPred2
Author
Pejaver Vikas 1 ; Urresti Jorge 2 ; Lugo-Martinez, Jose 3   VIAFID ORCID Logo  ; Pagel Kymberleigh A 4 ; Lin Guan Ning 5   VIAFID ORCID Logo  ; Hyun-Jun, Nam 2 ; Mort, Matthew 6 ; Cooper, David N 6   VIAFID ORCID Logo  ; Sebat, Jonathan 7   VIAFID ORCID Logo  ; Iakoucheva Lilia M 2   VIAFID ORCID Logo  ; Mooney, Sean D 8 ; Radivojac Predrag 9   VIAFID ORCID Logo 

 Indiana University, Department of Computer Science, Bloomington, USA (GRID:grid.411377.7) (ISNI:0000 0001 0790 959X); University of Washington, Department of Biomedical Informatics and Medical Education, Seattle, USA (GRID:grid.34477.33) (ISNI:0000000122986657) 
 University of California San Diego, Department of Psychiatry, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242) 
 Indiana University, Department of Computer Science, Bloomington, USA (GRID:grid.411377.7) (ISNI:0000 0001 0790 959X); Carnegie Mellon University, Computational Biology Department, School of Computer Science, Pittsburgh, USA (GRID:grid.147455.6) (ISNI:0000 0001 2097 0344) 
 Indiana University, Department of Computer Science, Bloomington, USA (GRID:grid.411377.7) (ISNI:0000 0001 0790 959X); Johns Hopkins University, Institute for Computational Medicine, Whiting School of Engineering, Baltimore, USA (GRID:grid.21107.35) (ISNI:0000 0001 2171 9311) 
 University of California San Diego, Department of Psychiatry, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242); Shanghai Jiao Tong University, School of Biomedical Engineering, Shanghai, People’s Republic of China (GRID:grid.16821.3c) (ISNI:0000 0004 0368 8293) 
 Cardiff University, Institute of Medical Genetics, School of Medicine, Cardiff, UK (GRID:grid.5600.3) (ISNI:0000 0001 0807 5670) 
 University of California San Diego, Department of Psychiatry, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242); University of California San Diego, Beyster Center for Genomics of Psychiatric Diseases, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242); University of California San Diego, Department of Cellular and Molecular Medicine, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242) 
 University of Washington, Department of Biomedical Informatics and Medical Education, Seattle, USA (GRID:grid.34477.33) (ISNI:0000000122986657) 
 Indiana University, Department of Computer Science, Bloomington, USA (GRID:grid.411377.7) (ISNI:0000 0001 0790 959X); Northeastern University, Khoury College of Computer Sciences, Boston, USA (GRID:grid.261112.7) (ISNI:0000 0001 2173 3359) 
Publication year
2020
Publication date
2020
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2473249893
Copyright
© The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.