Abstract

Alzheimer’s disease (AD) is a complex genetic disease, and variants identified through genome-wide association studies (GWAS) explain only part of its heritability. Epistasis has been proposed as a major contributor to this ‘missing heritability’, however, many current methods are limited to only modelling additive effects. We use VariantSpark, a machine learning approach to GWAS, and BitEpi, a tool for epistasis detection, to identify AD associated variants and interactions across two independent cohorts, ADNI and UK Biobank. By incorporating significant epistatic interactions, we captured 10.41% more phenotypic variance than logistic regression (LR). We validate the well-established AD loci, APOE, and identify two novel genome-wide significant AD associated loci in both cohorts, SH3BP4 and SASH1, which are also in significant epistatic interactions with APOE. We show that the SH3BP4 SNP has a modulating effect on the known pathogenic APOE SNP, demonstrating a possible protective mechanism against AD. SASH1 is involved in a triplet interaction with pathogenic APOE SNP and ACOT11, where the SASH1 SNP lowered the pathogenic interaction effect between ACOT11 and APOE. Finally, we demonstrate that VariantSpark detects disease associations with 80% fewer controls than LR, unlocking discoveries in well annotated but smaller cohorts.

Details

Title
Novel Alzheimer’s disease genes and epistasis identified using machine learning GWAS platform
Author
Lundberg, Mischa 1 ; Sng, Letitia M. F. 2 ; Szul, Piotr 3 ; Dunne, Rob 4 ; Bayat, Arash 5 ; Burnham, Samantha C. 6 ; Bauer, Denis C. 7 ; Twine, Natalie A. 8 

 Commonwealth Scientific and Industrial Research Organisation, Transformational Bioinformatics, Sydney, Australia (GRID:grid.1016.6) (ISNI:0000 0001 2173 2719); The University of Queensland, UQ Frazer Institute, Woolloongabba, Australia (GRID:grid.1003.2) (ISNI:0000 0000 9320 7537); The University of Queensland, Institute for Molecular Bioscience, St Lucia, Australia (GRID:grid.1003.2) (ISNI:0000 0000 9320 7537) 
 Commonwealth Scientific and Industrial Research Organisation, Transformational Bioinformatics, Sydney, Australia (GRID:grid.1016.6) (ISNI:0000 0001 2173 2719) 
 Commonwealth Scientific and Industrial Research Organisation AU, Health Data Semantics and Interoperability, Brisbane, Australia (GRID:grid.1016.6) (ISNI:0000 0001 2173 2719) 
 Commonwealth Scientific and Industrial Research Organisation, Data61, Brisbane, Australia (GRID:grid.1016.6) (ISNI:0000 0001 2173 2719) 
 Garvan Institute of Medical Research, The Kinghorn Cancer Center (KCCG), Sydney, Australia (GRID:grid.415306.5) (ISNI:0000 0000 9983 6924) 
 CSIRO, Biomedical Imaging Group, Brisbane, Australia (GRID:grid.516269.f) 
 Commonwealth Scientific and Industrial Research Organisation, Transformational Bioinformatics, Sydney, Australia (GRID:grid.1016.6) (ISNI:0000 0001 2173 2719); Macquarie University, Department of Biomedical Sciences, Faculty of Medicine and Health Science, Macquarie Park, Australia (GRID:grid.1004.5) (ISNI:0000 0001 2158 5405); Macquarie University, Applied BioSciences, Faculty of Science and Engineering, Macquarie Park, Australia (GRID:grid.1004.5) (ISNI:0000 0001 2158 5405) 
 Commonwealth Scientific and Industrial Research Organisation, Transformational Bioinformatics, Sydney, Australia (GRID:grid.1016.6) (ISNI:0000 0001 2173 2719); Macquarie University, Applied BioSciences, Faculty of Science and Engineering, Macquarie Park, Australia (GRID:grid.1004.5) (ISNI:0000 0001 2158 5405) 
Pages
17662
Publication year
2023
Publication date
2023
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2878154955
Copyright
© The Author(s) 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.