Abstract

Functional annotations have the potential to increase power of genome-wide association studies (GWAS) by prioritizing variants according to their biological function, but this potential has not been well studied. We comprehensively evaluated all 1132 traits in the UK Biobank whose SNP-heritability estimates were given “medium” or “high” labels by Neale’s lab. For each trait, we integrated GWAS summary statistics of close to 8 million common variants (minor allele frequency >1%) with either their 75 individual functional scores or their meta-scores, using three different data-integration methods. Overall, the number of new genome-wide significant findings after data-integration increases as a trait SNP-heritability estimate increases. However, there is a trade-off between new findings and loss of baseline GWAS findings, resulting in similar total numbers of significant findings between using GWAS alone and integrating GWAS with functional scores, across all 1132 traits analyzed and all three data-integration methods considered. Our findings suggest that, even with the current biobank-level sample size, more informative functional scores and/or new data-integration methods are needed to further improve the power of GWAS of common variants. For example, studying variants in coding sequence and obtaining cell-type-specific scores are potential future directions.

Details

Title
Integrating variant functional annotation scores have varied abilities to improve power of genome-wide association studies
Author
Gao, Jianhui 1 ; Espin-Garcia, Osvaldo 2 ; Paterson, Andrew D. 3 ; Sun, Lei 4 

 University of Toronto, Division of Biostatistics, Dalla Lana School of Public Health, Toronto, Canada (GRID:grid.17063.33) (ISNI:0000 0001 2157 2938) 
 University of Toronto, Division of Biostatistics, Dalla Lana School of Public Health, Toronto, Canada (GRID:grid.17063.33) (ISNI:0000 0001 2157 2938); University Health Network, Department of Biostatistics, Princess Margaret Cancer Centre, Toronto, Canada (GRID:grid.231844.8) (ISNI:0000 0004 0474 0428) 
 University of Toronto, Division of Biostatistics, Dalla Lana School of Public Health, Toronto, Canada (GRID:grid.17063.33) (ISNI:0000 0001 2157 2938); Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Canada (GRID:grid.42327.30) (ISNI:0000 0004 0473 9646) 
 University of Toronto, Division of Biostatistics, Dalla Lana School of Public Health, Toronto, Canada (GRID:grid.17063.33) (ISNI:0000 0001 2157 2938); University of Toronto, Department of Statistical Sciences, Faculty of Arts and Science, Toronto, Canada (GRID:grid.17063.33) (ISNI:0000 0001 2157 2938) 
Publication year
2022
Publication date
2022
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2680440920
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.