Abstract

Summary statistics from genome-wide association studies (GWAS) have facilitated the development of various summary data-based methods, which typically require a reference sample for linkage disequilibrium (LD) estimation. Analyses using these methods may be biased by errors in GWAS summary data or LD reference or heterogeneity between GWAS and LD reference. Here we propose a quality control method, DENTIST, that leverages LD among genetic variants to detect and eliminate errors in GWAS or LD reference and heterogeneity between the two. Through simulations, we demonstrate that DENTIST substantially reduces false-positive rate in detecting secondary signals in the summary-data-based conditional and joint association analysis, especially for imputed rare variants (false-positive rate reduced from >28% to <2% in the presence of heterogeneity between GWAS and LD reference). We further show that DENTIST can improve other summary-data-based analyses such as fine-mapping analysis.

Analyses of summary statistics from GWAS are subject to biases due to errors in the discovery GWAS or linkage disequilibrium reference data set or heterogeneity between data sets. Here, the authors propose a quality control method to be added to analysis of GWAS summary data that can reduce such biases.

Details

Title
Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors
Author
Chen, Wenhan 1 ; Wu, Yang 2   VIAFID ORCID Logo  ; Zheng Zhili 2 ; Ting, Qi 3 ; Visscher, Peter M 2   VIAFID ORCID Logo  ; Zhu, Zhihong 2 ; Yang, Jian 3   VIAFID ORCID Logo 

 The University of Queensland, Institute for Molecular Bioscience, Brisbane, Australia (GRID:grid.1003.2) (ISNI:0000 0000 9320 7537); Garvan Institute of Medical Research, Epigenetics Research Laboratory, Genomics and Epigenetics Theme, Sydney, Australia (GRID:grid.415306.5) (ISNI:0000 0000 9983 6924) 
 The University of Queensland, Institute for Molecular Bioscience, Brisbane, Australia (GRID:grid.1003.2) (ISNI:0000 0000 9320 7537) 
 The University of Queensland, Institute for Molecular Bioscience, Brisbane, Australia (GRID:grid.1003.2) (ISNI:0000 0000 9320 7537); Westlake University, School of Life Sciences, Hangzhou, China (GRID:grid.494629.4) (ISNI:0000 0004 8008 9315); Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China (GRID:grid.494629.4) (ISNI:0000 0004 8008 9315) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2607919265
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.