Abstract

The large majority of variants identified by GWAS are non-coding, motivating detailed characterization of the function of non-coding variants. Experimental methods that assess variants’ effects on gene expression in their native chromatin context via direct perturbation are low-throughput. Existing high-throughput computational predictors have thus lacked large gold standard sets of regulatory variants for training and validation. Here, we leverage a set of 14,807 putative causal eQTLs in humans obtained through statistical fine-mapping, and we use 6,121 features to directly train a predictor of whether a variant modifies nearby gene expression. We call the resulting prediction the expression modifier score (EMS). We validate EMS by comparing its ability to prioritize functional variants with that of other major scores. We then use EMS as a prior for statistical fine-mapping of eQTLs to identify an additional 20,913 putatively causal eQTLs, and we incorporate EMS into co-localization analysis to identify 310 additional candidate genes across UK Biobank phenotypes.

Finding causal variants and genes at GWAS loci remains a challenge. Here, the authors train a model to predict whether a variant affects nearby gene expression, and apply the method to identify new possible causal eQTLs and mechanisms of GWAS loci.
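
To illustrate how a functional score can act as a prior in fine-mapping, the following is a minimal sketch, not the authors' implementation. It assumes exactly one causal variant per locus, which is a simplification this record does not state, and it takes per-variant log Bayes factors as given. The function name `pip_single_causal`, the example Bayes factors, and the example scores are all hypothetical.

```python
import math


def pip_single_causal(log_bfs, prior_weights=None):
    """Posterior inclusion probabilities under a single-causal-variant model.

    log_bfs: per-variant natural-log Bayes factors (hypothetical inputs).
    prior_weights: positive per-variant weights, e.g. a functional score;
        None means a uniform prior. Weights are normalized internally.
    """
    n = len(log_bfs)
    if prior_weights is None:
        prior_weights = [1.0] * n
    if len(prior_weights) != n:
        raise ValueError("prior_weights must match log_bfs in length")
    if any(w <= 0 for w in prior_weights):
        raise ValueError("prior_weights must be strictly positive")

    total = sum(prior_weights)
    # Unnormalized log posterior: log prior + log Bayes factor.
    log_post = [lb + math.log(w / total)
                for lb, w in zip(log_bfs, prior_weights)]
    # Subtract the maximum before exponentiating for numerical stability.
    m = max(log_post)
    unnorm = [math.exp(x - m) for x in log_post]
    z = sum(unnorm)
    return [u / z for u in unnorm]


# Two variants with identical statistical evidence (e.g. in perfect LD)
# and a third with weaker evidence. All values are made up.
log_bfs = [5.0, 5.0, 1.0]
uniform = pip_single_causal(log_bfs)

# Hypothetical functional scores favoring the first variant tenfold.
scores = [0.02, 0.002, 0.002]
informed = pip_single_causal(log_bfs, scores)
```

With a uniform prior the first two variants are indistinguishable. With the functional prior, the posterior odds between them equal the ratio of their prior weights, so the score breaks the tie that the association statistics alone cannot.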

Details

Title
Leveraging supervised learning for functionally informed fine-mapping of cis-eQTLs identifies an additional 20,913 putative causal eQTLs
Author
Wang, Qingbo S 1; Kelley, David R 2; Ulirsch, Jacob 3; Kanai, Masahiro 4; Shuvom, Sadhuka 5; Cui, Ran 6; Albors, Carlos 6; Cheng, Nathan 6; Okada, Yukinori 7; Matsuda, Koichi 8; Yamanashi, Yuji 9; Furukawa, Yoichi 10; Morisaki, Takayuki 11; Murakami, Yoshinori 12; Kamatani, Yoichiro 13; Muto, Kaori 14; Nagai, Akiko 14; Obara, Wataru 15; Yamaji, Ken 16; Takahashi, Kazuhisa 17; Asai, Satoshi 18; Takahashi, Yasuo 19; Suzuki, Takao 20; Sinozaki, Nobuaki 20; Yamaguchi, Hiroki 21; Minami, Shiro 22; Murayama, Shigeo 23; Yoshimori, Kozo 24; Nagayama, Satoshi 25; Obata, Daisuke 26; Higashiyama, Masahiko 27; Masumoto, Akihide 28; Koretsune, Yukihiro 29; Aguet, Francois 30; Ardlie, Kristin G 30; MacArthur, Daniel G 31; Finucane, Hilary K 6

1  Broad Institute of MIT and Harvard, Cambridge, USA (GRID:grid.66859.34); Massachusetts General Hospital, Analytic and Translational Genetics Unit, Boston, USA (GRID:grid.32224.35) (ISNI:0000 0004 0386 9924); PhD program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X)
2  Calico Life Sciences, South San Francisco, USA (GRID:grid.38142.3c)
3  Broad Institute of MIT and Harvard, Cambridge, USA (GRID:grid.66859.34); Massachusetts General Hospital, Analytic and Translational Genetics Unit, Boston, USA (GRID:grid.32224.35) (ISNI:0000 0004 0386 9924); PhD program in Biological and Biomedical Sciences, Harvard Medical School, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X)
4  Broad Institute of MIT and Harvard, Cambridge, USA (GRID:grid.66859.34); Massachusetts General Hospital, Analytic and Translational Genetics Unit, Boston, USA (GRID:grid.32224.35) (ISNI:0000 0004 0386 9924); PhD program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X); Osaka University Graduate School of Medicine, Department of Statistical Genetics, Osaka, Japan (GRID:grid.136593.b) (ISNI:0000 0004 0373 3971)
5  Broad Institute of MIT and Harvard, Cambridge, USA (GRID:grid.66859.34); Harvard College, Cambridge, USA (GRID:grid.38142.3c) (ISNI:000000041936754X)
6  Broad Institute of MIT and Harvard, Cambridge, USA (GRID:grid.66859.34); Massachusetts General Hospital, Analytic and Translational Genetics Unit, Boston, USA (GRID:grid.32224.35) (ISNI:0000 0004 0386 9924)
7  Osaka University Graduate School of Medicine, Department of Statistical Genetics, Osaka, Japan (GRID:grid.136593.b) (ISNI:0000 0004 0373 3971); Osaka University, Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka, Japan (GRID:grid.136593.b) (ISNI:0000 0004 0373 3971); Osaka University, Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka, Japan (GRID:grid.136593.b) (ISNI:0000 0004 0373 3971)
8  The University of Tokyo, Laboratory of Genome Technology, Human Genome Center, Institute of Medical Science, Tokyo, Japan (GRID:grid.26999.3d) (ISNI:0000 0001 2151 536X); The University of Tokyo, Laboratory of Clinical Genome Sequencing, Graduate School of Frontier Sciences, Tokyo, Japan (GRID:grid.26999.3d) (ISNI:0000 0001 2151 536X)
9  The University of Tokyo, Division of Genetics, The Institute of Medical Science, Tokyo, Japan (GRID:grid.26999.3d) (ISNI:0000 0001 2151 536X)
10  The University of Tokyo, Division of Clinical Genome Research, Institute of Medical Science, Tokyo, Japan (GRID:grid.26999.3d) (ISNI:0000 0001 2151 536X) 
11  The University of Tokyo, Division of Molecular Pathology, IMSUT Hospital Department of Internal Medicine, Institute of Medical Science, Tokyo, Japan (GRID:grid.26999.3d) (ISNI:0000 0001 2151 536X) 
12  The University of Tokyo, Department of Cancer Biology, Institute of Medical Science, Tokyo, Japan (GRID:grid.26999.3d) (ISNI:0000 0001 2151 536X) 
13  The University of Tokyo, Laboratory of Clinical Genome Sequencing, Graduate School of Frontier Sciences, Tokyo, Japan (GRID:grid.26999.3d) (ISNI:0000 0001 2151 536X); The University of Tokyo, Laboratory of Complex Trait Genomics, Graduate School of Frontier Sciences, Tokyo, Japan (GRID:grid.26999.3d) (ISNI:0000 0001 2151 536X) 
14  The University of Tokyo, Department of Public Policy, Institute of Medical Science, Tokyo, Japan (GRID:grid.26999.3d) (ISNI:0000 0001 2151 536X) 
15  Iwate Medical University, Department of Urology, Iwate, Japan (GRID:grid.411790.a) (ISNI:0000 0000 9613 6383) 
16  Juntendo University Graduate School of Medicine, Department of Internal Medicine and Rheumatology, Tokyo, Japan (GRID:grid.258269.2) (ISNI:0000 0004 1762 2738) 
17  Juntendo University Graduate School of Medicine, Department of Respiratory Medicine, Tokyo, Japan (GRID:grid.258269.2) (ISNI:0000 0004 1762 2738) 
18  Nihon University School of Medicine, Division of Pharmacology, Department of Biomedical Science, Tokyo, Japan (GRID:grid.260969.2) (ISNI:0000 0001 2149 8846); Nihon University School of Medicine, Division of Genomic Epidemiology and Clinical Trials, Clinical Trials Research Center, Tokyo, Japan (GRID:grid.260969.2) (ISNI:0000 0001 2149 8846)
19  Nihon University School of Medicine, Division of Genomic Epidemiology and Clinical Trials, Clinical Trials Research Center, Tokyo, Japan (GRID:grid.260969.2) (ISNI:0000 0001 2149 8846) 
20  Tokushukai Group, Tokyo, Japan (GRID:grid.260969.2) 
21  Department of Hematology, Nippon Medical School, Tokyo, Japan (GRID:grid.410821.e) (ISNI:0000 0001 2173 8328)
22  Nippon Medical School, Department of Bioregulation, Kawasaki, Japan (GRID:grid.410821.e) (ISNI:0000 0001 2173 8328) 
23  Tokyo Metropolitan Geriatric Hospital and Institute of Gerontology, Tokyo, Japan (GRID:grid.417092.9) 
24  Fukujuji Hospital, Japan Anti-Tuberculosis Association, Tokyo, Japan (GRID:grid.417092.9) 
25  The Cancer Institute Hospital of the Japanese Foundation for Cancer Research, Tokyo, Japan (GRID:grid.410807.a) (ISNI:0000 0001 0037 4131) 
26  Shiga University of Medical Science, Center for Clinical Research and Advanced Medicine, Shiga, Japan (GRID:grid.410827.8) (ISNI:0000 0000 9747 6806) 
27  Osaka International Cancer Institute, Department of General Thoracic Surgery, Osaka, Japan (GRID:grid.489169.b) 
28  IIZUKA-HOSPITAL, Fukuoka, Japan (GRID:grid.413984.3) 
29  National Hospital Organization Osaka National Hospital, Osaka, Japan (GRID:grid.416803.8) (ISNI:0000 0004 0377 7966) 
30  Broad Institute of MIT and Harvard, Cambridge, USA (GRID:grid.66859.34) 
31  Centre for Population Genomics, Garvan Institute of Medical Research, Darlinghurst, Australia (GRID:grid.415306.5) (ISNI:0000 0000 9983 6924); Centre for Population Genomics, Murdoch Children’s Research Institute, Parkville, Australia (GRID:grid.1058.c) (ISNI:0000 0000 9442 535X) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2537860624
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.