It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
Inference of clinical phenotypes is a fundamental task in precision medicine, and has therefore been heavily investigated in recent years in the context of electronic health records (EHR) using a large arsenal of machine learning techniques, as well as in the context of genetics using polygenic risk scores (PRS). In this work, we considered the epigenetic analog of PRS, methylation risk scores (MRS), a linear combination of methylation states. We measured methylation across a large cohort (n = 831) of diverse samples in the UCLA Health biobank, for which both genetic and complete EHR data are available. We constructed MRS for 607 phenotypes spanning diagnoses, clinical lab tests, and medication prescriptions. When added to a baseline set of predictive features, MRS significantly improved the imputation of 139 outcomes, whereas the PRS improved only 22 (median improvement for methylation 10.74%, 141.52%, and 15.46% in medications, labs, and diagnosis codes, respectively, whereas genotypes only improved the labs at a median increase of 18.42%). We added significant MRS to state-of-the-art EHR imputation methods that leverage the entire set of medical records, and found that including MRS as a medical feature in the algorithm significantly improves EHR imputation in 37% of lab tests examined (median R2 increase 47.6%). Finally, we replicated several MRS in multiple external studies of methylation (minimum p-value of 2.72 × 10−7) and replicated 22 of 30 tested MRS internally in two separate cohorts of different ethnicity. Our publicly available results and weights show promise for methylation risk scores as clinical and scientific tools.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details

1 University of California Los Angeles, Department of Computer Science, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
2 University of California Los Angeles, Department of Computational Medicine, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
3 University of California Los Angeles, Institute of Precision Health, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
4 University of California Los Angeles, Department of Computer Science, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California Los Angeles, Department of Computational Medicine, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California Los Angeles, Department of Human Genetics, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
5 University of California Los Angeles, Department of Anesthesiology and Perioperative Medicine, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
6 University of California Los Angeles, Department of Neurology, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
7 University of California Los Angeles, Department of Computer Science, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California Los Angeles, Department of Computational Medicine, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California Los Angeles, Department of Human Genetics, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California Los Angeles, Department of Anesthesiology and Perioperative Medicine, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)