It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
Annotations of evolutionary sequence constraint based on multi-species genome alignments and genome-wide maps of epigenomic marks and transcription factor binding provide important complementary information for understanding the human genome and genetic variation. Here we developed the Constrained Non-Exonic Predictor (CNEP) to quantify the evidence of each base in the genome being in an evolutionarily constrained non-exonic element from an input of over 60,000 epigenomic and transcription factor binding features. We find that the CNEP score outperforms baseline and related existing scores at predicting evolutionarily constrained non-exonic bases from such data. However, a subset of them are still not well predicted by CNEP. We developed a complementary Conservation Signature Score by CNEP (CSS-CNEP) that is predictive of those bases. We further characterize the nature of constrained non-exonic bases with low CNEP scores using additional types of information. CNEP and CSS-CNEP are resources for analyzing constrained non-exonic bases in the genome.
Genome-wide maps of evolutionary constraint and large-scale compendia of epigenomic and transcription factor data provide complementary information for genome annotation. Here, the authors develop the Constrained Non-Exonic Predictor (CNEP) that enables better understanding of their relationship.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details




1 University of California, Los Angeles, Computer Science Department, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Los Angeles, Department of Biological Chemistry, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
2 University of California, Los Angeles, Interdepartmental Program in Bioinformatics, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
3 University of California, Los Angeles, Department of Biological Chemistry, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Los Angeles, Interdepartmental Program in Bioinformatics, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
4 University of California, Los Angeles, Computer Science Department, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
5 University of California, Los Angeles, Interdepartmental Program in Bioinformatics, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Los Angeles, Department of Ecology and Evolutionary Biology, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Los Angeles, Department of Human Genetics, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)
6 University of California, Los Angeles, Computer Science Department, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Los Angeles, Department of Biological Chemistry, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Los Angeles, Interdepartmental Program in Bioinformatics, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research at University of California, Los Angeles, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Los Angeles, Jonsson Comprehensive Cancer Center, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Los Angeles, Molecular Biology Institute, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718)