Abstract

Identifying genomic regions with functional genomic properties that are conserved between human and mouse is an important challenge in the context of mouse model studies. To address this, we develop a method to learn a score of evidence of conservation at the functional genomics level by integrating information from a compendium of epigenomic, transcription factor binding, and transcriptomic data from human and mouse. The method, Learning Evidence of Conservation from Integrated Functional genomic annotations (LECIF), trains neural networks to generate this score for the human and mouse genomes. The resulting LECIF score highlights human and mouse regions with shared functional genomic properties and captures correspondence of biologically similar human and mouse annotations. Analysis with independent datasets shows the score also highlights loci associated with similar phenotypes in both species. LECIF will be a resource for mouse model studies by identifying loci whose functional genomic properties are likely conserved.

Understanding conserved functional genomic properties between human and mouse provides important context for mouse model studies. Here, the authors present a genome-wide conservation score integrating epigenomic, transcription factor binding, and transcriptomic data from mouse and human genomes.

Details

Title
Learning a genome-wide score of human–mouse conservation at the functional genomics level
Author
Kwon, Soo Bin 1   VIAFID ORCID Logo  ; Ernst, Jason 2   VIAFID ORCID Logo 

 University of California, Bioinformatics Interdepartmental Program, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Department of Biological Chemistry, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718) 
 University of California, Bioinformatics Interdepartmental Program, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Department of Biological Chemistry, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research at University of California, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Computer Science Department, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Department of Computational Medicine, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Jonsson Comprehensive Cancer Center, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718); University of California, Molecular Biology Institute, Los Angeles, USA (GRID:grid.19006.3e) (ISNI:0000 0000 9632 6718) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2521265329
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.