Abstract

Background

Inferring gene-to-phenotype and gene-to-human disease model relationships from annotated mouse phenotypes and disease associations is critical when researching gene function and identifying candidate disease genes. Filtering the various kinds of genotypes to determine which phenotypes are caused by a mutation in a particular gene can be a laborious and time-consuming process.

Methods

At Mouse Genome Informatics (MGI, www.informatics.jax.org), we have developed a gene annotation derivation algorithm that computes gene-to-phenotype and gene-to-disease annotations from our existing corpus of annotations to genotypes. This algorithm differentiates between simple genotypes with causative mutations in a single gene and more complex genotypes where mutations in multiple genes may contribute to the phenotype. As part of the process, alleles functioning as tools (e.g., reporters, recombinases) are filtered out.

Results

Using this algorithm derived gene-to-phenotype and gene-to-disease annotations were created for 16,000 and 2100 mouse markers, respectively, starting from over 57,900 and 4800 genotypes with at least one phenotype and disease annotation, respectively.

Conclusions

Implementation of this algorithm provides consistent and accurate gene annotations across MGI and provides a vital time-savings relative to manual annotation by curators.

Details

Title
Inferring gene-to-phenotype and gene-to-disease relationships at Mouse Genome Informatics: challenges and solutions
Author
Bello, Susan M; Eppig, Janan T
Publication year
2016
Publication date
2016
Publisher
BioMed Central
e-ISSN
20411480
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1796903941
Copyright
Copyright BioMed Central 2016