Full Text

Turn on search term navigation

© 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Premise

The functional annotation of genes is a crucial component of genomic analyses. A common way to summarize functional annotations is with hierarchical gene ontologies, such as the Gene Ontology (GO) Resource. GO includes information about the cellular location, molecular function(s), and products/processes that genes produce or are involved in. For a set of genes, summarizing GO annotations using pre-defined, higher-order terms (GO slims) is often desirable in order to characterize the overall function of the data set, and it is impractical to do this manually.

Methods and Results

The GOgetter pipeline consists of bash and Python scripts. From an input FASTA file of nucleotide gene sequences, it outputs text and image files that list (1) the best hit for each input gene in a set of reference gene models, (2) all GO terms and annotations associated with those hits, and (3) a summary and visualization of GO slim categories for the data set. These output files can be queried further and analyzed statistically, depending on the downstream need(s).

Conclusions

GO annotations are a widely used “universal language” for describing gene functions and products. GOgetter is a fast and easy-to-implement pipeline for obtaining, summarizing, and visualizing GO slim categories associated with a set of genes.

Details

Title
GOgetter: A pipeline for summarizing and visualizing GO slim annotations for plant genetic data
Author
Sessa, Emily B 1   VIAFID ORCID Logo  ; Masalia, Rishi R 2   VIAFID ORCID Logo  ; Arrigo, Nils 3 ; Barker, Michael S 2   VIAFID ORCID Logo  ; Pelosi, Jessie A 4   VIAFID ORCID Logo 

 New York Botanical Garden, Bronx, New York, USA 
 Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, USA 
 SOPHiA Genetics, Saint Sulpice, Switzerland 
 Department of Biology, University of Florida, Gainesville, Florida, USA 
Section
SOFTWARE NOTE
Publication year
2023
Publication date
Jul 2023
Publisher
John Wiley & Sons, Inc.
e-ISSN
21680450
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2853072900
Copyright
© 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.