Abstract

GapMind is a web-based tool for annotating amino acid biosynthesis in bacteria and archaea (http://papers.genomics.lbl/gov/gaps). GapMind incorporates many variant pathways and 130 different reactions, and it analyzes a genome in just 15 seconds. To avoid error-prone "transitive" annotations, GapMind relies primarily on a database of experimentally-characterized proteins. GapMind correctly handles fusion proteins and split proteins, which often cause errors for "best hit" approaches. To improve GapMind's coverage, we examined genetic data from 35 bacteria that grow in minimal media and we filled many gaps in amino acid biosynthesis pathways. For example, we identified additional genes for arginine synthesis with succinylated intermediates in Bacteroides thetaiotaomicron and we propose that Dyella japonica synthesizes tyrosine from phenylalanine. Nevertheless, for many bacteria and archaea that grow in minimal media, genes for some steps still cannot be identified. If a potential gap in the genome of interest is also a gap in a related microbe that can grow in minimal media, GapMind marks the gap as "known."

Footnotes

* https://doi.org/10.6084/m9.figshare.9693689.v1​

Details

Title
GapMind: Automated annotation of amino acid biosynthesis
Author
Price, Morgan N; Deutschbauer, Adam M; Arkin, Adam P
University/institution
Cold Spring Harbor Laboratory Press
Section
New Results
Publication year
2019
Publication date
Aug 21, 2019
Publisher
Cold Spring Harbor Laboratory Press
Source type
Working Paper
Language of publication
English
ProQuest document ID
2277172380
Copyright
© 2019. This article is published under http://creativecommons.org/licenses/by/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.