Abstract

With the goal of solving the whole-cell problem with Escherichia coli K-12 as a model cell, highly accurate genomes were determined for two closely related K-12 strains, MG1655 and W3110. Completion of the W3110 genome and comparison with the MG1655 genome revealed differences at 267 sites, including 251 sites with short, mostly single-nucleotide, insertions or deletions (indels) or base substitutions (totaling 358 nucleotides), in addition to 13 sites with an insertion sequence element or defective prophage in only one strain and two sites for the W3110 inversion. Direct DNA sequencing of PCR products for the 251 regions with short indel and base disparities revealed that only eight sites are true differences. The other 243 discrepancies were due to errors in the original MG1655 sequence, including 79 frameshifts, one amino-acid residue deletion, five amino-acid residue insertions, 73 missense, and 17 silent changes within coding regions. Errors in the original MG1655 sequence (<1 per 13 000 bases) were mostly within portions sequenced with out-dated technology based on radioactive chemistry.
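The counts quoted in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only. All variable names are hypothetical, and the genome length is an assumed round figure of about 4.64 Mb, which the abstract does not state. The abstract also does not itemize the errors that fall outside the listed coding-region categories, so that remainder is simply computed by subtraction.

```python
# Counts taken directly from the abstract.
short_difference_sites = 251      # sites with short indels or base substitutions
confirmed_true_differences = 8    # sites confirmed by sequencing PCR products

# Discrepancies attributed to errors in the original MG1655 sequence.
sequencing_errors = short_difference_sites - confirmed_true_differences

# Error categories the abstract lists within coding regions.
coding_region_errors = {
    "frameshift": 79,
    "amino-acid residue deletion": 1,
    "amino-acid residue insertion": 5,
    "missense": 73,
    "silent": 17,
}
coding_total = sum(coding_region_errors.values())

# Errors not itemized in the abstract (obtained by subtraction only).
not_itemized = sequencing_errors - coding_total

# ASSUMPTION: approximate genome length in base pairs; not given in the abstract.
ASSUMED_GENOME_LENGTH_BP = 4_640_000
bases_per_error = ASSUMED_GENOME_LENGTH_BP / sequencing_errors

print(f"sequencing errors: {sequencing_errors}")
print(f"within coding regions: {coding_total}")
print(f"not itemized in the abstract: {not_itemized}")
print(f"roughly one error per {bases_per_error:,.0f} bases")
```

Under the assumed genome length, the per-site rate comes out well below one error per 13 000 bases, which is consistent with the bound stated in the abstract.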

Details

Title
Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110
Author
Hayashi, Koji 1; Morooka, Naoki 1; Yamamoto, Yoshihiro 2; Fujita, Katsutoshi 3; Isono, Katsumi 3; Choi, Sunju 4; Ohtsubo, Eiichi 4; Baba, Tomoya 5; Wanner, Barry L 6; Mori, Hirotada 5; Horiuchi, Takashi 1

1 Division of Genome Dynamics, National Institute for Basic Biology, Myodaiji, Okazaki, Aichi Pref., Japan
2 Hyogo College of Medicine, Nishinomiya, Japan
3 Graduate School of Science and Technology, Kobe University, Kobe, Japan
4 Institute of Molecular and Cellular Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
5 Institute of Advanced Biosciences, Keio University, Tsuruoka, Yamagata, Japan; Nara Institute of Science and Technology, Ikoma, Nara, Japan
6 Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
Section
Report
Publication year
2006
Publication date
2006
Publisher
EMBO Press
e-ISSN
1744-4292
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2299159781
Copyright
© 2006. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.