Abstract

Streptomyces are Gram-positive bacteria of significant industrial importance due to their ability to produce a wide range of antibiotics and bioactive secondary metabolites. Recent advances in genome mining have revealed that Streptomyces genomes possess a large number of unexplored silent secondary metabolite biosynthetic gene clusters (smBGCs). This indicates that Streptomyces genomes continue to be an invaluable source for new drug discovery. Here, we present high-quality genome sequences of 22 Streptomyces species and eight different Streptomyces venezuelae strains assembled by a hybrid strategy exploiting both long-read and short-read genome sequencing methods. The assembled genomes have more than 97.4% gene space completeness and total lengths ranging from 6.7 to 10.1 Mbp. Their annotation identified 7,000 protein coding genes, 20 rRNAs, and 68 tRNAs on average. In silico prediction of smBGCs identified a total of 922 clusters, including many clusters whose products are unknown. We anticipate that the availability of these genomes will accelerate discovery of novel secondary metabolites from Streptomyces and elucidate complex smBGC regulation.

Measurement(s)

DNA • genome • sequence_assembly • sequence feature annotation

Technology Type(s)

DNA sequencing • sequence assembly process • sequence annotation

Factor Type(s)

strain

Sample Characteristic - Organism

Streptomyces

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.11791323

Details

Title
Thirty complete Streptomyces genome sequences for mining novel secondary metabolite biosynthetic gene clusters
Author
Lee, Namil 1 ; Kim Woori 1 ; Hwang Soonkyu 1 ; Lee, Yongjae 1 ; Cho Suhyung 1 ; Palsson Bernhard 2   VIAFID ORCID Logo  ; Byung-Kwan, Cho 3   VIAFID ORCID Logo 

 Korea Advanced Institute of Science and Technology, Department of Biological Sciences and KI for the BioCentury, Daejeon, Republic of Korea (GRID:grid.37172.30) (ISNI:0000 0001 2292 0500) 
 University of California San Diego, Department of Bioengineering, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242); University of California San Diego, Department of Pediatrics, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242); Technical University of Denmark, Novo Nordisk Foundation Center for Biosustainability, Lyngby, Denmark (GRID:grid.5170.3) (ISNI:0000 0001 2181 8870) 
 Korea Advanced Institute of Science and Technology, Department of Biological Sciences and KI for the BioCentury, Daejeon, Republic of Korea (GRID:grid.37172.30) (ISNI:0000 0001 2292 0500); Intelligent Synthetic Biology Center, Daejeon, Republic of Korea (GRID:grid.37172.30); Technical University of Denmark, Novo Nordisk Foundation Center for Biosustainability, Lyngby, Denmark (GRID:grid.5170.3) (ISNI:0000 0001 2181 8870) 
Publication year
2020
Publication date
2020
Publisher
Nature Publishing Group
e-ISSN
20524463
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2489907111
Copyright
© The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.