Abstract

Background

Oxford Nanopore Technology (ONT) long-read sequencing has become a popular platform for microbial researchers due to the accessibility and affordability of its devices. However, easy and automated construction of high-quality bacterial genomes using nanopore reads remains challenging. Here we aimed to create a reproducible end-to-end bacterial genome assembly pipeline using ONT in combination with Illumina sequencing.

Results

We evaluated the performance of several popular tools used during genome reconstruction, including base-calling, filtering, assembly, and polishing. We also assessed overall genome accuracy using ONT both natively and with Illumina. All steps were validated using the high-quality complete reference genome for the Escherichia coli sequence type (ST)131 strain EC958. Software chosen at each stage were incorporated into our final pipeline, MicroPIPE.

Further validation of MicroPIPE was carried out using 11 additional ST131 E. coli isolates, which demonstrated that complete circularised chromosomes and plasmids could be achieved without manual intervention. Twelve publicly available Gram-negative and Gram-positive bacterial genomes (with available raw ONT data and matched complete genomes) were also assembled using MicroPIPE. We found that revised basecalling and updated assembly of the majority of these genomes resulted in improved accuracy compared to the current publicly available complete genomes.

Conclusions

MicroPIPE is built in modules using Singularity container images and the bioinformatics workflow manager Nextflow, allowing changes and adjustments to be made in response to future tool development. Overall, MicroPIPE provides an easy-access, end-to-end solution for attaining high-quality bacterial genomes. MicroPIPE is available at https://github.com/BeatsonLab-MicrobialGenomics/micropipe.

Details

Title
MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction
Author
Valentine Murigneux  VIAFID ORCID Logo  ; Roberts, Leah W  VIAFID ORCID Logo  ; Forde, Brian M  VIAFID ORCID Logo  ; Minh-Duy Phan  VIAFID ORCID Logo  ; Nguyen Thi Khanh Nhu  VIAFID ORCID Logo  ; Irwin, Adam D  VIAFID ORCID Logo  ; Harris, Patrick N A  VIAFID ORCID Logo  ; Paterson, David L  VIAFID ORCID Logo  ; Schembri, Mark A  VIAFID ORCID Logo  ; Whiley, David M  VIAFID ORCID Logo  ; Beatson, Scott A  VIAFID ORCID Logo 
Pages
1-15
Section
Software
Publication year
2021
Publication date
2021
Publisher
Springer Nature B.V.
e-ISSN
14712164
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2552858346
Copyright
© 2021. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.