Cargando…

MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction

BACKGROUND: Oxford Nanopore Technology (ONT) long-read sequencing has become a popular platform for microbial researchers due to the accessibility and affordability of its devices. However, easy and automated construction of high-quality bacterial genomes using nanopore reads remains challenging. He...

Descripción completa

Detalles Bibliográficos
Autores principales: Murigneux, Valentine, Roberts, Leah W., Forde, Brian M., Phan, Minh-Duy, Nhu, Nguyen Thi Khanh, Irwin, Adam D., Harris, Patrick N. A., Paterson, David L., Schembri, Mark A., Whiley, David M., Beatson, Scott A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8235852/
https://www.ncbi.nlm.nih.gov/pubmed/34172000
http://dx.doi.org/10.1186/s12864-021-07767-z
_version_ 1783714414144258048
author Murigneux, Valentine
Roberts, Leah W.
Forde, Brian M.
Phan, Minh-Duy
Nhu, Nguyen Thi Khanh
Irwin, Adam D.
Harris, Patrick N. A.
Paterson, David L.
Schembri, Mark A.
Whiley, David M.
Beatson, Scott A.
author_facet Murigneux, Valentine
Roberts, Leah W.
Forde, Brian M.
Phan, Minh-Duy
Nhu, Nguyen Thi Khanh
Irwin, Adam D.
Harris, Patrick N. A.
Paterson, David L.
Schembri, Mark A.
Whiley, David M.
Beatson, Scott A.
author_sort Murigneux, Valentine
collection PubMed
description BACKGROUND: Oxford Nanopore Technology (ONT) long-read sequencing has become a popular platform for microbial researchers due to the accessibility and affordability of its devices. However, easy and automated construction of high-quality bacterial genomes using nanopore reads remains challenging. Here we aimed to create a reproducible end-to-end bacterial genome assembly pipeline using ONT in combination with Illumina sequencing. RESULTS: We evaluated the performance of several popular tools used during genome reconstruction, including base-calling, filtering, assembly, and polishing. We also assessed overall genome accuracy using ONT both natively and with Illumina. All steps were validated using the high-quality complete reference genome for the Escherichia coli sequence type (ST)131 strain EC958. Software chosen at each stage were incorporated into our final pipeline, MicroPIPE. Further validation of MicroPIPE was carried out using 11 additional ST131 E. coli isolates, which demonstrated that complete circularised chromosomes and plasmids could be achieved without manual intervention. Twelve publicly available Gram-negative and Gram-positive bacterial genomes (with available raw ONT data and matched complete genomes) were also assembled using MicroPIPE. We found that revised basecalling and updated assembly of the majority of these genomes resulted in improved accuracy compared to the current publicly available complete genomes. CONCLUSIONS: MicroPIPE is built in modules using Singularity container images and the bioinformatics workflow manager Nextflow, allowing changes and adjustments to be made in response to future tool development. Overall, MicroPIPE provides an easy-access, end-to-end solution for attaining high-quality bacterial genomes. MicroPIPE is available at https://github.com/BeatsonLab-MicrobialGenomics/micropipe. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07767-z.
format Online
Article
Text
id pubmed-8235852
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-82358522021-06-28 MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction Murigneux, Valentine Roberts, Leah W. Forde, Brian M. Phan, Minh-Duy Nhu, Nguyen Thi Khanh Irwin, Adam D. Harris, Patrick N. A. Paterson, David L. Schembri, Mark A. Whiley, David M. Beatson, Scott A. BMC Genomics Software BACKGROUND: Oxford Nanopore Technology (ONT) long-read sequencing has become a popular platform for microbial researchers due to the accessibility and affordability of its devices. However, easy and automated construction of high-quality bacterial genomes using nanopore reads remains challenging. Here we aimed to create a reproducible end-to-end bacterial genome assembly pipeline using ONT in combination with Illumina sequencing. RESULTS: We evaluated the performance of several popular tools used during genome reconstruction, including base-calling, filtering, assembly, and polishing. We also assessed overall genome accuracy using ONT both natively and with Illumina. All steps were validated using the high-quality complete reference genome for the Escherichia coli sequence type (ST)131 strain EC958. Software chosen at each stage were incorporated into our final pipeline, MicroPIPE. Further validation of MicroPIPE was carried out using 11 additional ST131 E. coli isolates, which demonstrated that complete circularised chromosomes and plasmids could be achieved without manual intervention. Twelve publicly available Gram-negative and Gram-positive bacterial genomes (with available raw ONT data and matched complete genomes) were also assembled using MicroPIPE. We found that revised basecalling and updated assembly of the majority of these genomes resulted in improved accuracy compared to the current publicly available complete genomes. CONCLUSIONS: MicroPIPE is built in modules using Singularity container images and the bioinformatics workflow manager Nextflow, allowing changes and adjustments to be made in response to future tool development. Overall, MicroPIPE provides an easy-access, end-to-end solution for attaining high-quality bacterial genomes. MicroPIPE is available at https://github.com/BeatsonLab-MicrobialGenomics/micropipe. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07767-z. BioMed Central 2021-06-25 /pmc/articles/PMC8235852/ /pubmed/34172000 http://dx.doi.org/10.1186/s12864-021-07767-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Murigneux, Valentine
Roberts, Leah W.
Forde, Brian M.
Phan, Minh-Duy
Nhu, Nguyen Thi Khanh
Irwin, Adam D.
Harris, Patrick N. A.
Paterson, David L.
Schembri, Mark A.
Whiley, David M.
Beatson, Scott A.
MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction
title MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction
title_full MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction
title_fullStr MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction
title_full_unstemmed MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction
title_short MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction
title_sort micropipe: validating an end-to-end workflow for high-quality complete bacterial genome construction
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8235852/
https://www.ncbi.nlm.nih.gov/pubmed/34172000
http://dx.doi.org/10.1186/s12864-021-07767-z
work_keys_str_mv AT murigneuxvalentine micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction
AT robertsleahw micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction
AT fordebrianm micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction
AT phanminhduy micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction
AT nhunguyenthikhanh micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction
AT irwinadamd micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction
AT harrispatrickna micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction
AT patersondavidl micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction
AT schembrimarka micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction
AT whileydavidm micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction
AT beatsonscotta micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction