Cargando…
MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction
BACKGROUND: Oxford Nanopore Technology (ONT) long-read sequencing has become a popular platform for microbial researchers due to the accessibility and affordability of its devices. However, easy and automated construction of high-quality bacterial genomes using nanopore reads remains challenging. He...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8235852/ https://www.ncbi.nlm.nih.gov/pubmed/34172000 http://dx.doi.org/10.1186/s12864-021-07767-z |
_version_ | 1783714414144258048 |
---|---|
author | Murigneux, Valentine Roberts, Leah W. Forde, Brian M. Phan, Minh-Duy Nhu, Nguyen Thi Khanh Irwin, Adam D. Harris, Patrick N. A. Paterson, David L. Schembri, Mark A. Whiley, David M. Beatson, Scott A. |
author_facet | Murigneux, Valentine Roberts, Leah W. Forde, Brian M. Phan, Minh-Duy Nhu, Nguyen Thi Khanh Irwin, Adam D. Harris, Patrick N. A. Paterson, David L. Schembri, Mark A. Whiley, David M. Beatson, Scott A. |
author_sort | Murigneux, Valentine |
collection | PubMed |
description | BACKGROUND: Oxford Nanopore Technology (ONT) long-read sequencing has become a popular platform for microbial researchers due to the accessibility and affordability of its devices. However, easy and automated construction of high-quality bacterial genomes using nanopore reads remains challenging. Here we aimed to create a reproducible end-to-end bacterial genome assembly pipeline using ONT in combination with Illumina sequencing. RESULTS: We evaluated the performance of several popular tools used during genome reconstruction, including base-calling, filtering, assembly, and polishing. We also assessed overall genome accuracy using ONT both natively and with Illumina. All steps were validated using the high-quality complete reference genome for the Escherichia coli sequence type (ST)131 strain EC958. Software chosen at each stage were incorporated into our final pipeline, MicroPIPE. Further validation of MicroPIPE was carried out using 11 additional ST131 E. coli isolates, which demonstrated that complete circularised chromosomes and plasmids could be achieved without manual intervention. Twelve publicly available Gram-negative and Gram-positive bacterial genomes (with available raw ONT data and matched complete genomes) were also assembled using MicroPIPE. We found that revised basecalling and updated assembly of the majority of these genomes resulted in improved accuracy compared to the current publicly available complete genomes. CONCLUSIONS: MicroPIPE is built in modules using Singularity container images and the bioinformatics workflow manager Nextflow, allowing changes and adjustments to be made in response to future tool development. Overall, MicroPIPE provides an easy-access, end-to-end solution for attaining high-quality bacterial genomes. MicroPIPE is available at https://github.com/BeatsonLab-MicrobialGenomics/micropipe. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07767-z. |
format | Online Article Text |
id | pubmed-8235852 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-82358522021-06-28 MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction Murigneux, Valentine Roberts, Leah W. Forde, Brian M. Phan, Minh-Duy Nhu, Nguyen Thi Khanh Irwin, Adam D. Harris, Patrick N. A. Paterson, David L. Schembri, Mark A. Whiley, David M. Beatson, Scott A. BMC Genomics Software BACKGROUND: Oxford Nanopore Technology (ONT) long-read sequencing has become a popular platform for microbial researchers due to the accessibility and affordability of its devices. However, easy and automated construction of high-quality bacterial genomes using nanopore reads remains challenging. Here we aimed to create a reproducible end-to-end bacterial genome assembly pipeline using ONT in combination with Illumina sequencing. RESULTS: We evaluated the performance of several popular tools used during genome reconstruction, including base-calling, filtering, assembly, and polishing. We also assessed overall genome accuracy using ONT both natively and with Illumina. All steps were validated using the high-quality complete reference genome for the Escherichia coli sequence type (ST)131 strain EC958. Software chosen at each stage were incorporated into our final pipeline, MicroPIPE. Further validation of MicroPIPE was carried out using 11 additional ST131 E. coli isolates, which demonstrated that complete circularised chromosomes and plasmids could be achieved without manual intervention. Twelve publicly available Gram-negative and Gram-positive bacterial genomes (with available raw ONT data and matched complete genomes) were also assembled using MicroPIPE. We found that revised basecalling and updated assembly of the majority of these genomes resulted in improved accuracy compared to the current publicly available complete genomes. CONCLUSIONS: MicroPIPE is built in modules using Singularity container images and the bioinformatics workflow manager Nextflow, allowing changes and adjustments to be made in response to future tool development. Overall, MicroPIPE provides an easy-access, end-to-end solution for attaining high-quality bacterial genomes. MicroPIPE is available at https://github.com/BeatsonLab-MicrobialGenomics/micropipe. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07767-z. BioMed Central 2021-06-25 /pmc/articles/PMC8235852/ /pubmed/34172000 http://dx.doi.org/10.1186/s12864-021-07767-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Murigneux, Valentine Roberts, Leah W. Forde, Brian M. Phan, Minh-Duy Nhu, Nguyen Thi Khanh Irwin, Adam D. Harris, Patrick N. A. Paterson, David L. Schembri, Mark A. Whiley, David M. Beatson, Scott A. MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction |
title | MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction |
title_full | MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction |
title_fullStr | MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction |
title_full_unstemmed | MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction |
title_short | MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction |
title_sort | micropipe: validating an end-to-end workflow for high-quality complete bacterial genome construction |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8235852/ https://www.ncbi.nlm.nih.gov/pubmed/34172000 http://dx.doi.org/10.1186/s12864-021-07767-z |
work_keys_str_mv | AT murigneuxvalentine micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction AT robertsleahw micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction AT fordebrianm micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction AT phanminhduy micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction AT nhunguyenthikhanh micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction AT irwinadamd micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction AT harrispatrickna micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction AT patersondavidl micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction AT schembrimarka micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction AT whileydavidm micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction AT beatsonscotta micropipevalidatinganendtoendworkflowforhighqualitycompletebacterialgenomeconstruction |