Cargando…

Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes

Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (...

Descripción completa

Detalles Bibliográficos
Autores principales: De Maio, Nicola, Shaw, Liam P., Hubbard, Alasdair, George, Sophie, Sanderson, Nicholas D., Swann, Jeremy, Wick, Ryan, AbuOun, Manal, Stubberfield, Emma, Hoosdally, Sarah J., Crook, Derrick W., Peto, Timothy E. A., Sheppard, Anna E., Bailey, Mark J., Read, Daniel S., Anjum, Muna F., Walker, A. Sarah, Stoesser, Nicole
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Microbiology Society 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6807382/
https://www.ncbi.nlm.nih.gov/pubmed/31483244
http://dx.doi.org/10.1099/mgen.0.000294
_version_ 1783461708705038336
author De Maio, Nicola
Shaw, Liam P.
Hubbard, Alasdair
George, Sophie
Sanderson, Nicholas D.
Swann, Jeremy
Wick, Ryan
AbuOun, Manal
Stubberfield, Emma
Hoosdally, Sarah J.
Crook, Derrick W.
Peto, Timothy E. A.
Sheppard, Anna E.
Bailey, Mark J.
Read, Daniel S.
Anjum, Muna F.
Walker, A. Sarah
Stoesser, Nicole
author_facet De Maio, Nicola
Shaw, Liam P.
Hubbard, Alasdair
George, Sophie
Sanderson, Nicholas D.
Swann, Jeremy
Wick, Ryan
AbuOun, Manal
Stubberfield, Emma
Hoosdally, Sarah J.
Crook, Derrick W.
Peto, Timothy E. A.
Sheppard, Anna E.
Bailey, Mark J.
Read, Daniel S.
Anjum, Muna F.
Walker, A. Sarah
Stoesser, Nicole
author_sort De Maio, Nicola
collection PubMed
description Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (hybrid assembly). However, it is not clear how different long-read sequencing methods affect hybrid assembly accuracy. Relative automation of the assembly process is also crucial to facilitating high-throughput complete bacterial genome reconstruction, avoiding multiple bespoke filtering and data manipulation steps. In this study, we compared hybrid assemblies for 20 bacterial isolates, including two reference strains, using Illumina sequencing and long reads from either Oxford Nanopore Technologies (ONT) or SMRT Pacific Biosciences (PacBio) sequencing platforms. We chose isolates from the family Enterobacteriaceae, as these frequently have highly plastic, repetitive genetic structures, and complete genome reconstruction for these species is relevant for a precise understanding of the epidemiology of antimicrobial resistance. We de novo assembled genomes using the hybrid assembler Unicycler and compared different read processing strategies, as well as comparing to long-read-only assembly with Flye followed by short-read polishing with Pilon. Hybrid assembly with either PacBio or ONT reads facilitated high-quality genome reconstruction, and was superior to the long-read assembly and polishing approach evaluated with respect to accuracy and completeness. Combining ONT and Illumina reads fully resolved most genomes without additional manual steps, and at a lower consumables cost per isolate in our setting. Automated hybrid assembly is a powerful tool for complete and accurate bacterial genome assembly.
format Online
Article
Text
id pubmed-6807382
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-68073822019-10-24 Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes De Maio, Nicola Shaw, Liam P. Hubbard, Alasdair George, Sophie Sanderson, Nicholas D. Swann, Jeremy Wick, Ryan AbuOun, Manal Stubberfield, Emma Hoosdally, Sarah J. Crook, Derrick W. Peto, Timothy E. A. Sheppard, Anna E. Bailey, Mark J. Read, Daniel S. Anjum, Muna F. Walker, A. Sarah Stoesser, Nicole Microb Genom Research Article Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (hybrid assembly). However, it is not clear how different long-read sequencing methods affect hybrid assembly accuracy. Relative automation of the assembly process is also crucial to facilitating high-throughput complete bacterial genome reconstruction, avoiding multiple bespoke filtering and data manipulation steps. In this study, we compared hybrid assemblies for 20 bacterial isolates, including two reference strains, using Illumina sequencing and long reads from either Oxford Nanopore Technologies (ONT) or SMRT Pacific Biosciences (PacBio) sequencing platforms. We chose isolates from the family Enterobacteriaceae, as these frequently have highly plastic, repetitive genetic structures, and complete genome reconstruction for these species is relevant for a precise understanding of the epidemiology of antimicrobial resistance. We de novo assembled genomes using the hybrid assembler Unicycler and compared different read processing strategies, as well as comparing to long-read-only assembly with Flye followed by short-read polishing with Pilon. Hybrid assembly with either PacBio or ONT reads facilitated high-quality genome reconstruction, and was superior to the long-read assembly and polishing approach evaluated with respect to accuracy and completeness. Combining ONT and Illumina reads fully resolved most genomes without additional manual steps, and at a lower consumables cost per isolate in our setting. Automated hybrid assembly is a powerful tool for complete and accurate bacterial genome assembly. Microbiology Society 2019-09-04 /pmc/articles/PMC6807382/ /pubmed/31483244 http://dx.doi.org/10.1099/mgen.0.000294 Text en © 2019 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
De Maio, Nicola
Shaw, Liam P.
Hubbard, Alasdair
George, Sophie
Sanderson, Nicholas D.
Swann, Jeremy
Wick, Ryan
AbuOun, Manal
Stubberfield, Emma
Hoosdally, Sarah J.
Crook, Derrick W.
Peto, Timothy E. A.
Sheppard, Anna E.
Bailey, Mark J.
Read, Daniel S.
Anjum, Muna F.
Walker, A. Sarah
Stoesser, Nicole
Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title_full Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title_fullStr Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title_full_unstemmed Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title_short Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title_sort comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6807382/
https://www.ncbi.nlm.nih.gov/pubmed/31483244
http://dx.doi.org/10.1099/mgen.0.000294
work_keys_str_mv AT demaionicola comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT shawliamp comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT hubbardalasdair comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT georgesophie comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT sandersonnicholasd comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT swannjeremy comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT wickryan comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT abuounmanal comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT stubberfieldemma comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT hoosdallysarahj comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT crookderrickw comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT petotimothyea comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT sheppardannae comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT baileymarkj comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT readdaniels comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT anjummunaf comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT walkerasarah comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT stoessernicole comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes