Cargando…
Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (...
Autores principales: | , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Microbiology Society
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6807382/ https://www.ncbi.nlm.nih.gov/pubmed/31483244 http://dx.doi.org/10.1099/mgen.0.000294 |
_version_ | 1783461708705038336 |
---|---|
author | De Maio, Nicola Shaw, Liam P. Hubbard, Alasdair George, Sophie Sanderson, Nicholas D. Swann, Jeremy Wick, Ryan AbuOun, Manal Stubberfield, Emma Hoosdally, Sarah J. Crook, Derrick W. Peto, Timothy E. A. Sheppard, Anna E. Bailey, Mark J. Read, Daniel S. Anjum, Muna F. Walker, A. Sarah Stoesser, Nicole |
author_facet | De Maio, Nicola Shaw, Liam P. Hubbard, Alasdair George, Sophie Sanderson, Nicholas D. Swann, Jeremy Wick, Ryan AbuOun, Manal Stubberfield, Emma Hoosdally, Sarah J. Crook, Derrick W. Peto, Timothy E. A. Sheppard, Anna E. Bailey, Mark J. Read, Daniel S. Anjum, Muna F. Walker, A. Sarah Stoesser, Nicole |
author_sort | De Maio, Nicola |
collection | PubMed |
description | Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (hybrid assembly). However, it is not clear how different long-read sequencing methods affect hybrid assembly accuracy. Relative automation of the assembly process is also crucial to facilitating high-throughput complete bacterial genome reconstruction, avoiding multiple bespoke filtering and data manipulation steps. In this study, we compared hybrid assemblies for 20 bacterial isolates, including two reference strains, using Illumina sequencing and long reads from either Oxford Nanopore Technologies (ONT) or SMRT Pacific Biosciences (PacBio) sequencing platforms. We chose isolates from the family Enterobacteriaceae, as these frequently have highly plastic, repetitive genetic structures, and complete genome reconstruction for these species is relevant for a precise understanding of the epidemiology of antimicrobial resistance. We de novo assembled genomes using the hybrid assembler Unicycler and compared different read processing strategies, as well as comparing to long-read-only assembly with Flye followed by short-read polishing with Pilon. Hybrid assembly with either PacBio or ONT reads facilitated high-quality genome reconstruction, and was superior to the long-read assembly and polishing approach evaluated with respect to accuracy and completeness. Combining ONT and Illumina reads fully resolved most genomes without additional manual steps, and at a lower consumables cost per isolate in our setting. Automated hybrid assembly is a powerful tool for complete and accurate bacterial genome assembly. |
format | Online Article Text |
id | pubmed-6807382 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Microbiology Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-68073822019-10-24 Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes De Maio, Nicola Shaw, Liam P. Hubbard, Alasdair George, Sophie Sanderson, Nicholas D. Swann, Jeremy Wick, Ryan AbuOun, Manal Stubberfield, Emma Hoosdally, Sarah J. Crook, Derrick W. Peto, Timothy E. A. Sheppard, Anna E. Bailey, Mark J. Read, Daniel S. Anjum, Muna F. Walker, A. Sarah Stoesser, Nicole Microb Genom Research Article Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (hybrid assembly). However, it is not clear how different long-read sequencing methods affect hybrid assembly accuracy. Relative automation of the assembly process is also crucial to facilitating high-throughput complete bacterial genome reconstruction, avoiding multiple bespoke filtering and data manipulation steps. In this study, we compared hybrid assemblies for 20 bacterial isolates, including two reference strains, using Illumina sequencing and long reads from either Oxford Nanopore Technologies (ONT) or SMRT Pacific Biosciences (PacBio) sequencing platforms. We chose isolates from the family Enterobacteriaceae, as these frequently have highly plastic, repetitive genetic structures, and complete genome reconstruction for these species is relevant for a precise understanding of the epidemiology of antimicrobial resistance. We de novo assembled genomes using the hybrid assembler Unicycler and compared different read processing strategies, as well as comparing to long-read-only assembly with Flye followed by short-read polishing with Pilon. Hybrid assembly with either PacBio or ONT reads facilitated high-quality genome reconstruction, and was superior to the long-read assembly and polishing approach evaluated with respect to accuracy and completeness. Combining ONT and Illumina reads fully resolved most genomes without additional manual steps, and at a lower consumables cost per isolate in our setting. Automated hybrid assembly is a powerful tool for complete and accurate bacterial genome assembly. Microbiology Society 2019-09-04 /pmc/articles/PMC6807382/ /pubmed/31483244 http://dx.doi.org/10.1099/mgen.0.000294 Text en © 2019 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article De Maio, Nicola Shaw, Liam P. Hubbard, Alasdair George, Sophie Sanderson, Nicholas D. Swann, Jeremy Wick, Ryan AbuOun, Manal Stubberfield, Emma Hoosdally, Sarah J. Crook, Derrick W. Peto, Timothy E. A. Sheppard, Anna E. Bailey, Mark J. Read, Daniel S. Anjum, Muna F. Walker, A. Sarah Stoesser, Nicole Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes |
title | Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes |
title_full | Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes |
title_fullStr | Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes |
title_full_unstemmed | Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes |
title_short | Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes |
title_sort | comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6807382/ https://www.ncbi.nlm.nih.gov/pubmed/31483244 http://dx.doi.org/10.1099/mgen.0.000294 |
work_keys_str_mv | AT demaionicola comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT shawliamp comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT hubbardalasdair comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT georgesophie comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT sandersonnicholasd comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT swannjeremy comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT wickryan comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT abuounmanal comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT stubberfieldemma comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT hoosdallysarahj comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT crookderrickw comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT petotimothyea comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT sheppardannae comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT baileymarkj comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT readdaniels comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT anjummunaf comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT walkerasarah comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT stoessernicole comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes AT comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes |