Cargando…

Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing

BACKGROUND: Short-read sequencing technologies have made microbial genome sequencing cheap and accessible. However, closing genomes is often costly and assembling short reads from genomes that are repetitive and/or have extreme %GC content remains challenging. Long-read, single-molecule sequencing t...

Descripción completa

Detalles Bibliográficos
Autores principales: Goldstein, Sarah, Beka, Lidia, Graf, Joerg, Klassen, Jonathan L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6325685/
https://www.ncbi.nlm.nih.gov/pubmed/30626323
http://dx.doi.org/10.1186/s12864-018-5381-7
_version_ 1783386167702454272
author Goldstein, Sarah
Beka, Lidia
Graf, Joerg
Klassen, Jonathan L.
author_facet Goldstein, Sarah
Beka, Lidia
Graf, Joerg
Klassen, Jonathan L.
author_sort Goldstein, Sarah
collection PubMed
description BACKGROUND: Short-read sequencing technologies have made microbial genome sequencing cheap and accessible. However, closing genomes is often costly and assembling short reads from genomes that are repetitive and/or have extreme %GC content remains challenging. Long-read, single-molecule sequencing technologies such as the Oxford Nanopore MinION have the potential to overcome these difficulties, although the best approach for harnessing their potential remains poorly evaluated. RESULTS: We sequenced nine bacterial genomes spanning a wide range of GC contents using Illumina MiSeq and Oxford Nanopore MinION sequencing technologies to determine the advantages of each approach, both individually and combined. Assemblies using only MiSeq reads were highly accurate but lacked contiguity, a deficiency that was partially overcome by adding MinION reads to these assemblies. Even more contiguous genome assemblies were generated by using MinION reads for initial assembly, but these assemblies were more error-prone and required further polishing. This was especially pronounced when Illumina libraries were biased, as was the case for our strains with both high and low GC content. Increased genome contiguity dramatically improved the annotation of insertion sequences and secondary metabolite biosynthetic gene clusters, likely because long-reads can disambiguate these highly repetitive but biologically important genomic regions. CONCLUSIONS: Genome assembly using short-reads is challenged by repetitive sequences and extreme GC contents. Our results indicate that these difficulties can be largely overcome by using single-molecule, long-read sequencing technologies such as the Oxford Nanopore MinION. Using MinION reads for assembly followed by polishing with Illumina reads generated the most contiguous genomes with sufficient accuracy to enable the accurate annotation of important but difficult to sequence genomic features such as insertion sequences and secondary metabolite biosynthetic gene clusters. The combination of Oxford Nanopore and Illumina sequencing can therefore cost-effectively advance studies of microbial evolution and genome-driven drug discovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5381-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6325685
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63256852019-01-11 Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing Goldstein, Sarah Beka, Lidia Graf, Joerg Klassen, Jonathan L. BMC Genomics Research Article BACKGROUND: Short-read sequencing technologies have made microbial genome sequencing cheap and accessible. However, closing genomes is often costly and assembling short reads from genomes that are repetitive and/or have extreme %GC content remains challenging. Long-read, single-molecule sequencing technologies such as the Oxford Nanopore MinION have the potential to overcome these difficulties, although the best approach for harnessing their potential remains poorly evaluated. RESULTS: We sequenced nine bacterial genomes spanning a wide range of GC contents using Illumina MiSeq and Oxford Nanopore MinION sequencing technologies to determine the advantages of each approach, both individually and combined. Assemblies using only MiSeq reads were highly accurate but lacked contiguity, a deficiency that was partially overcome by adding MinION reads to these assemblies. Even more contiguous genome assemblies were generated by using MinION reads for initial assembly, but these assemblies were more error-prone and required further polishing. This was especially pronounced when Illumina libraries were biased, as was the case for our strains with both high and low GC content. Increased genome contiguity dramatically improved the annotation of insertion sequences and secondary metabolite biosynthetic gene clusters, likely because long-reads can disambiguate these highly repetitive but biologically important genomic regions. CONCLUSIONS: Genome assembly using short-reads is challenged by repetitive sequences and extreme GC contents. Our results indicate that these difficulties can be largely overcome by using single-molecule, long-read sequencing technologies such as the Oxford Nanopore MinION. Using MinION reads for assembly followed by polishing with Illumina reads generated the most contiguous genomes with sufficient accuracy to enable the accurate annotation of important but difficult to sequence genomic features such as insertion sequences and secondary metabolite biosynthetic gene clusters. The combination of Oxford Nanopore and Illumina sequencing can therefore cost-effectively advance studies of microbial evolution and genome-driven drug discovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5381-7) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-09 /pmc/articles/PMC6325685/ /pubmed/30626323 http://dx.doi.org/10.1186/s12864-018-5381-7 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Goldstein, Sarah
Beka, Lidia
Graf, Joerg
Klassen, Jonathan L.
Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing
title Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing
title_full Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing
title_fullStr Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing
title_full_unstemmed Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing
title_short Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing
title_sort evaluation of strategies for the assembly of diverse bacterial genomes using minion long-read sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6325685/
https://www.ncbi.nlm.nih.gov/pubmed/30626323
http://dx.doi.org/10.1186/s12864-018-5381-7
work_keys_str_mv AT goldsteinsarah evaluationofstrategiesfortheassemblyofdiversebacterialgenomesusingminionlongreadsequencing
AT bekalidia evaluationofstrategiesfortheassemblyofdiversebacterialgenomesusingminionlongreadsequencing
AT grafjoerg evaluationofstrategiesfortheassemblyofdiversebacterialgenomesusingminionlongreadsequencing
AT klassenjonathanl evaluationofstrategiesfortheassemblyofdiversebacterialgenomesusingminionlongreadsequencing