Cargando…

Reducing assembly complexity of microbial genomes with single-molecule sequencing

BACKGROUND: The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation,...

Descripción completa

Detalles Bibliográficos
Autores principales: Koren, Sergey, Harhay, Gregory P, Smith, Timothy PL, Bono, James L, Harhay, Dayna M, Mcvey, Scott D, Radune, Diana, Bergman, Nicholas H, Phillippy, Adam M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4053942/
https://www.ncbi.nlm.nih.gov/pubmed/24034426
http://dx.doi.org/10.1186/gb-2013-14-9-r101
_version_ 1782320473243123712
author Koren, Sergey
Harhay, Gregory P
Smith, Timothy PL
Bono, James L
Harhay, Dayna M
Mcvey, Scott D
Radune, Diana
Bergman, Nicholas H
Phillippy, Adam M
author_facet Koren, Sergey
Harhay, Gregory P
Smith, Timothy PL
Bono, James L
Harhay, Dayna M
Mcvey, Scott D
Radune, Diana
Bergman, Nicholas H
Phillippy, Adam M
author_sort Koren, Sergey
collection PubMed
description BACKGROUND: The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem. RESULTS: To measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacteria and archaea. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads. CONCLUSIONS: Automated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to $1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization.
format Online
Article
Text
id pubmed-4053942
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40539422014-06-12 Reducing assembly complexity of microbial genomes with single-molecule sequencing Koren, Sergey Harhay, Gregory P Smith, Timothy PL Bono, James L Harhay, Dayna M Mcvey, Scott D Radune, Diana Bergman, Nicholas H Phillippy, Adam M Genome Biol Research BACKGROUND: The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem. RESULTS: To measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacteria and archaea. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads. CONCLUSIONS: Automated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to $1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization. BioMed Central 2013 2013-09-13 /pmc/articles/PMC4053942/ /pubmed/24034426 http://dx.doi.org/10.1186/gb-2013-14-9-r101 Text en Copyright © 2013 Koren et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Koren, Sergey
Harhay, Gregory P
Smith, Timothy PL
Bono, James L
Harhay, Dayna M
Mcvey, Scott D
Radune, Diana
Bergman, Nicholas H
Phillippy, Adam M
Reducing assembly complexity of microbial genomes with single-molecule sequencing
title Reducing assembly complexity of microbial genomes with single-molecule sequencing
title_full Reducing assembly complexity of microbial genomes with single-molecule sequencing
title_fullStr Reducing assembly complexity of microbial genomes with single-molecule sequencing
title_full_unstemmed Reducing assembly complexity of microbial genomes with single-molecule sequencing
title_short Reducing assembly complexity of microbial genomes with single-molecule sequencing
title_sort reducing assembly complexity of microbial genomes with single-molecule sequencing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4053942/
https://www.ncbi.nlm.nih.gov/pubmed/24034426
http://dx.doi.org/10.1186/gb-2013-14-9-r101
work_keys_str_mv AT korensergey reducingassemblycomplexityofmicrobialgenomeswithsinglemoleculesequencing
AT harhaygregoryp reducingassemblycomplexityofmicrobialgenomeswithsinglemoleculesequencing
AT smithtimothypl reducingassemblycomplexityofmicrobialgenomeswithsinglemoleculesequencing
AT bonojamesl reducingassemblycomplexityofmicrobialgenomeswithsinglemoleculesequencing
AT harhaydaynam reducingassemblycomplexityofmicrobialgenomeswithsinglemoleculesequencing
AT mcveyscottd reducingassemblycomplexityofmicrobialgenomeswithsinglemoleculesequencing
AT radunediana reducingassemblycomplexityofmicrobialgenomeswithsinglemoleculesequencing
AT bergmannicholash reducingassemblycomplexityofmicrobialgenomeswithsinglemoleculesequencing
AT phillippyadamm reducingassemblycomplexityofmicrobialgenomeswithsinglemoleculesequencing