Cargando…

An algorithm for automated closure during assembly

BACKGROUND: Finishing is the process of improving the quality and utility of draft genome sequences generated by shotgun sequencing and computational assembly. Finishing can involve targeted sequencing. Finishing reads may be incorporated by manual or automated means. One automated method uses targe...

Descripción completa

Detalles Bibliográficos
Autores principales: Koren, Sergey, Miller, Jason R, Walenz, Brian P, Sutton, Granger
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2945939/
https://www.ncbi.nlm.nih.gov/pubmed/20831800
http://dx.doi.org/10.1186/1471-2105-11-457
_version_ 1782187252230651904
author Koren, Sergey
Miller, Jason R
Walenz, Brian P
Sutton, Granger
author_facet Koren, Sergey
Miller, Jason R
Walenz, Brian P
Sutton, Granger
author_sort Koren, Sergey
collection PubMed
description BACKGROUND: Finishing is the process of improving the quality and utility of draft genome sequences generated by shotgun sequencing and computational assembly. Finishing can involve targeted sequencing. Finishing reads may be incorporated by manual or automated means. One automated method uses targeted addition by local re-assembly of gap regions. An obvious alternative uses de novo assembly of all the reads. RESULTS: A procedure called the bounding read algorithm was developed for assembly of shotgun reads plus finishing reads and their constraints, targeting repeat regions. The algorithm was implemented within the Celera Assembler software and its pyrosequencing-specific variant, CABOG. The implementation was tested on Sanger and pyrosequencing data from six genomes. The bounding read assemblies were compared to assemblies from two other methods on the same data. The algorithm generates improved assemblies of repeat regions, closing and tiling some gaps while degrading none. CONCLUSIONS: The algorithm is useful for small-genome automated finishing projects. Our implementation is available as open-source from http://wgs-assembler.sourceforge.net under the GNU Public License.
format Text
id pubmed-2945939
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29459392010-10-21 An algorithm for automated closure during assembly Koren, Sergey Miller, Jason R Walenz, Brian P Sutton, Granger BMC Bioinformatics Research Article BACKGROUND: Finishing is the process of improving the quality and utility of draft genome sequences generated by shotgun sequencing and computational assembly. Finishing can involve targeted sequencing. Finishing reads may be incorporated by manual or automated means. One automated method uses targeted addition by local re-assembly of gap regions. An obvious alternative uses de novo assembly of all the reads. RESULTS: A procedure called the bounding read algorithm was developed for assembly of shotgun reads plus finishing reads and their constraints, targeting repeat regions. The algorithm was implemented within the Celera Assembler software and its pyrosequencing-specific variant, CABOG. The implementation was tested on Sanger and pyrosequencing data from six genomes. The bounding read assemblies were compared to assemblies from two other methods on the same data. The algorithm generates improved assemblies of repeat regions, closing and tiling some gaps while degrading none. CONCLUSIONS: The algorithm is useful for small-genome automated finishing projects. Our implementation is available as open-source from http://wgs-assembler.sourceforge.net under the GNU Public License. BioMed Central 2010-09-10 /pmc/articles/PMC2945939/ /pubmed/20831800 http://dx.doi.org/10.1186/1471-2105-11-457 Text en Copyright ©2010 Koren et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Koren, Sergey
Miller, Jason R
Walenz, Brian P
Sutton, Granger
An algorithm for automated closure during assembly
title An algorithm for automated closure during assembly
title_full An algorithm for automated closure during assembly
title_fullStr An algorithm for automated closure during assembly
title_full_unstemmed An algorithm for automated closure during assembly
title_short An algorithm for automated closure during assembly
title_sort algorithm for automated closure during assembly
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2945939/
https://www.ncbi.nlm.nih.gov/pubmed/20831800
http://dx.doi.org/10.1186/1471-2105-11-457
work_keys_str_mv AT korensergey analgorithmforautomatedclosureduringassembly
AT millerjasonr analgorithmforautomatedclosureduringassembly
AT walenzbrianp analgorithmforautomatedclosureduringassembly
AT suttongranger analgorithmforautomatedclosureduringassembly
AT korensergey algorithmforautomatedclosureduringassembly
AT millerjasonr algorithmforautomatedclosureduringassembly
AT walenzbrianp algorithmforautomatedclosureduringassembly
AT suttongranger algorithmforautomatedclosureduringassembly