Cargando…

Efficient and scalable scaffolding using optical restriction maps

In the next generation sequencing techniques millions of short reads are produced from a genomic sequence at a single run. The chances of low read coverage to some regions of the sequence are very high. The reads are short and very large in number. Due to erroneous base calling, there could be error...

Descripción completa

Detalles Bibliográficos
Autores principales: Saha, Subrata, Rajasekaran, Sanguthevar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4120203/
https://www.ncbi.nlm.nih.gov/pubmed/25081913
http://dx.doi.org/10.1186/1471-2164-15-S5-S5
_version_ 1782329051356069888
author Saha, Subrata
Rajasekaran, Sanguthevar
author_facet Saha, Subrata
Rajasekaran, Sanguthevar
author_sort Saha, Subrata
collection PubMed
description In the next generation sequencing techniques millions of short reads are produced from a genomic sequence at a single run. The chances of low read coverage to some regions of the sequence are very high. The reads are short and very large in number. Due to erroneous base calling, there could be errors in the reads. As a consequence, sequence assemblers often fail to sequence an entire DNA molecule and instead output a set of overlapping segments that together represent a consensus region of the DNA. This set of overlapping segments are collectively called contigs in the literature. The final step of the sequencing process, called scaffolding, is to assemble the contigs into a correct order. Scaffolding techniques typically exploit additional information such as mate-pairs, pair-ends, or optical restriction maps. In this paper we introduce a series of novel algorithms for scaffolding that exploit optical restriction maps (ORMs). Simulation results show that our algorithms are indeed reliable, scalable, and efficient compared to the best known algorithms in the literature.
format Online
Article
Text
id pubmed-4120203
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41202032014-08-11 Efficient and scalable scaffolding using optical restriction maps Saha, Subrata Rajasekaran, Sanguthevar BMC Genomics Research In the next generation sequencing techniques millions of short reads are produced from a genomic sequence at a single run. The chances of low read coverage to some regions of the sequence are very high. The reads are short and very large in number. Due to erroneous base calling, there could be errors in the reads. As a consequence, sequence assemblers often fail to sequence an entire DNA molecule and instead output a set of overlapping segments that together represent a consensus region of the DNA. This set of overlapping segments are collectively called contigs in the literature. The final step of the sequencing process, called scaffolding, is to assemble the contigs into a correct order. Scaffolding techniques typically exploit additional information such as mate-pairs, pair-ends, or optical restriction maps. In this paper we introduce a series of novel algorithms for scaffolding that exploit optical restriction maps (ORMs). Simulation results show that our algorithms are indeed reliable, scalable, and efficient compared to the best known algorithms in the literature. BioMed Central 2014-07-14 /pmc/articles/PMC4120203/ /pubmed/25081913 http://dx.doi.org/10.1186/1471-2164-15-S5-S5 Text en Copyright © 2014 Saha and Rajasekaran; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Saha, Subrata
Rajasekaran, Sanguthevar
Efficient and scalable scaffolding using optical restriction maps
title Efficient and scalable scaffolding using optical restriction maps
title_full Efficient and scalable scaffolding using optical restriction maps
title_fullStr Efficient and scalable scaffolding using optical restriction maps
title_full_unstemmed Efficient and scalable scaffolding using optical restriction maps
title_short Efficient and scalable scaffolding using optical restriction maps
title_sort efficient and scalable scaffolding using optical restriction maps
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4120203/
https://www.ncbi.nlm.nih.gov/pubmed/25081913
http://dx.doi.org/10.1186/1471-2164-15-S5-S5
work_keys_str_mv AT sahasubrata efficientandscalablescaffoldingusingopticalrestrictionmaps
AT rajasekaransanguthevar efficientandscalablescaffoldingusingopticalrestrictionmaps