Cargando…

A fast exact sequential algorithm for the partial digest problem

BACKGROUND: Restriction site analysis involves determining the locations of restriction sites after the process of digestion by reconstructing their positions based on the lengths of the cut DNA. Using different reaction times with a single enzyme to cut DNA is a technique known as a partial digesti...

Descripción completa

Detalles Bibliográficos
Autores principales: Abbas, Mostafa M., Bahig, Hazem M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259970/
https://www.ncbi.nlm.nih.gov/pubmed/28155644
http://dx.doi.org/10.1186/s12859-016-1365-2
_version_ 1782499314445058048
author Abbas, Mostafa M.
Bahig, Hazem M.
author_facet Abbas, Mostafa M.
Bahig, Hazem M.
author_sort Abbas, Mostafa M.
collection PubMed
description BACKGROUND: Restriction site analysis involves determining the locations of restriction sites after the process of digestion by reconstructing their positions based on the lengths of the cut DNA. Using different reaction times with a single enzyme to cut DNA is a technique known as a partial digestion. Determining the exact locations of restriction sites following a partial digestion is challenging due to the computational time required even with the best known practical algorithm. RESULTS: In this paper, we introduce an efficient algorithm to find the exact solution for the partial digest problem. The algorithm is able to find all possible solutions for the input and works by traversing the solution tree with a breadth-first search in two stages and deleting all repeated subproblems. Two types of simulated data, random and Zhang, are used to measure the efficiency of the algorithm. We also apply the algorithm to real data for the Luciferase gene and the E. coli K12 genome. CONCLUSION: Our algorithm is a fast tool to find the exact solution for the partial digest problem. The percentage of improvement is more than 75% over the best known practical algorithm for the worst case. For large numbers of inputs, our algorithm is able to solve the problem in a suitable time, while the best known practical algorithm is unable. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1365-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5259970
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52599702017-01-26 A fast exact sequential algorithm for the partial digest problem Abbas, Mostafa M. Bahig, Hazem M. BMC Bioinformatics Research BACKGROUND: Restriction site analysis involves determining the locations of restriction sites after the process of digestion by reconstructing their positions based on the lengths of the cut DNA. Using different reaction times with a single enzyme to cut DNA is a technique known as a partial digestion. Determining the exact locations of restriction sites following a partial digestion is challenging due to the computational time required even with the best known practical algorithm. RESULTS: In this paper, we introduce an efficient algorithm to find the exact solution for the partial digest problem. The algorithm is able to find all possible solutions for the input and works by traversing the solution tree with a breadth-first search in two stages and deleting all repeated subproblems. Two types of simulated data, random and Zhang, are used to measure the efficiency of the algorithm. We also apply the algorithm to real data for the Luciferase gene and the E. coli K12 genome. CONCLUSION: Our algorithm is a fast tool to find the exact solution for the partial digest problem. The percentage of improvement is more than 75% over the best known practical algorithm for the worst case. For large numbers of inputs, our algorithm is able to solve the problem in a suitable time, while the best known practical algorithm is unable. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1365-2) contains supplementary material, which is available to authorized users. BioMed Central 2016-12-22 /pmc/articles/PMC5259970/ /pubmed/28155644 http://dx.doi.org/10.1186/s12859-016-1365-2 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Abbas, Mostafa M.
Bahig, Hazem M.
A fast exact sequential algorithm for the partial digest problem
title A fast exact sequential algorithm for the partial digest problem
title_full A fast exact sequential algorithm for the partial digest problem
title_fullStr A fast exact sequential algorithm for the partial digest problem
title_full_unstemmed A fast exact sequential algorithm for the partial digest problem
title_short A fast exact sequential algorithm for the partial digest problem
title_sort fast exact sequential algorithm for the partial digest problem
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259970/
https://www.ncbi.nlm.nih.gov/pubmed/28155644
http://dx.doi.org/10.1186/s12859-016-1365-2
work_keys_str_mv AT abbasmostafam afastexactsequentialalgorithmforthepartialdigestproblem
AT bahighazemm afastexactsequentialalgorithmforthepartialdigestproblem
AT abbasmostafam fastexactsequentialalgorithmforthepartialdigestproblem
AT bahighazemm fastexactsequentialalgorithmforthepartialdigestproblem