Cargando…
A fast exact sequential algorithm for the partial digest problem
BACKGROUND: Restriction site analysis involves determining the locations of restriction sites after the process of digestion by reconstructing their positions based on the lengths of the cut DNA. Using different reaction times with a single enzyme to cut DNA is a technique known as a partial digesti...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259970/ https://www.ncbi.nlm.nih.gov/pubmed/28155644 http://dx.doi.org/10.1186/s12859-016-1365-2 |
_version_ | 1782499314445058048 |
---|---|
author | Abbas, Mostafa M. Bahig, Hazem M. |
author_facet | Abbas, Mostafa M. Bahig, Hazem M. |
author_sort | Abbas, Mostafa M. |
collection | PubMed |
description | BACKGROUND: Restriction site analysis involves determining the locations of restriction sites after the process of digestion by reconstructing their positions based on the lengths of the cut DNA. Using different reaction times with a single enzyme to cut DNA is a technique known as a partial digestion. Determining the exact locations of restriction sites following a partial digestion is challenging due to the computational time required even with the best known practical algorithm. RESULTS: In this paper, we introduce an efficient algorithm to find the exact solution for the partial digest problem. The algorithm is able to find all possible solutions for the input and works by traversing the solution tree with a breadth-first search in two stages and deleting all repeated subproblems. Two types of simulated data, random and Zhang, are used to measure the efficiency of the algorithm. We also apply the algorithm to real data for the Luciferase gene and the E. coli K12 genome. CONCLUSION: Our algorithm is a fast tool to find the exact solution for the partial digest problem. The percentage of improvement is more than 75% over the best known practical algorithm for the worst case. For large numbers of inputs, our algorithm is able to solve the problem in a suitable time, while the best known practical algorithm is unable. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1365-2) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5259970 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52599702017-01-26 A fast exact sequential algorithm for the partial digest problem Abbas, Mostafa M. Bahig, Hazem M. BMC Bioinformatics Research BACKGROUND: Restriction site analysis involves determining the locations of restriction sites after the process of digestion by reconstructing their positions based on the lengths of the cut DNA. Using different reaction times with a single enzyme to cut DNA is a technique known as a partial digestion. Determining the exact locations of restriction sites following a partial digestion is challenging due to the computational time required even with the best known practical algorithm. RESULTS: In this paper, we introduce an efficient algorithm to find the exact solution for the partial digest problem. The algorithm is able to find all possible solutions for the input and works by traversing the solution tree with a breadth-first search in two stages and deleting all repeated subproblems. Two types of simulated data, random and Zhang, are used to measure the efficiency of the algorithm. We also apply the algorithm to real data for the Luciferase gene and the E. coli K12 genome. CONCLUSION: Our algorithm is a fast tool to find the exact solution for the partial digest problem. The percentage of improvement is more than 75% over the best known practical algorithm for the worst case. For large numbers of inputs, our algorithm is able to solve the problem in a suitable time, while the best known practical algorithm is unable. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1365-2) contains supplementary material, which is available to authorized users. BioMed Central 2016-12-22 /pmc/articles/PMC5259970/ /pubmed/28155644 http://dx.doi.org/10.1186/s12859-016-1365-2 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Abbas, Mostafa M. Bahig, Hazem M. A fast exact sequential algorithm for the partial digest problem |
title | A fast exact sequential algorithm for the partial digest problem |
title_full | A fast exact sequential algorithm for the partial digest problem |
title_fullStr | A fast exact sequential algorithm for the partial digest problem |
title_full_unstemmed | A fast exact sequential algorithm for the partial digest problem |
title_short | A fast exact sequential algorithm for the partial digest problem |
title_sort | fast exact sequential algorithm for the partial digest problem |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259970/ https://www.ncbi.nlm.nih.gov/pubmed/28155644 http://dx.doi.org/10.1186/s12859-016-1365-2 |
work_keys_str_mv | AT abbasmostafam afastexactsequentialalgorithmforthepartialdigestproblem AT bahighazemm afastexactsequentialalgorithmforthepartialdigestproblem AT abbasmostafam fastexactsequentialalgorithmforthepartialdigestproblem AT bahighazemm fastexactsequentialalgorithmforthepartialdigestproblem |