An Algorithm for Gene Fragment Reconstruction

Gene sequencing technology plays an important role in many fields, such as life science, disease medicine and health medicine, particularly in the difficult fight against the 2019 novel coronavirus. Drawing a DNA restriction map is a particularly important technique in gen...

Bibliographic Details
Main Authors: Fang, Ningyuan; Wang, Kaifa; Tong, Dali
Format: Online Article Text
Language: English
Published: Springer Berlin Heidelberg 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7896547/
https://www.ncbi.nlm.nih.gov/pubmed/33609237
http://dx.doi.org/10.1007/s12539-021-00419-6
_version_ 1783653566097915904
author Fang, Ningyuan
Wang, Kaifa
Tong, Dali
author_facet Fang, Ningyuan
Wang, Kaifa
Tong, Dali
author_sort Fang, Ningyuan
collection PubMed
description Gene sequencing technology plays an important role in many fields, such as life science, disease medicine and health medicine, particularly in the difficult fight against the 2019 novel coronavirus. Drawing a DNA restriction map is a particularly important technique in genetic biology. The simplified partial digestion method (SPDP), a biological method, has been widely used to cut DNA molecules into fragments and obtain the biological information of each fragment. In this work, we propose an algorithm based on 0–1 planning for locating restriction sites on a DNA molecule, which solves the DNA fragment reconstruction problem using only fragment-length data. Two specific examples are presented in detail. Furthermore, based on 1000 randomly generated groups of original DNA sequences, we define the coincidence rate and the unique coincidence rate between the reconstructed and the original DNA sequence, and then analyze separately how the number of fragments and the maximum fragment length affect these two rates. The effectiveness of the algorithm is demonstrated. In addition, starting from the optimized solution obtained, we simulate and discuss the influence of errors by computation. It turns out that an error in the position of one restriction site does not affect the other restriction sites, whereas errors in most restriction sites may cause sequence reconstruction to fail. A Matlab 7.1 program is used to find feasible solutions for the locations of the restriction sites, derive the DNA fragment sequence, and carry out the statistical and error analyses. This paper focuses on the basic computer implementation of the rearrangement and sequencing algorithm rather than on biochemical technology.
The innovative application of the mathematical idea of 0–1 planning to the construction of DNA sequence maps reduces, to a certain extent, the difficulty and complexity of the calculation and accelerates the 'jigsaw' assembly of DNA fragments. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12539-021-00419-6.
format Online
Article
Text
id pubmed-7896547
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-7896547 2021-02-22 An Algorithm for Gene Fragment Reconstruction Fang, Ningyuan Wang, Kaifa Tong, Dali Interdiscip Sci Original Research Article Springer Berlin Heidelberg 2021-02-20 2021 /pmc/articles/PMC7896547/ /pubmed/33609237 http://dx.doi.org/10.1007/s12539-021-00419-6 Text en © International Association of Scientists in the Interdisciplinary Areas 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Research Article
Fang, Ningyuan
Wang, Kaifa
Tong, Dali
An Algorithm for Gene Fragment Reconstruction
title An Algorithm for Gene Fragment Reconstruction
title_full An Algorithm for Gene Fragment Reconstruction
title_fullStr An Algorithm for Gene Fragment Reconstruction
title_full_unstemmed An Algorithm for Gene Fragment Reconstruction
title_short An Algorithm for Gene Fragment Reconstruction
title_sort algorithm for gene fragment reconstruction
topic Original Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7896547/
https://www.ncbi.nlm.nih.gov/pubmed/33609237
http://dx.doi.org/10.1007/s12539-021-00419-6
work_keys_str_mv AT fangningyuan analgorithmforgenefragmentreconstruction
AT wangkaifa analgorithmforgenefragmentreconstruction
AT tongdali analgorithmforgenefragmentreconstruction
AT fangningyuan algorithmforgenefragmentreconstruction
AT wangkaifa algorithmforgenefragmentreconstruction
AT tongdali algorithmforgenefragmentreconstruction
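
Illustrative sketch (not part of the record, and not the authors' formulation): the description says restriction sites are located from fragment lengths alone through 0–1 decisions. The Python sketch below shows one way such a problem can be posed, assuming the usual simplified partial digest setting: a complete digest yields the lengths between adjacent sites, and each single-cut digest yields a pair of lengths (a, L - a) for one site. The function name reconstruct_sites, its parameters and the toy data are hypothetical, and plain enumeration stands in for an optimization solver, so this only suits small inputs.

```python
from collections import Counter
from itertools import product


def reconstruct_sites(total_length, single_cut_pairs, complete_digest):
    """Return every site layout consistent with the fragment-length data.

    Assumed inputs (hypothetical names, chosen for this illustration):
      total_length     -- length L of the DNA molecule
      single_cut_pairs -- one pair (a, L - a) per restriction site
      complete_digest  -- lengths of the fragments between adjacent sites
    """
    target = Counter(complete_digest)
    solutions = set()
    # One 0-1 variable per pair: 0 puts the site at pair[0], 1 at pair[1].
    for bits in product((0, 1), repeat=len(single_cut_pairs)):
        sites = sorted(pair[b] for pair, b in zip(single_cut_pairs, bits))
        if len(set(sites)) != len(sites):
            continue  # two sites cannot share a position
        bounds = [0] + sites + [total_length]
        fragments = [hi - lo for lo, hi in zip(bounds, bounds[1:])]
        # Keep the assignment only if it reproduces the complete digest.
        if Counter(fragments) == target:
            solutions.add(tuple(sites))
    return sorted(solutions)


# Toy data: a molecule of length 12 cut at positions 2, 5 and 9.
pairs = [(2, 10), (5, 7), (3, 9)]
digest = [2, 3, 4, 3]
print(reconstruct_sites(12, pairs, digest))  # [(2, 5, 9), (3, 7, 10)]
```

The toy run returns both the original layout and its mirror image, since reading the molecule from the other end produces exactly the same length data; length information alone cannot tell the two apart.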