Cargando…

Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms

In recent years, there has been an explosive increase in the amount of bioinformatics data produced, but data are not information. The purpose of bioinformatics research is to obtain information with biological significance from large amounts of data. Multiple sequence alignment is widely used in se...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Haihe, Zhang, Xuchu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7056898/
https://www.ncbi.nlm.nih.gov/pubmed/32174970
http://dx.doi.org/10.3389/fgene.2020.00105
_version_ 1783503555731128320
author Shi, Haihe
Zhang, Xuchu
author_facet Shi, Haihe
Zhang, Xuchu
author_sort Shi, Haihe
collection PubMed
description In recent years, there has been an explosive increase in the amount of bioinformatics data produced, but data are not information. The purpose of bioinformatics research is to obtain information with biological significance from large amounts of data. Multiple sequence alignment is widely used in sequence homology detection, protein secondary and tertiary structure prediction, phylogenetic tree analysis, and other fields. Existing research mainly focuses on the specific steps of the algorithm or on specific problems, and there is a lack of high-level abstract domain algorithm frameworks. As a result, multiple sequence alignment algorithms are complex, redundant, and difficult to understand, and it is not easy for users to select the appropriate algorithm, which may lead to computing errors. Here, through in-depth study and analysis of the heuristic multiple sequence alignment algorithm (HMSAA) domain, a domain-feature model and an interactive model of HMSAA components have been established according to the generative programming method. With the support of the PAR (partition and recur) platform, the HMSAA algorithm component library is formalized and a specific alignment algorithm is assembled, thus improving the reliability of algorithm assembly. This work provides a valuable theoretical reference for the applications of other biological sequence analysis algorithms.
format Online
Article
Text
id pubmed-7056898
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-70568982020-03-13 Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms Shi, Haihe Zhang, Xuchu Front Genet Genetics In recent years, there has been an explosive increase in the amount of bioinformatics data produced, but data are not information. The purpose of bioinformatics research is to obtain information with biological significance from large amounts of data. Multiple sequence alignment is widely used in sequence homology detection, protein secondary and tertiary structure prediction, phylogenetic tree analysis, and other fields. Existing research mainly focuses on the specific steps of the algorithm or on specific problems, and there is a lack of high-level abstract domain algorithm frameworks. As a result, multiple sequence alignment algorithms are complex, redundant, and difficult to understand, and it is not easy for users to select the appropriate algorithm, which may lead to computing errors. Here, through in-depth study and analysis of the heuristic multiple sequence alignment algorithm (HMSAA) domain, a domain-feature model and an interactive model of HMSAA components have been established according to the generative programming method. With the support of the PAR (partition and recur) platform, the HMSAA algorithm component library is formalized and a specific alignment algorithm is assembled, thus improving the reliability of algorithm assembly. This work provides a valuable theoretical reference for the applications of other biological sequence analysis algorithms. Frontiers Media S.A. 2020-02-27 /pmc/articles/PMC7056898/ /pubmed/32174970 http://dx.doi.org/10.3389/fgene.2020.00105 Text en Copyright © 2020 Shi and Zhang http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Shi, Haihe
Zhang, Xuchu
Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title_full Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title_fullStr Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title_full_unstemmed Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title_short Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title_sort component-based design and assembly of heuristic multiple sequence alignment algorithms
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7056898/
https://www.ncbi.nlm.nih.gov/pubmed/32174970
http://dx.doi.org/10.3389/fgene.2020.00105
work_keys_str_mv AT shihaihe componentbaseddesignandassemblyofheuristicmultiplesequencealignmentalgorithms
AT zhangxuchu componentbaseddesignandassemblyofheuristicmultiplesequencealignmentalgorithms