Cargando…
Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
In recent years, there has been an explosive increase in the amount of bioinformatics data produced, but data are not information. The purpose of bioinformatics research is to obtain information with biological significance from large amounts of data. Multiple sequence alignment is widely used in se...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7056898/ https://www.ncbi.nlm.nih.gov/pubmed/32174970 http://dx.doi.org/10.3389/fgene.2020.00105 |
_version_ | 1783503555731128320 |
---|---|
author | Shi, Haihe Zhang, Xuchu |
author_facet | Shi, Haihe Zhang, Xuchu |
author_sort | Shi, Haihe |
collection | PubMed |
description | In recent years, there has been an explosive increase in the amount of bioinformatics data produced, but data are not information. The purpose of bioinformatics research is to obtain information with biological significance from large amounts of data. Multiple sequence alignment is widely used in sequence homology detection, protein secondary and tertiary structure prediction, phylogenetic tree analysis, and other fields. Existing research mainly focuses on the specific steps of the algorithm or on specific problems, and there is a lack of high-level abstract domain algorithm frameworks. As a result, multiple sequence alignment algorithms are complex, redundant, and difficult to understand, and it is not easy for users to select the appropriate algorithm, which may lead to computing errors. Here, through in-depth study and analysis of the heuristic multiple sequence alignment algorithm (HMSAA) domain, a domain-feature model and an interactive model of HMSAA components have been established according to the generative programming method. With the support of the PAR (partition and recur) platform, the HMSAA algorithm component library is formalized and a specific alignment algorithm is assembled, thus improving the reliability of algorithm assembly. This work provides a valuable theoretical reference for the applications of other biological sequence analysis algorithms. |
format | Online Article Text |
id | pubmed-7056898 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-70568982020-03-13 Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms Shi, Haihe Zhang, Xuchu Front Genet Genetics In recent years, there has been an explosive increase in the amount of bioinformatics data produced, but data are not information. The purpose of bioinformatics research is to obtain information with biological significance from large amounts of data. Multiple sequence alignment is widely used in sequence homology detection, protein secondary and tertiary structure prediction, phylogenetic tree analysis, and other fields. Existing research mainly focuses on the specific steps of the algorithm or on specific problems, and there is a lack of high-level abstract domain algorithm frameworks. As a result, multiple sequence alignment algorithms are complex, redundant, and difficult to understand, and it is not easy for users to select the appropriate algorithm, which may lead to computing errors. Here, through in-depth study and analysis of the heuristic multiple sequence alignment algorithm (HMSAA) domain, a domain-feature model and an interactive model of HMSAA components have been established according to the generative programming method. With the support of the PAR (partition and recur) platform, the HMSAA algorithm component library is formalized and a specific alignment algorithm is assembled, thus improving the reliability of algorithm assembly. This work provides a valuable theoretical reference for the applications of other biological sequence analysis algorithms. Frontiers Media S.A. 2020-02-27 /pmc/articles/PMC7056898/ /pubmed/32174970 http://dx.doi.org/10.3389/fgene.2020.00105 Text en Copyright © 2020 Shi and Zhang http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Shi, Haihe Zhang, Xuchu Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms |
title | Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms |
title_full | Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms |
title_fullStr | Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms |
title_full_unstemmed | Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms |
title_short | Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms |
title_sort | component-based design and assembly of heuristic multiple sequence alignment algorithms |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7056898/ https://www.ncbi.nlm.nih.gov/pubmed/32174970 http://dx.doi.org/10.3389/fgene.2020.00105 |
work_keys_str_mv | AT shihaihe componentbaseddesignandassemblyofheuristicmultiplesequencealignmentalgorithms AT zhangxuchu componentbaseddesignandassemblyofheuristicmultiplesequencealignmentalgorithms |