Cargando…

FAMCS: Finding All Maximal Common Substructures in Proteins

Finding the common substructures shared by two proteins is considered as one of the central issues in computational biology because of its usefulness in understanding the structure-function relationship and application in drug and vaccine design. In this paper, we propose a novel algorithm called FA...

Descripción completa

Detalles Bibliográficos
Autores principales: Yao, Zhen, Xiao, Juan, Tung, Anthony K.H., Sung, Wing Kin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5172543/
https://www.ncbi.nlm.nih.gov/pubmed/16393147
http://dx.doi.org/10.1016/S1672-0229(05)03015-9
_version_ 1782484145765613568
author Yao, Zhen
Xiao, Juan
Tung, Anthony K.H.
Sung, Wing Kin
author_facet Yao, Zhen
Xiao, Juan
Tung, Anthony K.H.
Sung, Wing Kin
author_sort Yao, Zhen
collection PubMed
description Finding the common substructures shared by two proteins is considered as one of the central issues in computational biology because of its usefulness in understanding the structure-function relationship and application in drug and vaccine design. In this paper, we propose a novel algorithm called FAMCS (Finding All Maximal Common Substructures) for the common substructure identification problem. Our method works initially at the protein secondary structural element (SSE) level and starts with the identification of all structurally similar SSE pairs. These SSE pairs are then merged into sets using a modified Apriori algorithm, which will test the similarity of various sets of SSE pairs incrementally until all the maximal sets of SSE pairs that deemed to be similar are found. The maximal common substructures of the two proteins will be formed from these maximal sets. A refinement algorithm is also proposed to fine tune the alignment from the SSE level to the residue level. Comparison of FAMCS with other methods on various proteins shows that FAMCS can address all four requirements and infer interesting biological discoveries.
format Online
Article
Text
id pubmed-5172543
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-51725432016-12-23 FAMCS: Finding All Maximal Common Substructures in Proteins Yao, Zhen Xiao, Juan Tung, Anthony K.H. Sung, Wing Kin Genomics Proteomics Bioinformatics Article Finding the common substructures shared by two proteins is considered as one of the central issues in computational biology because of its usefulness in understanding the structure-function relationship and application in drug and vaccine design. In this paper, we propose a novel algorithm called FAMCS (Finding All Maximal Common Substructures) for the common substructure identification problem. Our method works initially at the protein secondary structural element (SSE) level and starts with the identification of all structurally similar SSE pairs. These SSE pairs are then merged into sets using a modified Apriori algorithm, which will test the similarity of various sets of SSE pairs incrementally until all the maximal sets of SSE pairs that deemed to be similar are found. The maximal common substructures of the two proteins will be formed from these maximal sets. A refinement algorithm is also proposed to fine tune the alignment from the SSE level to the residue level. Comparison of FAMCS with other methods on various proteins shows that FAMCS can address all four requirements and infer interesting biological discoveries. Elsevier 2005 2016-11-28 /pmc/articles/PMC5172543/ /pubmed/16393147 http://dx.doi.org/10.1016/S1672-0229(05)03015-9 Text en . http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Yao, Zhen
Xiao, Juan
Tung, Anthony K.H.
Sung, Wing Kin
FAMCS: Finding All Maximal Common Substructures in Proteins
title FAMCS: Finding All Maximal Common Substructures in Proteins
title_full FAMCS: Finding All Maximal Common Substructures in Proteins
title_fullStr FAMCS: Finding All Maximal Common Substructures in Proteins
title_full_unstemmed FAMCS: Finding All Maximal Common Substructures in Proteins
title_short FAMCS: Finding All Maximal Common Substructures in Proteins
title_sort famcs: finding all maximal common substructures in proteins
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5172543/
https://www.ncbi.nlm.nih.gov/pubmed/16393147
http://dx.doi.org/10.1016/S1672-0229(05)03015-9
work_keys_str_mv AT yaozhen famcsfindingallmaximalcommonsubstructuresinproteins
AT xiaojuan famcsfindingallmaximalcommonsubstructuresinproteins
AT tunganthonykh famcsfindingallmaximalcommonsubstructuresinproteins
AT sungwingkin famcsfindingallmaximalcommonsubstructuresinproteins