Cargando…

Protein structure similarity from principle component correlation analysis

BACKGROUND: Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Curr...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Xiaobo, Chou, James, Wong, Stephen TC
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1386710/
https://www.ncbi.nlm.nih.gov/pubmed/16436213
http://dx.doi.org/10.1186/1471-2105-7-40
_version_ 1782126882511126528
author Zhou, Xiaobo
Chou, James
Wong, Stephen TC
author_facet Zhou, Xiaobo
Chou, James
Wong, Stephen TC
author_sort Zhou, Xiaobo
collection PubMed
description BACKGROUND: Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Currently, the structural similarity between two proteins is measured by the root-mean-square-deviation (RMSD) in their best-superimposed atomic coordinates. RMSD is the golden rule of measuring structural similarity when the structures are nearly identical; it, however, fails to detect the higher order topological similarities in proteins evolved into different shapes. We propose new algorithms for extracting geometrical invariants of proteins that can be effectively used to identify homologous protein structures or topologies in order to quantify both close and remote structural similarities. RESULTS: We measure structural similarity between proteins by correlating the principle components of their secondary structure interaction matrix. In our approach, the Principle Component Correlation (PCC) analysis, a symmetric interaction matrix for a protein structure is constructed with relationship parameters between secondary elements that can take the form of distance, orientation, or other relevant structural invariants. When using a distance-based construction in the presence or absence of encoded N to C terminal sense, there are strong correlations between the principle components of interaction matrices of structurally or topologically similar proteins. CONCLUSION: The PCC method is extensively tested for protein structures that belong to the same topological class but are significantly different by RMSD measure. The PCC analysis can also differentiate proteins having similar shapes but different topological arrangements. Additionally, we demonstrate that when using two independently defined interaction matrices, comparison of their maximum eigenvalues can be highly effective in clustering structurally or topologically similar proteins. We believe that the PCC analysis of interaction matrix is highly flexible in adopting various structural parameters for protein structure comparison.
format Text
id pubmed-1386710
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-13867102006-04-21 Protein structure similarity from principle component correlation analysis Zhou, Xiaobo Chou, James Wong, Stephen TC BMC Bioinformatics Methodology Article BACKGROUND: Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Currently, the structural similarity between two proteins is measured by the root-mean-square-deviation (RMSD) in their best-superimposed atomic coordinates. RMSD is the golden rule of measuring structural similarity when the structures are nearly identical; it, however, fails to detect the higher order topological similarities in proteins evolved into different shapes. We propose new algorithms for extracting geometrical invariants of proteins that can be effectively used to identify homologous protein structures or topologies in order to quantify both close and remote structural similarities. RESULTS: We measure structural similarity between proteins by correlating the principle components of their secondary structure interaction matrix. In our approach, the Principle Component Correlation (PCC) analysis, a symmetric interaction matrix for a protein structure is constructed with relationship parameters between secondary elements that can take the form of distance, orientation, or other relevant structural invariants. When using a distance-based construction in the presence or absence of encoded N to C terminal sense, there are strong correlations between the principle components of interaction matrices of structurally or topologically similar proteins. CONCLUSION: The PCC method is extensively tested for protein structures that belong to the same topological class but are significantly different by RMSD measure. The PCC analysis can also differentiate proteins having similar shapes but different topological arrangements. Additionally, we demonstrate that when using two independently defined interaction matrices, comparison of their maximum eigenvalues can be highly effective in clustering structurally or topologically similar proteins. We believe that the PCC analysis of interaction matrix is highly flexible in adopting various structural parameters for protein structure comparison. BioMed Central 2006-01-25 /pmc/articles/PMC1386710/ /pubmed/16436213 http://dx.doi.org/10.1186/1471-2105-7-40 Text en Copyright © 2006 Zhou et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Zhou, Xiaobo
Chou, James
Wong, Stephen TC
Protein structure similarity from principle component correlation analysis
title Protein structure similarity from principle component correlation analysis
title_full Protein structure similarity from principle component correlation analysis
title_fullStr Protein structure similarity from principle component correlation analysis
title_full_unstemmed Protein structure similarity from principle component correlation analysis
title_short Protein structure similarity from principle component correlation analysis
title_sort protein structure similarity from principle component correlation analysis
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1386710/
https://www.ncbi.nlm.nih.gov/pubmed/16436213
http://dx.doi.org/10.1186/1471-2105-7-40
work_keys_str_mv AT zhouxiaobo proteinstructuresimilarityfromprinciplecomponentcorrelationanalysis
AT choujames proteinstructuresimilarityfromprinciplecomponentcorrelationanalysis
AT wongstephentc proteinstructuresimilarityfromprinciplecomponentcorrelationanalysis