Cargando…
Protein structure similarity from principle component correlation analysis
BACKGROUND: Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Curr...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1386710/ https://www.ncbi.nlm.nih.gov/pubmed/16436213 http://dx.doi.org/10.1186/1471-2105-7-40 |
_version_ | 1782126882511126528 |
---|---|
author | Zhou, Xiaobo Chou, James Wong, Stephen TC |
author_facet | Zhou, Xiaobo Chou, James Wong, Stephen TC |
author_sort | Zhou, Xiaobo |
collection | PubMed |
description | BACKGROUND: Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Currently, the structural similarity between two proteins is measured by the root-mean-square-deviation (RMSD) in their best-superimposed atomic coordinates. RMSD is the golden rule of measuring structural similarity when the structures are nearly identical; it, however, fails to detect the higher order topological similarities in proteins evolved into different shapes. We propose new algorithms for extracting geometrical invariants of proteins that can be effectively used to identify homologous protein structures or topologies in order to quantify both close and remote structural similarities. RESULTS: We measure structural similarity between proteins by correlating the principle components of their secondary structure interaction matrix. In our approach, the Principle Component Correlation (PCC) analysis, a symmetric interaction matrix for a protein structure is constructed with relationship parameters between secondary elements that can take the form of distance, orientation, or other relevant structural invariants. When using a distance-based construction in the presence or absence of encoded N to C terminal sense, there are strong correlations between the principle components of interaction matrices of structurally or topologically similar proteins. CONCLUSION: The PCC method is extensively tested for protein structures that belong to the same topological class but are significantly different by RMSD measure. The PCC analysis can also differentiate proteins having similar shapes but different topological arrangements. Additionally, we demonstrate that when using two independently defined interaction matrices, comparison of their maximum eigenvalues can be highly effective in clustering structurally or topologically similar proteins. We believe that the PCC analysis of interaction matrix is highly flexible in adopting various structural parameters for protein structure comparison. |
format | Text |
id | pubmed-1386710 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-13867102006-04-21 Protein structure similarity from principle component correlation analysis Zhou, Xiaobo Chou, James Wong, Stephen TC BMC Bioinformatics Methodology Article BACKGROUND: Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Currently, the structural similarity between two proteins is measured by the root-mean-square-deviation (RMSD) in their best-superimposed atomic coordinates. RMSD is the golden rule of measuring structural similarity when the structures are nearly identical; it, however, fails to detect the higher order topological similarities in proteins evolved into different shapes. We propose new algorithms for extracting geometrical invariants of proteins that can be effectively used to identify homologous protein structures or topologies in order to quantify both close and remote structural similarities. RESULTS: We measure structural similarity between proteins by correlating the principle components of their secondary structure interaction matrix. In our approach, the Principle Component Correlation (PCC) analysis, a symmetric interaction matrix for a protein structure is constructed with relationship parameters between secondary elements that can take the form of distance, orientation, or other relevant structural invariants. When using a distance-based construction in the presence or absence of encoded N to C terminal sense, there are strong correlations between the principle components of interaction matrices of structurally or topologically similar proteins. CONCLUSION: The PCC method is extensively tested for protein structures that belong to the same topological class but are significantly different by RMSD measure. The PCC analysis can also differentiate proteins having similar shapes but different topological arrangements. Additionally, we demonstrate that when using two independently defined interaction matrices, comparison of their maximum eigenvalues can be highly effective in clustering structurally or topologically similar proteins. We believe that the PCC analysis of interaction matrix is highly flexible in adopting various structural parameters for protein structure comparison. BioMed Central 2006-01-25 /pmc/articles/PMC1386710/ /pubmed/16436213 http://dx.doi.org/10.1186/1471-2105-7-40 Text en Copyright © 2006 Zhou et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Zhou, Xiaobo Chou, James Wong, Stephen TC Protein structure similarity from principle component correlation analysis |
title | Protein structure similarity from principle component correlation analysis |
title_full | Protein structure similarity from principle component correlation analysis |
title_fullStr | Protein structure similarity from principle component correlation analysis |
title_full_unstemmed | Protein structure similarity from principle component correlation analysis |
title_short | Protein structure similarity from principle component correlation analysis |
title_sort | protein structure similarity from principle component correlation analysis |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1386710/ https://www.ncbi.nlm.nih.gov/pubmed/16436213 http://dx.doi.org/10.1186/1471-2105-7-40 |
work_keys_str_mv | AT zhouxiaobo proteinstructuresimilarityfromprinciplecomponentcorrelationanalysis AT choujames proteinstructuresimilarityfromprinciplecomponentcorrelationanalysis AT wongstephentc proteinstructuresimilarityfromprinciplecomponentcorrelationanalysis |