Cargando…

Nature of Protein Family Signatures: Insights from Singular Value Analysis of Position-Specific Scoring Matrices

Position-specific scoring matrices (PSSMs) are useful for detecting weak homology in protein sequence analysis, and they are thought to contain some essential signatures of the protein families. In order to elucidate what kind of ingredients constitute such family-specific signatures, we apply singu...

Descripción completa

Detalles Bibliográficos
Autores principales: Kinjo, Akira R., Nakamura, Haruki
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2276316/
https://www.ncbi.nlm.nih.gov/pubmed/18398479
http://dx.doi.org/10.1371/journal.pone.0001963
_version_ 1782151989036056576
author Kinjo, Akira R.
Nakamura, Haruki
author_facet Kinjo, Akira R.
Nakamura, Haruki
author_sort Kinjo, Akira R.
collection PubMed
description Position-specific scoring matrices (PSSMs) are useful for detecting weak homology in protein sequence analysis, and they are thought to contain some essential signatures of the protein families. In order to elucidate what kind of ingredients constitute such family-specific signatures, we apply singular value decomposition to a set of PSSMs and examine the properties of dominant right and left singular vectors. The first right singular vectors were correlated with various amino acid indices including relative mutability, amino acid composition in protein interior, hydropathy, or turn propensity, depending on proteins. A significant correlation between the first left singular vector and a measure of site conservation was observed. It is shown that the contribution of the first singular component to the PSSMs act to disfavor potentially but falsely functionally important residues at conserved sites. The second right singular vectors were highly correlated with hydrophobicity scales, and the corresponding left singular vectors with contact numbers of protein structures. It is suggested that sequence alignment with a PSSM is essentially equivalent to threading supplemented with functional information. In addition, singular vectors may be useful for analyzing and annotating the characteristics of conserved sites in protein families.
format Text
id pubmed-2276316
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-22763162008-04-09 Nature of Protein Family Signatures: Insights from Singular Value Analysis of Position-Specific Scoring Matrices Kinjo, Akira R. Nakamura, Haruki PLoS One Research Article Position-specific scoring matrices (PSSMs) are useful for detecting weak homology in protein sequence analysis, and they are thought to contain some essential signatures of the protein families. In order to elucidate what kind of ingredients constitute such family-specific signatures, we apply singular value decomposition to a set of PSSMs and examine the properties of dominant right and left singular vectors. The first right singular vectors were correlated with various amino acid indices including relative mutability, amino acid composition in protein interior, hydropathy, or turn propensity, depending on proteins. A significant correlation between the first left singular vector and a measure of site conservation was observed. It is shown that the contribution of the first singular component to the PSSMs act to disfavor potentially but falsely functionally important residues at conserved sites. The second right singular vectors were highly correlated with hydrophobicity scales, and the corresponding left singular vectors with contact numbers of protein structures. It is suggested that sequence alignment with a PSSM is essentially equivalent to threading supplemented with functional information. In addition, singular vectors may be useful for analyzing and annotating the characteristics of conserved sites in protein families. Public Library of Science 2008-04-09 /pmc/articles/PMC2276316/ /pubmed/18398479 http://dx.doi.org/10.1371/journal.pone.0001963 Text en Kinjo, Nakamura. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kinjo, Akira R.
Nakamura, Haruki
Nature of Protein Family Signatures: Insights from Singular Value Analysis of Position-Specific Scoring Matrices
title Nature of Protein Family Signatures: Insights from Singular Value Analysis of Position-Specific Scoring Matrices
title_full Nature of Protein Family Signatures: Insights from Singular Value Analysis of Position-Specific Scoring Matrices
title_fullStr Nature of Protein Family Signatures: Insights from Singular Value Analysis of Position-Specific Scoring Matrices
title_full_unstemmed Nature of Protein Family Signatures: Insights from Singular Value Analysis of Position-Specific Scoring Matrices
title_short Nature of Protein Family Signatures: Insights from Singular Value Analysis of Position-Specific Scoring Matrices
title_sort nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2276316/
https://www.ncbi.nlm.nih.gov/pubmed/18398479
http://dx.doi.org/10.1371/journal.pone.0001963
work_keys_str_mv AT kinjoakirar natureofproteinfamilysignaturesinsightsfromsingularvalueanalysisofpositionspecificscoringmatrices
AT nakamuraharuki natureofproteinfamilysignaturesinsightsfromsingularvalueanalysisofpositionspecificscoringmatrices