Cargando…
The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate
BACKGROUND: Protein sequences are subject to a mosaic of constraint. Changes to functional domains and buried residues, for example, are more apt to disrupt protein structure and function than are changes to residues participating in loops or exposed to solvent. Regions of constraint on the tertiary...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3176261/ https://www.ncbi.nlm.nih.gov/pubmed/21846337 http://dx.doi.org/10.1186/1471-2164-12-415 |
_version_ | 1782212205984350208 |
---|---|
author | McFerrin, Lisa G Stone, Eric A |
author_facet | McFerrin, Lisa G Stone, Eric A |
author_sort | McFerrin, Lisa G |
collection | PubMed |
description | BACKGROUND: Protein sequences are subject to a mosaic of constraint. Changes to functional domains and buried residues, for example, are more apt to disrupt protein structure and function than are changes to residues participating in loops or exposed to solvent. Regions of constraint on the tertiary structure of a protein often result in loose segmentation of its primary structure into stretches of slowly- and rapidly-evolving amino acids. This clustering can be exploited, and existing methods have done so by relying on local sequence conservation as a signature of selection to help identify functionally important regions within proteins. We invert this paradigm by leveraging the regional nature of protein structure and function to both illuminate and make use of genome-wide patterns of local sequence conservation. RESULTS: Our hypothesis is that the regional nature of structural and functional constraints will assert a positive autocorrelation on the evolutionary rates of neighboring sites, which, in a pairwise comparison of orthologous proteins, will manifest itself as the clustering of non-synonymous changes across the amino acid sequence. We introduce a dispersion ratio statistic to test this and related hypotheses. Using genome-wide interspecific comparisons of orthologous protein pairs, we reveal a strong log-linear relationship between the degree of clustering and the intensity of constraint. We further demonstrate how this relationship varies with the evolutionary distance between the species being compared. We provide some evidence that proteins with a history of positive selection deviate from genome-wide trends. CONCLUSIONS: We find a significant association between the evolutionary rate of a protein and the degree to which non-synonymous changes cluster along its primary sequence. We show that clustering is a non-redundant predictor of evolutionary rate, and we speculate that conflicting signals of clustering and constraint may be indicative of a historical period of relaxed selection. |
format | Online Article Text |
id | pubmed-3176261 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31762612011-09-21 The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate McFerrin, Lisa G Stone, Eric A BMC Genomics Research Article BACKGROUND: Protein sequences are subject to a mosaic of constraint. Changes to functional domains and buried residues, for example, are more apt to disrupt protein structure and function than are changes to residues participating in loops or exposed to solvent. Regions of constraint on the tertiary structure of a protein often result in loose segmentation of its primary structure into stretches of slowly- and rapidly-evolving amino acids. This clustering can be exploited, and existing methods have done so by relying on local sequence conservation as a signature of selection to help identify functionally important regions within proteins. We invert this paradigm by leveraging the regional nature of protein structure and function to both illuminate and make use of genome-wide patterns of local sequence conservation. RESULTS: Our hypothesis is that the regional nature of structural and functional constraints will assert a positive autocorrelation on the evolutionary rates of neighboring sites, which, in a pairwise comparison of orthologous proteins, will manifest itself as the clustering of non-synonymous changes across the amino acid sequence. We introduce a dispersion ratio statistic to test this and related hypotheses. Using genome-wide interspecific comparisons of orthologous protein pairs, we reveal a strong log-linear relationship between the degree of clustering and the intensity of constraint. We further demonstrate how this relationship varies with the evolutionary distance between the species being compared. We provide some evidence that proteins with a history of positive selection deviate from genome-wide trends. CONCLUSIONS: We find a significant association between the evolutionary rate of a protein and the degree to which non-synonymous changes cluster along its primary sequence. We show that clustering is a non-redundant predictor of evolutionary rate, and we speculate that conflicting signals of clustering and constraint may be indicative of a historical period of relaxed selection. BioMed Central 2011-08-16 /pmc/articles/PMC3176261/ /pubmed/21846337 http://dx.doi.org/10.1186/1471-2164-12-415 Text en Copyright ©2011 McFerrin and Stone; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article McFerrin, Lisa G Stone, Eric A The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate |
title | The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate |
title_full | The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate |
title_fullStr | The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate |
title_full_unstemmed | The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate |
title_short | The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate |
title_sort | non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3176261/ https://www.ncbi.nlm.nih.gov/pubmed/21846337 http://dx.doi.org/10.1186/1471-2164-12-415 |
work_keys_str_mv | AT mcferrinlisag thenonrandomclusteringofnonsynonymoussubstitutionsanditsrelationshiptoevolutionaryrate AT stoneerica thenonrandomclusteringofnonsynonymoussubstitutionsanditsrelationshiptoevolutionaryrate AT mcferrinlisag nonrandomclusteringofnonsynonymoussubstitutionsanditsrelationshiptoevolutionaryrate AT stoneerica nonrandomclusteringofnonsynonymoussubstitutionsanditsrelationshiptoevolutionaryrate |