Cargando…

The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate

BACKGROUND: Protein sequences are subject to a mosaic of constraint. Changes to functional domains and buried residues, for example, are more apt to disrupt protein structure and function than are changes to residues participating in loops or exposed to solvent. Regions of constraint on the tertiary...

Descripción completa

Detalles Bibliográficos
Autores principales: McFerrin, Lisa G, Stone, Eric A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3176261/
https://www.ncbi.nlm.nih.gov/pubmed/21846337
http://dx.doi.org/10.1186/1471-2164-12-415
_version_ 1782212205984350208
author McFerrin, Lisa G
Stone, Eric A
author_facet McFerrin, Lisa G
Stone, Eric A
author_sort McFerrin, Lisa G
collection PubMed
description BACKGROUND: Protein sequences are subject to a mosaic of constraint. Changes to functional domains and buried residues, for example, are more apt to disrupt protein structure and function than are changes to residues participating in loops or exposed to solvent. Regions of constraint on the tertiary structure of a protein often result in loose segmentation of its primary structure into stretches of slowly- and rapidly-evolving amino acids. This clustering can be exploited, and existing methods have done so by relying on local sequence conservation as a signature of selection to help identify functionally important regions within proteins. We invert this paradigm by leveraging the regional nature of protein structure and function to both illuminate and make use of genome-wide patterns of local sequence conservation. RESULTS: Our hypothesis is that the regional nature of structural and functional constraints will assert a positive autocorrelation on the evolutionary rates of neighboring sites, which, in a pairwise comparison of orthologous proteins, will manifest itself as the clustering of non-synonymous changes across the amino acid sequence. We introduce a dispersion ratio statistic to test this and related hypotheses. Using genome-wide interspecific comparisons of orthologous protein pairs, we reveal a strong log-linear relationship between the degree of clustering and the intensity of constraint. We further demonstrate how this relationship varies with the evolutionary distance between the species being compared. We provide some evidence that proteins with a history of positive selection deviate from genome-wide trends. CONCLUSIONS: We find a significant association between the evolutionary rate of a protein and the degree to which non-synonymous changes cluster along its primary sequence. We show that clustering is a non-redundant predictor of evolutionary rate, and we speculate that conflicting signals of clustering and constraint may be indicative of a historical period of relaxed selection.
format Online
Article
Text
id pubmed-3176261
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31762612011-09-21 The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate McFerrin, Lisa G Stone, Eric A BMC Genomics Research Article BACKGROUND: Protein sequences are subject to a mosaic of constraint. Changes to functional domains and buried residues, for example, are more apt to disrupt protein structure and function than are changes to residues participating in loops or exposed to solvent. Regions of constraint on the tertiary structure of a protein often result in loose segmentation of its primary structure into stretches of slowly- and rapidly-evolving amino acids. This clustering can be exploited, and existing methods have done so by relying on local sequence conservation as a signature of selection to help identify functionally important regions within proteins. We invert this paradigm by leveraging the regional nature of protein structure and function to both illuminate and make use of genome-wide patterns of local sequence conservation. RESULTS: Our hypothesis is that the regional nature of structural and functional constraints will assert a positive autocorrelation on the evolutionary rates of neighboring sites, which, in a pairwise comparison of orthologous proteins, will manifest itself as the clustering of non-synonymous changes across the amino acid sequence. We introduce a dispersion ratio statistic to test this and related hypotheses. Using genome-wide interspecific comparisons of orthologous protein pairs, we reveal a strong log-linear relationship between the degree of clustering and the intensity of constraint. We further demonstrate how this relationship varies with the evolutionary distance between the species being compared. We provide some evidence that proteins with a history of positive selection deviate from genome-wide trends. CONCLUSIONS: We find a significant association between the evolutionary rate of a protein and the degree to which non-synonymous changes cluster along its primary sequence. We show that clustering is a non-redundant predictor of evolutionary rate, and we speculate that conflicting signals of clustering and constraint may be indicative of a historical period of relaxed selection. BioMed Central 2011-08-16 /pmc/articles/PMC3176261/ /pubmed/21846337 http://dx.doi.org/10.1186/1471-2164-12-415 Text en Copyright ©2011 McFerrin and Stone; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
McFerrin, Lisa G
Stone, Eric A
The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate
title The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate
title_full The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate
title_fullStr The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate
title_full_unstemmed The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate
title_short The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate
title_sort non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3176261/
https://www.ncbi.nlm.nih.gov/pubmed/21846337
http://dx.doi.org/10.1186/1471-2164-12-415
work_keys_str_mv AT mcferrinlisag thenonrandomclusteringofnonsynonymoussubstitutionsanditsrelationshiptoevolutionaryrate
AT stoneerica thenonrandomclusteringofnonsynonymoussubstitutionsanditsrelationshiptoevolutionaryrate
AT mcferrinlisag nonrandomclusteringofnonsynonymoussubstitutionsanditsrelationshiptoevolutionaryrate
AT stoneerica nonrandomclusteringofnonsynonymoussubstitutionsanditsrelationshiptoevolutionaryrate