Cargando…

Variation in DNA Substitution Rates among Lineages Erroneously Inferred from Simulated Clock-Like Data

BACKGROUND: The observation of variation in substitution rates among lineages has led to (1) a general rejection of the molecular clock model, and (2) the suggestion that a number of biological characteristics of organisms can cause rate variation. Accurate estimates of rate variation, and thus accu...

Descripción completa

Detalles Bibliográficos
Autores principales: Schwartz, Rachel S., Mueller, Rachel Lockridge
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2836374/
https://www.ncbi.nlm.nih.gov/pubmed/20300176
http://dx.doi.org/10.1371/journal.pone.0009649
_version_ 1782178713434062848
author Schwartz, Rachel S.
Mueller, Rachel Lockridge
author_facet Schwartz, Rachel S.
Mueller, Rachel Lockridge
author_sort Schwartz, Rachel S.
collection PubMed
description BACKGROUND: The observation of variation in substitution rates among lineages has led to (1) a general rejection of the molecular clock model, and (2) the suggestion that a number of biological characteristics of organisms can cause rate variation. Accurate estimates of rate variation, and thus accurate inferences regarding the causes of rate variation, depend on accurate estimates of substitution rates. However, theory suggests that even when the substitution process is clock-like, variable numbers of substitutions can occur among lineages because the substitution process is stochastic. Furthermore, substitution rates along lineages can be misestimated, particularly when multiple substitutions occur at some sites. Although these potential causes of error in rate estimation are well understood in theory, such error has not been examined in detail; consequently, empirical studies that estimate rate variation among lineages have been unable to determine whether their results could be impacted by estimation error. METHODOLOGY/PRINCIPAL FINDINGS: To evaluate the extent to which error in rate estimation could erroneously suggest rate variation among lineages, we examined rate variation estimated for datasets simulated under a molecular clock on trees with equal and variable branch lengths. Thus, any apparent rate variation in these datasets reflects error in rate estimation rather than true differences in the underlying substitution process. We observed substantial rate variation among lineages in our simulations; however, we did not observe rate variation when average substitution rates were compared between different clades. CONCLUSIONS/SIGNIFICANCE: Our results confirm previous theoretical work suggesting that observations of among lineage rate variation in empirical data may be due to the stochastic substitution process and error in the estimation of substitution rates, rather than true differences in the underlying substitution process among lineages. However, conclusions regarding rate variation drawn from rates averaged across multiple branches are likely due to real, systematic variation in rates between groups.
format Text
id pubmed-2836374
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28363742010-03-19 Variation in DNA Substitution Rates among Lineages Erroneously Inferred from Simulated Clock-Like Data Schwartz, Rachel S. Mueller, Rachel Lockridge PLoS One Research Article BACKGROUND: The observation of variation in substitution rates among lineages has led to (1) a general rejection of the molecular clock model, and (2) the suggestion that a number of biological characteristics of organisms can cause rate variation. Accurate estimates of rate variation, and thus accurate inferences regarding the causes of rate variation, depend on accurate estimates of substitution rates. However, theory suggests that even when the substitution process is clock-like, variable numbers of substitutions can occur among lineages because the substitution process is stochastic. Furthermore, substitution rates along lineages can be misestimated, particularly when multiple substitutions occur at some sites. Although these potential causes of error in rate estimation are well understood in theory, such error has not been examined in detail; consequently, empirical studies that estimate rate variation among lineages have been unable to determine whether their results could be impacted by estimation error. METHODOLOGY/PRINCIPAL FINDINGS: To evaluate the extent to which error in rate estimation could erroneously suggest rate variation among lineages, we examined rate variation estimated for datasets simulated under a molecular clock on trees with equal and variable branch lengths. Thus, any apparent rate variation in these datasets reflects error in rate estimation rather than true differences in the underlying substitution process. We observed substantial rate variation among lineages in our simulations; however, we did not observe rate variation when average substitution rates were compared between different clades. CONCLUSIONS/SIGNIFICANCE: Our results confirm previous theoretical work suggesting that observations of among lineage rate variation in empirical data may be due to the stochastic substitution process and error in the estimation of substitution rates, rather than true differences in the underlying substitution process among lineages. However, conclusions regarding rate variation drawn from rates averaged across multiple branches are likely due to real, systematic variation in rates between groups. Public Library of Science 2010-03-11 /pmc/articles/PMC2836374/ /pubmed/20300176 http://dx.doi.org/10.1371/journal.pone.0009649 Text en Schwartz, Mueller. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Schwartz, Rachel S.
Mueller, Rachel Lockridge
Variation in DNA Substitution Rates among Lineages Erroneously Inferred from Simulated Clock-Like Data
title Variation in DNA Substitution Rates among Lineages Erroneously Inferred from Simulated Clock-Like Data
title_full Variation in DNA Substitution Rates among Lineages Erroneously Inferred from Simulated Clock-Like Data
title_fullStr Variation in DNA Substitution Rates among Lineages Erroneously Inferred from Simulated Clock-Like Data
title_full_unstemmed Variation in DNA Substitution Rates among Lineages Erroneously Inferred from Simulated Clock-Like Data
title_short Variation in DNA Substitution Rates among Lineages Erroneously Inferred from Simulated Clock-Like Data
title_sort variation in dna substitution rates among lineages erroneously inferred from simulated clock-like data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2836374/
https://www.ncbi.nlm.nih.gov/pubmed/20300176
http://dx.doi.org/10.1371/journal.pone.0009649
work_keys_str_mv AT schwartzrachels variationindnasubstitutionratesamonglineageserroneouslyinferredfromsimulatedclocklikedata
AT muellerrachellockridge variationindnasubstitutionratesamonglineageserroneouslyinferredfromsimulatedclocklikedata