Cargando…

Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy

Phylostratigraphy, originally designed for gene age estimation by BLAST-based protein homology searches of sequenced genomes, has been widely used for studying patterns and inferring mechanisms of gene origination and evolution. We previously showed by computer simulation that phylostratigraphy unde...

Descripción completa

Detalles Bibliográficos
Autores principales: Moyers, Bryan A., Zhang, Jianzhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5501971/
https://www.ncbi.nlm.nih.gov/pubmed/28637261
http://dx.doi.org/10.1093/gbe/evx109
_version_ 1783248885637971968
author Moyers, Bryan A.
Zhang, Jianzhi
author_facet Moyers, Bryan A.
Zhang, Jianzhi
author_sort Moyers, Bryan A.
collection PubMed
description Phylostratigraphy, originally designed for gene age estimation by BLAST-based protein homology searches of sequenced genomes, has been widely used for studying patterns and inferring mechanisms of gene origination and evolution. We previously showed by computer simulation that phylostratigraphy underestimates gene age for a nonnegligible fraction of genes and that the underestimation is severer for genes with certain properties such as fast evolution and short protein sequences. Consequently, many previously reported age distributions of gene properties may have been methodological artifacts rather than biological realities. Domazet-Lošo and colleagues recently argued that our simulations were flawed and that phylostratigraphic bias does not impact inferences about gene emergence and evolution. Here we discuss conceptual difficulties of phylostratigraphy, identify numerous problems in Domazet-Lošo et al.’s argument, reconfirm phylostratigraphic error using simulations suggested by Domazet-Lošo and colleagues, and demonstrate that a phylostratigraphic trend claimed to be robust to error disappears when genes likely to be error-resistant are analyzed. We conclude that extreme caution is needed in interpreting phylostratigraphic results because of the inherent biases of the method and that reanalysis using genes exhibiting no error in realistic simulations may help reduce spurious findings.
format Online
Article
Text
id pubmed-5501971
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-55019712017-07-12 Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy Moyers, Bryan A. Zhang, Jianzhi Genome Biol Evol Research Article Phylostratigraphy, originally designed for gene age estimation by BLAST-based protein homology searches of sequenced genomes, has been widely used for studying patterns and inferring mechanisms of gene origination and evolution. We previously showed by computer simulation that phylostratigraphy underestimates gene age for a nonnegligible fraction of genes and that the underestimation is severer for genes with certain properties such as fast evolution and short protein sequences. Consequently, many previously reported age distributions of gene properties may have been methodological artifacts rather than biological realities. Domazet-Lošo and colleagues recently argued that our simulations were flawed and that phylostratigraphic bias does not impact inferences about gene emergence and evolution. Here we discuss conceptual difficulties of phylostratigraphy, identify numerous problems in Domazet-Lošo et al.’s argument, reconfirm phylostratigraphic error using simulations suggested by Domazet-Lošo and colleagues, and demonstrate that a phylostratigraphic trend claimed to be robust to error disappears when genes likely to be error-resistant are analyzed. We conclude that extreme caution is needed in interpreting phylostratigraphic results because of the inherent biases of the method and that reanalysis using genes exhibiting no error in realistic simulations may help reduce spurious findings. Oxford University Press 2017-06-14 /pmc/articles/PMC5501971/ /pubmed/28637261 http://dx.doi.org/10.1093/gbe/evx109 Text en © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research Article
Moyers, Bryan A.
Zhang, Jianzhi
Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy
title Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy
title_full Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy
title_fullStr Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy
title_full_unstemmed Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy
title_short Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy
title_sort further simulations and analyses demonstrate open problems of phylostratigraphy
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5501971/
https://www.ncbi.nlm.nih.gov/pubmed/28637261
http://dx.doi.org/10.1093/gbe/evx109
work_keys_str_mv AT moyersbryana furthersimulationsandanalysesdemonstrateopenproblemsofphylostratigraphy
AT zhangjianzhi furthersimulationsandanalysesdemonstrateopenproblemsofphylostratigraphy