Cargando…
Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy
Phylostratigraphy, originally designed for gene age estimation by BLAST-based protein homology searches of sequenced genomes, has been widely used for studying patterns and inferring mechanisms of gene origination and evolution. We previously showed by computer simulation that phylostratigraphy unde...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5501971/ https://www.ncbi.nlm.nih.gov/pubmed/28637261 http://dx.doi.org/10.1093/gbe/evx109 |
_version_ | 1783248885637971968 |
---|---|
author | Moyers, Bryan A. Zhang, Jianzhi |
author_facet | Moyers, Bryan A. Zhang, Jianzhi |
author_sort | Moyers, Bryan A. |
collection | PubMed |
description | Phylostratigraphy, originally designed for gene age estimation by BLAST-based protein homology searches of sequenced genomes, has been widely used for studying patterns and inferring mechanisms of gene origination and evolution. We previously showed by computer simulation that phylostratigraphy underestimates gene age for a nonnegligible fraction of genes and that the underestimation is severer for genes with certain properties such as fast evolution and short protein sequences. Consequently, many previously reported age distributions of gene properties may have been methodological artifacts rather than biological realities. Domazet-Lošo and colleagues recently argued that our simulations were flawed and that phylostratigraphic bias does not impact inferences about gene emergence and evolution. Here we discuss conceptual difficulties of phylostratigraphy, identify numerous problems in Domazet-Lošo et al.’s argument, reconfirm phylostratigraphic error using simulations suggested by Domazet-Lošo and colleagues, and demonstrate that a phylostratigraphic trend claimed to be robust to error disappears when genes likely to be error-resistant are analyzed. We conclude that extreme caution is needed in interpreting phylostratigraphic results because of the inherent biases of the method and that reanalysis using genes exhibiting no error in realistic simulations may help reduce spurious findings. |
format | Online Article Text |
id | pubmed-5501971 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-55019712017-07-12 Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy Moyers, Bryan A. Zhang, Jianzhi Genome Biol Evol Research Article Phylostratigraphy, originally designed for gene age estimation by BLAST-based protein homology searches of sequenced genomes, has been widely used for studying patterns and inferring mechanisms of gene origination and evolution. We previously showed by computer simulation that phylostratigraphy underestimates gene age for a nonnegligible fraction of genes and that the underestimation is severer for genes with certain properties such as fast evolution and short protein sequences. Consequently, many previously reported age distributions of gene properties may have been methodological artifacts rather than biological realities. Domazet-Lošo and colleagues recently argued that our simulations were flawed and that phylostratigraphic bias does not impact inferences about gene emergence and evolution. Here we discuss conceptual difficulties of phylostratigraphy, identify numerous problems in Domazet-Lošo et al.’s argument, reconfirm phylostratigraphic error using simulations suggested by Domazet-Lošo and colleagues, and demonstrate that a phylostratigraphic trend claimed to be robust to error disappears when genes likely to be error-resistant are analyzed. We conclude that extreme caution is needed in interpreting phylostratigraphic results because of the inherent biases of the method and that reanalysis using genes exhibiting no error in realistic simulations may help reduce spurious findings. Oxford University Press 2017-06-14 /pmc/articles/PMC5501971/ /pubmed/28637261 http://dx.doi.org/10.1093/gbe/evx109 Text en © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Research Article Moyers, Bryan A. Zhang, Jianzhi Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy |
title | Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy |
title_full | Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy |
title_fullStr | Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy |
title_full_unstemmed | Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy |
title_short | Further Simulations and Analyses Demonstrate Open Problems of Phylostratigraphy |
title_sort | further simulations and analyses demonstrate open problems of phylostratigraphy |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5501971/ https://www.ncbi.nlm.nih.gov/pubmed/28637261 http://dx.doi.org/10.1093/gbe/evx109 |
work_keys_str_mv | AT moyersbryana furthersimulationsandanalysesdemonstrateopenproblemsofphylostratigraphy AT zhangjianzhi furthersimulationsandanalysesdemonstrateopenproblemsofphylostratigraphy |