Cargando…
Inferring the Deep Past from Molecular Data
There is an expectation that analyses of molecular sequences might be able to distinguish between alternative hypotheses for ancient relationships, but the phylogenetic methods used and types of data analyzed are of critical importance in any attempt to recover historical signal. Here, we discuss so...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8175050/ https://www.ncbi.nlm.nih.gov/pubmed/33772552 http://dx.doi.org/10.1093/gbe/evab067 |
_version_ | 1783702979005644800 |
---|---|
author | Williams, Tom A Schrempf, Dominik Szöllősi, Gergely J Cox, Cymon J Foster, Peter G Embley, T Martin |
author_facet | Williams, Tom A Schrempf, Dominik Szöllősi, Gergely J Cox, Cymon J Foster, Peter G Embley, T Martin |
author_sort | Williams, Tom A |
collection | PubMed |
description | There is an expectation that analyses of molecular sequences might be able to distinguish between alternative hypotheses for ancient relationships, but the phylogenetic methods used and types of data analyzed are of critical importance in any attempt to recover historical signal. Here, we discuss some common issues that can influence the topology of trees obtained when using overly simple models to analyze molecular data that often display complicated patterns of sequence heterogeneity. To illustrate our discussion, we have used three examples of inferred relationships which have changed radically as models and methods of analysis have improved. In two of these examples, the sister-group relationship between thermophilic Thermus and mesophilic Deinococcus, and the position of long-branch Microsporidia among eukaryotes, we show that recovering what is now generally considered to be the correct tree is critically dependent on the fit between model and data. In the third example, the position of eukaryotes in the tree of life, the hypothesis that is currently supported by the best available methods is fundamentally different from the classical view of relationships between major cellular domains. Since heterogeneity appears to be pervasive and varied among all molecular sequence data, and even the best available models can still struggle to deal with some problems, the issues we discuss are generally relevant to phylogenetic analyses. It remains essential to maintain a critical attitude to all trees as hypotheses of relationship that may change with more data and better methods. |
format | Online Article Text |
id | pubmed-8175050 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-81750502021-06-04 Inferring the Deep Past from Molecular Data Williams, Tom A Schrempf, Dominik Szöllősi, Gergely J Cox, Cymon J Foster, Peter G Embley, T Martin Genome Biol Evol Review There is an expectation that analyses of molecular sequences might be able to distinguish between alternative hypotheses for ancient relationships, but the phylogenetic methods used and types of data analyzed are of critical importance in any attempt to recover historical signal. Here, we discuss some common issues that can influence the topology of trees obtained when using overly simple models to analyze molecular data that often display complicated patterns of sequence heterogeneity. To illustrate our discussion, we have used three examples of inferred relationships which have changed radically as models and methods of analysis have improved. In two of these examples, the sister-group relationship between thermophilic Thermus and mesophilic Deinococcus, and the position of long-branch Microsporidia among eukaryotes, we show that recovering what is now generally considered to be the correct tree is critically dependent on the fit between model and data. In the third example, the position of eukaryotes in the tree of life, the hypothesis that is currently supported by the best available methods is fundamentally different from the classical view of relationships between major cellular domains. Since heterogeneity appears to be pervasive and varied among all molecular sequence data, and even the best available models can still struggle to deal with some problems, the issues we discuss are generally relevant to phylogenetic analyses. It remains essential to maintain a critical attitude to all trees as hypotheses of relationship that may change with more data and better methods. Oxford University Press 2021-03-27 /pmc/articles/PMC8175050/ /pubmed/33772552 http://dx.doi.org/10.1093/gbe/evab067 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Review Williams, Tom A Schrempf, Dominik Szöllősi, Gergely J Cox, Cymon J Foster, Peter G Embley, T Martin Inferring the Deep Past from Molecular Data |
title | Inferring the Deep Past from Molecular Data |
title_full | Inferring the Deep Past from Molecular Data |
title_fullStr | Inferring the Deep Past from Molecular Data |
title_full_unstemmed | Inferring the Deep Past from Molecular Data |
title_short | Inferring the Deep Past from Molecular Data |
title_sort | inferring the deep past from molecular data |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8175050/ https://www.ncbi.nlm.nih.gov/pubmed/33772552 http://dx.doi.org/10.1093/gbe/evab067 |
work_keys_str_mv | AT williamstoma inferringthedeeppastfrommoleculardata AT schrempfdominik inferringthedeeppastfrommoleculardata AT szollosigergelyj inferringthedeeppastfrommoleculardata AT coxcymonj inferringthedeeppastfrommoleculardata AT fosterpeterg inferringthedeeppastfrommoleculardata AT embleytmartin inferringthedeeppastfrommoleculardata |