Cargando…

Inferring the Deep Past from Molecular Data

There is an expectation that analyses of molecular sequences might be able to distinguish between alternative hypotheses for ancient relationships, but the phylogenetic methods used and types of data analyzed are of critical importance in any attempt to recover historical signal. Here, we discuss so...

Descripción completa

Detalles Bibliográficos
Autores principales: Williams, Tom A, Schrempf, Dominik, Szöllősi, Gergely J, Cox, Cymon J, Foster, Peter G, Embley, T Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8175050/
https://www.ncbi.nlm.nih.gov/pubmed/33772552
http://dx.doi.org/10.1093/gbe/evab067
_version_ 1783702979005644800
author Williams, Tom A
Schrempf, Dominik
Szöllősi, Gergely J
Cox, Cymon J
Foster, Peter G
Embley, T Martin
author_facet Williams, Tom A
Schrempf, Dominik
Szöllősi, Gergely J
Cox, Cymon J
Foster, Peter G
Embley, T Martin
author_sort Williams, Tom A
collection PubMed
description There is an expectation that analyses of molecular sequences might be able to distinguish between alternative hypotheses for ancient relationships, but the phylogenetic methods used and types of data analyzed are of critical importance in any attempt to recover historical signal. Here, we discuss some common issues that can influence the topology of trees obtained when using overly simple models to analyze molecular data that often display complicated patterns of sequence heterogeneity. To illustrate our discussion, we have used three examples of inferred relationships which have changed radically as models and methods of analysis have improved. In two of these examples, the sister-group relationship between thermophilic Thermus and mesophilic Deinococcus, and the position of long-branch Microsporidia among eukaryotes, we show that recovering what is now generally considered to be the correct tree is critically dependent on the fit between model and data. In the third example, the position of eukaryotes in the tree of life, the hypothesis that is currently supported by the best available methods is fundamentally different from the classical view of relationships between major cellular domains. Since heterogeneity appears to be pervasive and varied among all molecular sequence data, and even the best available models can still struggle to deal with some problems, the issues we discuss are generally relevant to phylogenetic analyses. It remains essential to maintain a critical attitude to all trees as hypotheses of relationship that may change with more data and better methods.
format Online
Article
Text
id pubmed-8175050
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-81750502021-06-04 Inferring the Deep Past from Molecular Data Williams, Tom A Schrempf, Dominik Szöllősi, Gergely J Cox, Cymon J Foster, Peter G Embley, T Martin Genome Biol Evol Review There is an expectation that analyses of molecular sequences might be able to distinguish between alternative hypotheses for ancient relationships, but the phylogenetic methods used and types of data analyzed are of critical importance in any attempt to recover historical signal. Here, we discuss some common issues that can influence the topology of trees obtained when using overly simple models to analyze molecular data that often display complicated patterns of sequence heterogeneity. To illustrate our discussion, we have used three examples of inferred relationships which have changed radically as models and methods of analysis have improved. In two of these examples, the sister-group relationship between thermophilic Thermus and mesophilic Deinococcus, and the position of long-branch Microsporidia among eukaryotes, we show that recovering what is now generally considered to be the correct tree is critically dependent on the fit between model and data. In the third example, the position of eukaryotes in the tree of life, the hypothesis that is currently supported by the best available methods is fundamentally different from the classical view of relationships between major cellular domains. Since heterogeneity appears to be pervasive and varied among all molecular sequence data, and even the best available models can still struggle to deal with some problems, the issues we discuss are generally relevant to phylogenetic analyses. It remains essential to maintain a critical attitude to all trees as hypotheses of relationship that may change with more data and better methods. Oxford University Press 2021-03-27 /pmc/articles/PMC8175050/ /pubmed/33772552 http://dx.doi.org/10.1093/gbe/evab067 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review
Williams, Tom A
Schrempf, Dominik
Szöllősi, Gergely J
Cox, Cymon J
Foster, Peter G
Embley, T Martin
Inferring the Deep Past from Molecular Data
title Inferring the Deep Past from Molecular Data
title_full Inferring the Deep Past from Molecular Data
title_fullStr Inferring the Deep Past from Molecular Data
title_full_unstemmed Inferring the Deep Past from Molecular Data
title_short Inferring the Deep Past from Molecular Data
title_sort inferring the deep past from molecular data
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8175050/
https://www.ncbi.nlm.nih.gov/pubmed/33772552
http://dx.doi.org/10.1093/gbe/evab067
work_keys_str_mv AT williamstoma inferringthedeeppastfrommoleculardata
AT schrempfdominik inferringthedeeppastfrommoleculardata
AT szollosigergelyj inferringthedeeppastfrommoleculardata
AT coxcymonj inferringthedeeppastfrommoleculardata
AT fosterpeterg inferringthedeeppastfrommoleculardata
AT embleytmartin inferringthedeeppastfrommoleculardata