Cargando…

Very few sites can reshape the inferred phylogenetic tree

The history of animal evolution, and the relative placement of extant animal phyla in this history is, in principle, testable from phylogenies derived from molecular sequence data. Though datasets have increased in size and quality in the past years, the contribution of individual genes (and ultimat...

Descripción completa

Detalles Bibliográficos
Autores principales: Francis, Warren R., Canfield, Donald E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7353913/
https://www.ncbi.nlm.nih.gov/pubmed/32714649
http://dx.doi.org/10.7717/peerj.8865
_version_ 1783557984468598784
author Francis, Warren R.
Canfield, Donald E.
author_facet Francis, Warren R.
Canfield, Donald E.
author_sort Francis, Warren R.
collection PubMed
description The history of animal evolution, and the relative placement of extant animal phyla in this history is, in principle, testable from phylogenies derived from molecular sequence data. Though datasets have increased in size and quality in the past years, the contribution of individual genes (and ultimately amino acid sites) to the final phylogeny is unequal across genes. Here we demonstrate that removing a small fraction of sites strongly favoring one topology can produce a highly-supported tree of an alternate topology. We explore this approach using a dataset for animal phylogeny, and create a highly-supported tree with a monophyletic group of sponges and ctenophores, a topology not usually recovered. Because of the high sensitivity of such an analysis to gene selection, and because most gene sets are neither standardized nor representative of the entire genome, researchers should be diligent about making intermediate analyses available with their phylogenetic studies. Effort is needed to ensure these datasets are maximally informative, by ensuring all genes are systematically sampled across relevant species. From there, it could be determined whether any gene or gene sets introduce bias, and then deal with those biases appropriately.
format Online
Article
Text
id pubmed-7353913
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-73539132020-07-24 Very few sites can reshape the inferred phylogenetic tree Francis, Warren R. Canfield, Donald E. PeerJ Bioinformatics The history of animal evolution, and the relative placement of extant animal phyla in this history is, in principle, testable from phylogenies derived from molecular sequence data. Though datasets have increased in size and quality in the past years, the contribution of individual genes (and ultimately amino acid sites) to the final phylogeny is unequal across genes. Here we demonstrate that removing a small fraction of sites strongly favoring one topology can produce a highly-supported tree of an alternate topology. We explore this approach using a dataset for animal phylogeny, and create a highly-supported tree with a monophyletic group of sponges and ctenophores, a topology not usually recovered. Because of the high sensitivity of such an analysis to gene selection, and because most gene sets are neither standardized nor representative of the entire genome, researchers should be diligent about making intermediate analyses available with their phylogenetic studies. Effort is needed to ensure these datasets are maximally informative, by ensuring all genes are systematically sampled across relevant species. From there, it could be determined whether any gene or gene sets introduce bias, and then deal with those biases appropriately. PeerJ Inc. 2020-07-08 /pmc/articles/PMC7353913/ /pubmed/32714649 http://dx.doi.org/10.7717/peerj.8865 Text en ©2020 Francis and Canfield https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Francis, Warren R.
Canfield, Donald E.
Very few sites can reshape the inferred phylogenetic tree
title Very few sites can reshape the inferred phylogenetic tree
title_full Very few sites can reshape the inferred phylogenetic tree
title_fullStr Very few sites can reshape the inferred phylogenetic tree
title_full_unstemmed Very few sites can reshape the inferred phylogenetic tree
title_short Very few sites can reshape the inferred phylogenetic tree
title_sort very few sites can reshape the inferred phylogenetic tree
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7353913/
https://www.ncbi.nlm.nih.gov/pubmed/32714649
http://dx.doi.org/10.7717/peerj.8865
work_keys_str_mv AT franciswarrenr veryfewsitescanreshapetheinferredphylogenetictree
AT canfielddonalde veryfewsitescanreshapetheinferredphylogenetictree