Cargando…
Very few sites can reshape the inferred phylogenetic tree
The history of animal evolution, and the relative placement of extant animal phyla in this history is, in principle, testable from phylogenies derived from molecular sequence data. Though datasets have increased in size and quality in the past years, the contribution of individual genes (and ultimat...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7353913/ https://www.ncbi.nlm.nih.gov/pubmed/32714649 http://dx.doi.org/10.7717/peerj.8865 |
_version_ | 1783557984468598784 |
---|---|
author | Francis, Warren R. Canfield, Donald E. |
author_facet | Francis, Warren R. Canfield, Donald E. |
author_sort | Francis, Warren R. |
collection | PubMed |
description | The history of animal evolution, and the relative placement of extant animal phyla in this history is, in principle, testable from phylogenies derived from molecular sequence data. Though datasets have increased in size and quality in the past years, the contribution of individual genes (and ultimately amino acid sites) to the final phylogeny is unequal across genes. Here we demonstrate that removing a small fraction of sites strongly favoring one topology can produce a highly-supported tree of an alternate topology. We explore this approach using a dataset for animal phylogeny, and create a highly-supported tree with a monophyletic group of sponges and ctenophores, a topology not usually recovered. Because of the high sensitivity of such an analysis to gene selection, and because most gene sets are neither standardized nor representative of the entire genome, researchers should be diligent about making intermediate analyses available with their phylogenetic studies. Effort is needed to ensure these datasets are maximally informative, by ensuring all genes are systematically sampled across relevant species. From there, it could be determined whether any gene or gene sets introduce bias, and then deal with those biases appropriately. |
format | Online Article Text |
id | pubmed-7353913 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-73539132020-07-24 Very few sites can reshape the inferred phylogenetic tree Francis, Warren R. Canfield, Donald E. PeerJ Bioinformatics The history of animal evolution, and the relative placement of extant animal phyla in this history is, in principle, testable from phylogenies derived from molecular sequence data. Though datasets have increased in size and quality in the past years, the contribution of individual genes (and ultimately amino acid sites) to the final phylogeny is unequal across genes. Here we demonstrate that removing a small fraction of sites strongly favoring one topology can produce a highly-supported tree of an alternate topology. We explore this approach using a dataset for animal phylogeny, and create a highly-supported tree with a monophyletic group of sponges and ctenophores, a topology not usually recovered. Because of the high sensitivity of such an analysis to gene selection, and because most gene sets are neither standardized nor representative of the entire genome, researchers should be diligent about making intermediate analyses available with their phylogenetic studies. Effort is needed to ensure these datasets are maximally informative, by ensuring all genes are systematically sampled across relevant species. From there, it could be determined whether any gene or gene sets introduce bias, and then deal with those biases appropriately. PeerJ Inc. 2020-07-08 /pmc/articles/PMC7353913/ /pubmed/32714649 http://dx.doi.org/10.7717/peerj.8865 Text en ©2020 Francis and Canfield https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Francis, Warren R. Canfield, Donald E. Very few sites can reshape the inferred phylogenetic tree |
title | Very few sites can reshape the inferred phylogenetic tree |
title_full | Very few sites can reshape the inferred phylogenetic tree |
title_fullStr | Very few sites can reshape the inferred phylogenetic tree |
title_full_unstemmed | Very few sites can reshape the inferred phylogenetic tree |
title_short | Very few sites can reshape the inferred phylogenetic tree |
title_sort | very few sites can reshape the inferred phylogenetic tree |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7353913/ https://www.ncbi.nlm.nih.gov/pubmed/32714649 http://dx.doi.org/10.7717/peerj.8865 |
work_keys_str_mv | AT franciswarrenr veryfewsitescanreshapetheinferredphylogenetictree AT canfielddonalde veryfewsitescanreshapetheinferredphylogenetictree |