Cargando…

Surprising results on phylogenetic tree building methods based on molecular sequences

BACKGROUND: We analyze phylogenetic tree building methods from molecular sequences (PTMS). These are methods which base their construction solely on sequences, coding DNA or amino acids. RESULTS: Our first result is a statistically significant evaluation of 176 PTMSs done by comparing trees derived...

Descripción completa

Detalles Bibliográficos
Autor principal: Gonnet, Gaston H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3447733/
https://www.ncbi.nlm.nih.gov/pubmed/22738078
http://dx.doi.org/10.1186/1471-2105-13-148
_version_ 1782244152744869888
author Gonnet, Gaston H
author_facet Gonnet, Gaston H
author_sort Gonnet, Gaston H
collection PubMed
description BACKGROUND: We analyze phylogenetic tree building methods from molecular sequences (PTMS). These are methods which base their construction solely on sequences, coding DNA or amino acids. RESULTS: Our first result is a statistically significant evaluation of 176 PTMSs done by comparing trees derived from 193138 orthologous groups of proteins using a new measure of quality between trees. This new measure, called the Intra measure, is very consistent between different groups of species and strong in the sense that it separates the methods with high confidence. The second result is the comparison of the trees against trees derived from accepted taxonomies, the Taxon measure. We consider the NCBI taxonomic classification and their derived topologies as the most accepted biological consensus on phylogenies, which are also available in electronic form. The correlation between the two measures is remarkably high, which supports both measures simultaneously. CONCLUSIONS: The big surprise of the evaluation is that the maximum likelihood methods do not score well, minimal evolution distance methods over MSA-induced alignments score consistently better. This comparison also allows us to rank different components of the tree building methods, like MSAs, substitution matrices, ML tree builders, distance methods, etc. It is also clear that there is a difference between Metazoa and the rest, which points out to evolution leaving different molecular traces. We also think that these measures of quality of trees will motivate the design of new PTMSs as it is now easier to evaluate them with certainty.
format Online
Article
Text
id pubmed-3447733
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34477332012-09-25 Surprising results on phylogenetic tree building methods based on molecular sequences Gonnet, Gaston H BMC Bioinformatics Methodology Article BACKGROUND: We analyze phylogenetic tree building methods from molecular sequences (PTMS). These are methods which base their construction solely on sequences, coding DNA or amino acids. RESULTS: Our first result is a statistically significant evaluation of 176 PTMSs done by comparing trees derived from 193138 orthologous groups of proteins using a new measure of quality between trees. This new measure, called the Intra measure, is very consistent between different groups of species and strong in the sense that it separates the methods with high confidence. The second result is the comparison of the trees against trees derived from accepted taxonomies, the Taxon measure. We consider the NCBI taxonomic classification and their derived topologies as the most accepted biological consensus on phylogenies, which are also available in electronic form. The correlation between the two measures is remarkably high, which supports both measures simultaneously. CONCLUSIONS: The big surprise of the evaluation is that the maximum likelihood methods do not score well, minimal evolution distance methods over MSA-induced alignments score consistently better. This comparison also allows us to rank different components of the tree building methods, like MSAs, substitution matrices, ML tree builders, distance methods, etc. It is also clear that there is a difference between Metazoa and the rest, which points out to evolution leaving different molecular traces. We also think that these measures of quality of trees will motivate the design of new PTMSs as it is now easier to evaluate them with certainty. BioMed Central 2012-06-27 /pmc/articles/PMC3447733/ /pubmed/22738078 http://dx.doi.org/10.1186/1471-2105-13-148 Text en Copyright ©2012 Gonnet; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Gonnet, Gaston H
Surprising results on phylogenetic tree building methods based on molecular sequences
title Surprising results on phylogenetic tree building methods based on molecular sequences
title_full Surprising results on phylogenetic tree building methods based on molecular sequences
title_fullStr Surprising results on phylogenetic tree building methods based on molecular sequences
title_full_unstemmed Surprising results on phylogenetic tree building methods based on molecular sequences
title_short Surprising results on phylogenetic tree building methods based on molecular sequences
title_sort surprising results on phylogenetic tree building methods based on molecular sequences
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3447733/
https://www.ncbi.nlm.nih.gov/pubmed/22738078
http://dx.doi.org/10.1186/1471-2105-13-148
work_keys_str_mv AT gonnetgastonh surprisingresultsonphylogenetictreebuildingmethodsbasedonmolecularsequences