Cargando…

Circumstances in Which Parsimony but not Compatibility will be Provably Misleading

Phylogenetic methods typically rely on an appropriate model of how data evolved in order to infer an accurate phylogenetic tree. For molecular data, standard statistical methods have provided an effective strategy for extracting phylogenetic information from aligned sequence data when each site (cha...

Descripción completa

Detalles Bibliográficos
Autores principales: Scotland, Robert W., Steel, Mike
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4395848/
https://www.ncbi.nlm.nih.gov/pubmed/25634097
http://dx.doi.org/10.1093/sysbio/syv008
_version_ 1782366502896271360
author Scotland, Robert W.
Steel, Mike
author_facet Scotland, Robert W.
Steel, Mike
author_sort Scotland, Robert W.
collection PubMed
description Phylogenetic methods typically rely on an appropriate model of how data evolved in order to infer an accurate phylogenetic tree. For molecular data, standard statistical methods have provided an effective strategy for extracting phylogenetic information from aligned sequence data when each site (character) is subject to a common process. However, for other types of data (e.g., morphological data), characters can be too ambiguous, homoplastic, or saturated to develop models that are effective at capturing the underlying process of change. To address this, we examine the properties of a classic but neglected method for inferring splits in an underlying tree, namely, maximum compatibility. By adopting a simple and extreme model in which each character either fits perfectly on some tree, or is entirely random (but it is not known which class any character belongs to) we are able to derive exact and explicit formulae regarding the performance of maximum compatibility. We show that this method is able to identify a set of non-trivial homoplasy-free characters, when the number [Formula: see text] of taxa is large, even when the number of random characters is large. In contrast, we show that a method that makes more uniform use of all the data—maximum parsimony—can provably estimate trees in which none of the original homoplasy-free characters support splits.
format Online
Article
Text
id pubmed-4395848
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43958482015-04-15 Circumstances in Which Parsimony but not Compatibility will be Provably Misleading Scotland, Robert W. Steel, Mike Syst Biol Regular Articles Phylogenetic methods typically rely on an appropriate model of how data evolved in order to infer an accurate phylogenetic tree. For molecular data, standard statistical methods have provided an effective strategy for extracting phylogenetic information from aligned sequence data when each site (character) is subject to a common process. However, for other types of data (e.g., morphological data), characters can be too ambiguous, homoplastic, or saturated to develop models that are effective at capturing the underlying process of change. To address this, we examine the properties of a classic but neglected method for inferring splits in an underlying tree, namely, maximum compatibility. By adopting a simple and extreme model in which each character either fits perfectly on some tree, or is entirely random (but it is not known which class any character belongs to) we are able to derive exact and explicit formulae regarding the performance of maximum compatibility. We show that this method is able to identify a set of non-trivial homoplasy-free characters, when the number [Formula: see text] of taxa is large, even when the number of random characters is large. In contrast, we show that a method that makes more uniform use of all the data—maximum parsimony—can provably estimate trees in which none of the original homoplasy-free characters support splits. Oxford University Press 2015-05 2015-01-28 /pmc/articles/PMC4395848/ /pubmed/25634097 http://dx.doi.org/10.1093/sysbio/syv008 Text en © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Regular Articles
Scotland, Robert W.
Steel, Mike
Circumstances in Which Parsimony but not Compatibility will be Provably Misleading
title Circumstances in Which Parsimony but not Compatibility will be Provably Misleading
title_full Circumstances in Which Parsimony but not Compatibility will be Provably Misleading
title_fullStr Circumstances in Which Parsimony but not Compatibility will be Provably Misleading
title_full_unstemmed Circumstances in Which Parsimony but not Compatibility will be Provably Misleading
title_short Circumstances in Which Parsimony but not Compatibility will be Provably Misleading
title_sort circumstances in which parsimony but not compatibility will be provably misleading
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4395848/
https://www.ncbi.nlm.nih.gov/pubmed/25634097
http://dx.doi.org/10.1093/sysbio/syv008
work_keys_str_mv AT scotlandrobertw circumstancesinwhichparsimonybutnotcompatibilitywillbeprovablymisleading
AT steelmike circumstancesinwhichparsimonybutnotcompatibilitywillbeprovablymisleading