Cargando…

Probabilistic Distances Between Trees

Most existing measures of distance between phylogenetic trees are based on the geometry or topology of the trees. Instead, we consider distance measures which are based on the underlying probability distributions on genetic sequence data induced by trees. Monte Carlo schemes are necessary to calcula...

Descripción completa

Detalles Bibliográficos
Autores principales: Garba, Maryam K, Nye, Tom M W, Boys, Richard J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5837584/
https://www.ncbi.nlm.nih.gov/pubmed/29029295
http://dx.doi.org/10.1093/sysbio/syx080
_version_ 1783304122731069440
author Garba, Maryam K
Nye, Tom M W
Boys, Richard J
author_facet Garba, Maryam K
Nye, Tom M W
Boys, Richard J
author_sort Garba, Maryam K
collection PubMed
description Most existing measures of distance between phylogenetic trees are based on the geometry or topology of the trees. Instead, we consider distance measures which are based on the underlying probability distributions on genetic sequence data induced by trees. Monte Carlo schemes are necessary to calculate these distances approximately, and we describe efficient sampling procedures. Key features of the distances are the ability to include substitution model parameters and to handle trees with different taxon sets in a principled way. We demonstrate some of the properties of these new distance measures and compare them to existing distances, in particular by applying multidimensional scaling to data sets previously reported as containing phylogenetic islands. [Metric; probability distribution; multidimensional scaling; information geometry.
format Online
Article
Text
id pubmed-5837584
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58375842018-03-09 Probabilistic Distances Between Trees Garba, Maryam K Nye, Tom M W Boys, Richard J Syst Biol Regular Articles Most existing measures of distance between phylogenetic trees are based on the geometry or topology of the trees. Instead, we consider distance measures which are based on the underlying probability distributions on genetic sequence data induced by trees. Monte Carlo schemes are necessary to calculate these distances approximately, and we describe efficient sampling procedures. Key features of the distances are the ability to include substitution model parameters and to handle trees with different taxon sets in a principled way. We demonstrate some of the properties of these new distance measures and compare them to existing distances, in particular by applying multidimensional scaling to data sets previously reported as containing phylogenetic islands. [Metric; probability distribution; multidimensional scaling; information geometry. Oxford University Press 2018-03 2017-10-04 /pmc/articles/PMC5837584/ /pubmed/29029295 http://dx.doi.org/10.1093/sysbio/syx080 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biologists. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For Permissions, please email: journals.permissions@oup.com
spellingShingle Regular Articles
Garba, Maryam K
Nye, Tom M W
Boys, Richard J
Probabilistic Distances Between Trees
title Probabilistic Distances Between Trees
title_full Probabilistic Distances Between Trees
title_fullStr Probabilistic Distances Between Trees
title_full_unstemmed Probabilistic Distances Between Trees
title_short Probabilistic Distances Between Trees
title_sort probabilistic distances between trees
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5837584/
https://www.ncbi.nlm.nih.gov/pubmed/29029295
http://dx.doi.org/10.1093/sysbio/syx080
work_keys_str_mv AT garbamaryamk probabilisticdistancesbetweentrees
AT nyetommw probabilisticdistancesbetweentrees
AT boysrichardj probabilisticdistancesbetweentrees