Cargando…

Statistical Phylogenetic Tree Analysis Using Differences of Means

We propose a statistical method to test whether two phylogenetic trees with given alignments are significantly incongruent. Our method compares the two distributions of phylogenetic trees given by two input alignments, instead of comparing point estimations of trees. This statistical approach can be...

Descripción completa

Detalles Bibliográficos
Autores principales: Arnaoudova, Elissaveta, Haws, David C., Huggins, Peter, Jaromczyk, Jerzy W., Moore, Neil, Schardl, Christopher L., Yoshida, Ruriko
Formato: Texto
Lenguaje:English
Publicado: Frontiers Research Foundation 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2928700/
https://www.ncbi.nlm.nih.gov/pubmed/20802801
http://dx.doi.org/10.3389/fnins.2010.00047
_version_ 1782185869818462208
author Arnaoudova, Elissaveta
Haws, David C.
Huggins, Peter
Jaromczyk, Jerzy W.
Moore, Neil
Schardl, Christopher L.
Yoshida, Ruriko
author_facet Arnaoudova, Elissaveta
Haws, David C.
Huggins, Peter
Jaromczyk, Jerzy W.
Moore, Neil
Schardl, Christopher L.
Yoshida, Ruriko
author_sort Arnaoudova, Elissaveta
collection PubMed
description We propose a statistical method to test whether two phylogenetic trees with given alignments are significantly incongruent. Our method compares the two distributions of phylogenetic trees given by two input alignments, instead of comparing point estimations of trees. This statistical approach can be applied to gene tree analysis for example, detecting unusual events in genome evolution such as horizontal gene transfer and reshuffling. Our method uses difference of means to compare two distributions of trees, after mapping trees into a vector space. Bootstrapping alignment columns can then be applied to obtain p-values. To compute distances between means, we employ a “kernel method” which speeds up distance calculations when trees are mapped in a high-dimensional feature space, e.g., splits or quartets feature space. In this pilot study, first we test our statistical method on data sets simulated under a coalescence model, to test whether two alignments are generated by congruent gene trees. We follow our simulation results with applications to data sets of gophers and lice, grasses and their endophytes, and different fungal genes from the same genome. A companion toolkit, Phylotree, is provided to facilitate computational experiments.
format Text
id pubmed-2928700
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Frontiers Research Foundation
record_format MEDLINE/PubMed
spelling pubmed-29287002010-08-27 Statistical Phylogenetic Tree Analysis Using Differences of Means Arnaoudova, Elissaveta Haws, David C. Huggins, Peter Jaromczyk, Jerzy W. Moore, Neil Schardl, Christopher L. Yoshida, Ruriko Front Neurosci Neuroscience We propose a statistical method to test whether two phylogenetic trees with given alignments are significantly incongruent. Our method compares the two distributions of phylogenetic trees given by two input alignments, instead of comparing point estimations of trees. This statistical approach can be applied to gene tree analysis for example, detecting unusual events in genome evolution such as horizontal gene transfer and reshuffling. Our method uses difference of means to compare two distributions of trees, after mapping trees into a vector space. Bootstrapping alignment columns can then be applied to obtain p-values. To compute distances between means, we employ a “kernel method” which speeds up distance calculations when trees are mapped in a high-dimensional feature space, e.g., splits or quartets feature space. In this pilot study, first we test our statistical method on data sets simulated under a coalescence model, to test whether two alignments are generated by congruent gene trees. We follow our simulation results with applications to data sets of gophers and lice, grasses and their endophytes, and different fungal genes from the same genome. A companion toolkit, Phylotree, is provided to facilitate computational experiments. Frontiers Research Foundation 2010-08-03 /pmc/articles/PMC2928700/ /pubmed/20802801 http://dx.doi.org/10.3389/fnins.2010.00047 Text en Copyright © 2010 Arnaoudova, Haws, Huggins, Jaromczyk, Moore, Schardl and Yoshida. http://www.frontiersin.org/licenseagreement This is an open-access article subject to an exclusive license agreement between the authors and the Frontiers Research Foundation, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are credited.
spellingShingle Neuroscience
Arnaoudova, Elissaveta
Haws, David C.
Huggins, Peter
Jaromczyk, Jerzy W.
Moore, Neil
Schardl, Christopher L.
Yoshida, Ruriko
Statistical Phylogenetic Tree Analysis Using Differences of Means
title Statistical Phylogenetic Tree Analysis Using Differences of Means
title_full Statistical Phylogenetic Tree Analysis Using Differences of Means
title_fullStr Statistical Phylogenetic Tree Analysis Using Differences of Means
title_full_unstemmed Statistical Phylogenetic Tree Analysis Using Differences of Means
title_short Statistical Phylogenetic Tree Analysis Using Differences of Means
title_sort statistical phylogenetic tree analysis using differences of means
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2928700/
https://www.ncbi.nlm.nih.gov/pubmed/20802801
http://dx.doi.org/10.3389/fnins.2010.00047
work_keys_str_mv AT arnaoudovaelissaveta statisticalphylogenetictreeanalysisusingdifferencesofmeans
AT hawsdavidc statisticalphylogenetictreeanalysisusingdifferencesofmeans
AT hugginspeter statisticalphylogenetictreeanalysisusingdifferencesofmeans
AT jaromczykjerzyw statisticalphylogenetictreeanalysisusingdifferencesofmeans
AT mooreneil statisticalphylogenetictreeanalysisusingdifferencesofmeans
AT schardlchristopherl statisticalphylogenetictreeanalysisusingdifferencesofmeans
AT yoshidaruriko statisticalphylogenetictreeanalysisusingdifferencesofmeans