Cargando…

A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets

Multilocus sequence data provide far greater power to resolve species limits than the single locus data typically used for broad surveys of clades. However, current statistical methods based on a multispecies coalescent framework are computationally demanding, because of the number of possible delim...

Descripción completa

Detalles Bibliográficos
Autores principales: Fujisawa, Tomochika, Aswad, Amr, Barraclough, Timothy G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4997007/
https://www.ncbi.nlm.nih.gov/pubmed/27055648
http://dx.doi.org/10.1093/sysbio/syw028
_version_ 1782449688178327552
author Fujisawa, Tomochika
Aswad, Amr
Barraclough, Timothy G.
author_facet Fujisawa, Tomochika
Aswad, Amr
Barraclough, Timothy G.
author_sort Fujisawa, Tomochika
collection PubMed
description Multilocus sequence data provide far greater power to resolve species limits than the single locus data typically used for broad surveys of clades. However, current statistical methods based on a multispecies coalescent framework are computationally demanding, because of the number of possible delimitations that must be compared and time-consuming likelihood calculations. New methods are therefore needed to open up the power of multilocus approaches to larger systematic surveys. Here, we present a rapid and scalable method that introduces 2 new innovations. First, the method reduces the complexity of likelihood calculations by decomposing the tree into rooted triplets. The distribution of topologies for a triplet across multiple loci has a uniform trinomial distribution when the 3 individuals belong to the same species, but a skewed distribution if they belong to separate species with a form that is specified by the multispecies coalescent. A Bayesian model comparison framework was developed and the best delimitation found by comparing the product of posterior probabilities of all triplets. The second innovation is a new dynamic programming algorithm for finding the optimum delimitation from all those compatible with a guide tree by successively analyzing subtrees defined by each node. This algorithm removes the need for heuristic searches used by current methods, and guarantees that the best solution is found and potentially could be used in other systematic applications. We assessed the performance of the method with simulated, published, and newly generated data. Analyses of simulated data demonstrate that the combined method has favorable statistical properties and scalability with increasing sample sizes. Analyses of empirical data from both eukaryotes and prokaryotes demonstrate its potential for delimiting species in real cases.
format Online
Article
Text
id pubmed-4997007
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49970072016-08-29 A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets Fujisawa, Tomochika Aswad, Amr Barraclough, Timothy G. Syst Biol Regular Articles Multilocus sequence data provide far greater power to resolve species limits than the single locus data typically used for broad surveys of clades. However, current statistical methods based on a multispecies coalescent framework are computationally demanding, because of the number of possible delimitations that must be compared and time-consuming likelihood calculations. New methods are therefore needed to open up the power of multilocus approaches to larger systematic surveys. Here, we present a rapid and scalable method that introduces 2 new innovations. First, the method reduces the complexity of likelihood calculations by decomposing the tree into rooted triplets. The distribution of topologies for a triplet across multiple loci has a uniform trinomial distribution when the 3 individuals belong to the same species, but a skewed distribution if they belong to separate species with a form that is specified by the multispecies coalescent. A Bayesian model comparison framework was developed and the best delimitation found by comparing the product of posterior probabilities of all triplets. The second innovation is a new dynamic programming algorithm for finding the optimum delimitation from all those compatible with a guide tree by successively analyzing subtrees defined by each node. This algorithm removes the need for heuristic searches used by current methods, and guarantees that the best solution is found and potentially could be used in other systematic applications. We assessed the performance of the method with simulated, published, and newly generated data. Analyses of simulated data demonstrate that the combined method has favorable statistical properties and scalability with increasing sample sizes. Analyses of empirical data from both eukaryotes and prokaryotes demonstrate its potential for delimiting species in real cases. Oxford University Press 2016-09 2016-04-07 /pmc/articles/PMC4997007/ /pubmed/27055648 http://dx.doi.org/10.1093/sysbio/syw028 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Regular Articles
Fujisawa, Tomochika
Aswad, Amr
Barraclough, Timothy G.
A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets
title A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets
title_full A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets
title_fullStr A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets
title_full_unstemmed A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets
title_short A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets
title_sort rapid and scalable method for multilocus species delimitation using bayesian model comparison and rooted triplets
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4997007/
https://www.ncbi.nlm.nih.gov/pubmed/27055648
http://dx.doi.org/10.1093/sysbio/syw028
work_keys_str_mv AT fujisawatomochika arapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets
AT aswadamr arapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets
AT barracloughtimothyg arapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets
AT fujisawatomochika rapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets
AT aswadamr rapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets
AT barracloughtimothyg rapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets