Cargando…
A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets
Multilocus sequence data provide far greater power to resolve species limits than the single locus data typically used for broad surveys of clades. However, current statistical methods based on a multispecies coalescent framework are computationally demanding, because of the number of possible delim...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4997007/ https://www.ncbi.nlm.nih.gov/pubmed/27055648 http://dx.doi.org/10.1093/sysbio/syw028 |
_version_ | 1782449688178327552 |
---|---|
author | Fujisawa, Tomochika Aswad, Amr Barraclough, Timothy G. |
author_facet | Fujisawa, Tomochika Aswad, Amr Barraclough, Timothy G. |
author_sort | Fujisawa, Tomochika |
collection | PubMed |
description | Multilocus sequence data provide far greater power to resolve species limits than the single locus data typically used for broad surveys of clades. However, current statistical methods based on a multispecies coalescent framework are computationally demanding, because of the number of possible delimitations that must be compared and time-consuming likelihood calculations. New methods are therefore needed to open up the power of multilocus approaches to larger systematic surveys. Here, we present a rapid and scalable method that introduces 2 new innovations. First, the method reduces the complexity of likelihood calculations by decomposing the tree into rooted triplets. The distribution of topologies for a triplet across multiple loci has a uniform trinomial distribution when the 3 individuals belong to the same species, but a skewed distribution if they belong to separate species with a form that is specified by the multispecies coalescent. A Bayesian model comparison framework was developed and the best delimitation found by comparing the product of posterior probabilities of all triplets. The second innovation is a new dynamic programming algorithm for finding the optimum delimitation from all those compatible with a guide tree by successively analyzing subtrees defined by each node. This algorithm removes the need for heuristic searches used by current methods, and guarantees that the best solution is found and potentially could be used in other systematic applications. We assessed the performance of the method with simulated, published, and newly generated data. Analyses of simulated data demonstrate that the combined method has favorable statistical properties and scalability with increasing sample sizes. Analyses of empirical data from both eukaryotes and prokaryotes demonstrate its potential for delimiting species in real cases. |
format | Online Article Text |
id | pubmed-4997007 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-49970072016-08-29 A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets Fujisawa, Tomochika Aswad, Amr Barraclough, Timothy G. Syst Biol Regular Articles Multilocus sequence data provide far greater power to resolve species limits than the single locus data typically used for broad surveys of clades. However, current statistical methods based on a multispecies coalescent framework are computationally demanding, because of the number of possible delimitations that must be compared and time-consuming likelihood calculations. New methods are therefore needed to open up the power of multilocus approaches to larger systematic surveys. Here, we present a rapid and scalable method that introduces 2 new innovations. First, the method reduces the complexity of likelihood calculations by decomposing the tree into rooted triplets. The distribution of topologies for a triplet across multiple loci has a uniform trinomial distribution when the 3 individuals belong to the same species, but a skewed distribution if they belong to separate species with a form that is specified by the multispecies coalescent. A Bayesian model comparison framework was developed and the best delimitation found by comparing the product of posterior probabilities of all triplets. The second innovation is a new dynamic programming algorithm for finding the optimum delimitation from all those compatible with a guide tree by successively analyzing subtrees defined by each node. This algorithm removes the need for heuristic searches used by current methods, and guarantees that the best solution is found and potentially could be used in other systematic applications. We assessed the performance of the method with simulated, published, and newly generated data. Analyses of simulated data demonstrate that the combined method has favorable statistical properties and scalability with increasing sample sizes. Analyses of empirical data from both eukaryotes and prokaryotes demonstrate its potential for delimiting species in real cases. Oxford University Press 2016-09 2016-04-07 /pmc/articles/PMC4997007/ /pubmed/27055648 http://dx.doi.org/10.1093/sysbio/syw028 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Regular Articles Fujisawa, Tomochika Aswad, Amr Barraclough, Timothy G. A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets |
title | A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets |
title_full | A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets |
title_fullStr | A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets |
title_full_unstemmed | A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets |
title_short | A Rapid and Scalable Method for Multilocus Species Delimitation Using Bayesian Model Comparison and Rooted Triplets |
title_sort | rapid and scalable method for multilocus species delimitation using bayesian model comparison and rooted triplets |
topic | Regular Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4997007/ https://www.ncbi.nlm.nih.gov/pubmed/27055648 http://dx.doi.org/10.1093/sysbio/syw028 |
work_keys_str_mv | AT fujisawatomochika arapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets AT aswadamr arapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets AT barracloughtimothyg arapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets AT fujisawatomochika rapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets AT aswadamr rapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets AT barracloughtimothyg rapidandscalablemethodformultilocusspeciesdelimitationusingbayesianmodelcomparisonandrootedtriplets |