Cargando…
The Spectre of Too Many Species
Recent simulation studies examining the performance of Bayesian species delimitation as implemented in the bpp program have suggested that bpp may detect population splits but not species divergences and that it tends to over-split when data of many loci are analyzed. Here, we confirm these results...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6292489/ https://www.ncbi.nlm.nih.gov/pubmed/29982825 http://dx.doi.org/10.1093/sysbio/syy051 |
_version_ | 1783380400929767424 |
---|---|
author | Leaché, Adam D Zhu, Tianqi Rannala, Bruce Yang, Ziheng |
author_facet | Leaché, Adam D Zhu, Tianqi Rannala, Bruce Yang, Ziheng |
author_sort | Leaché, Adam D |
collection | PubMed |
description | Recent simulation studies examining the performance of Bayesian species delimitation as implemented in the bpp program have suggested that bpp may detect population splits but not species divergences and that it tends to over-split when data of many loci are analyzed. Here, we confirm these results and provide the mathematical justifications. We point out that the distinction between population and species splits made in the protracted speciation model (PSM) has no influence on the generation of gene trees and sequence data, which explains why no method can use such data to distinguish between population splits and speciation. We suggest that the PSM is unrealistic as its mechanism for assigning species status assumes instantaneous speciation, contradicting prevailing taxonomic practice. We confirm the suggestion, based on simulation, that in the case of speciation with gene flow, Bayesian model selection as implemented in bpp tends to detect population splits when the amount of data (the number of loci) increases. We discuss the use of a recently proposed empirical genealogical divergence index (gdi) for species delimitation and illustrate that parameter estimates produced by a full likelihood analysis as implemented in bpp provide much more reliable inference under the gdi than the approximate method phrapl. We distinguish between Bayesian model selection and parameter estimation and suggest that the model selection approach is useful for identifying sympatric cryptic species, while the parameter estimation approach may be used to implement empirical criteria for determining species status among allopatric populations. |
format | Online Article Text |
id | pubmed-6292489 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-62924892018-12-19 The Spectre of Too Many Species Leaché, Adam D Zhu, Tianqi Rannala, Bruce Yang, Ziheng Syst Biol Points of View Recent simulation studies examining the performance of Bayesian species delimitation as implemented in the bpp program have suggested that bpp may detect population splits but not species divergences and that it tends to over-split when data of many loci are analyzed. Here, we confirm these results and provide the mathematical justifications. We point out that the distinction between population and species splits made in the protracted speciation model (PSM) has no influence on the generation of gene trees and sequence data, which explains why no method can use such data to distinguish between population splits and speciation. We suggest that the PSM is unrealistic as its mechanism for assigning species status assumes instantaneous speciation, contradicting prevailing taxonomic practice. We confirm the suggestion, based on simulation, that in the case of speciation with gene flow, Bayesian model selection as implemented in bpp tends to detect population splits when the amount of data (the number of loci) increases. We discuss the use of a recently proposed empirical genealogical divergence index (gdi) for species delimitation and illustrate that parameter estimates produced by a full likelihood analysis as implemented in bpp provide much more reliable inference under the gdi than the approximate method phrapl. We distinguish between Bayesian model selection and parameter estimation and suggest that the model selection approach is useful for identifying sympatric cryptic species, while the parameter estimation approach may be used to implement empirical criteria for determining species status among allopatric populations. Oxford University Press 2019-01 2018-07-05 /pmc/articles/PMC6292489/ /pubmed/29982825 http://dx.doi.org/10.1093/sysbio/syy051 Text en © The Author(s) 2018. Published by Oxford University Press, on behalf of the Society of Systematic Biologists http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For Permissions, please email: journals.permissions@oup.com |
spellingShingle | Points of View Leaché, Adam D Zhu, Tianqi Rannala, Bruce Yang, Ziheng The Spectre of Too Many Species |
title | The Spectre of Too Many Species |
title_full | The Spectre of Too Many Species |
title_fullStr | The Spectre of Too Many Species |
title_full_unstemmed | The Spectre of Too Many Species |
title_short | The Spectre of Too Many Species |
title_sort | spectre of too many species |
topic | Points of View |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6292489/ https://www.ncbi.nlm.nih.gov/pubmed/29982825 http://dx.doi.org/10.1093/sysbio/syy051 |
work_keys_str_mv | AT leacheadamd thespectreoftoomanyspecies AT zhutianqi thespectreoftoomanyspecies AT rannalabruce thespectreoftoomanyspecies AT yangziheng thespectreoftoomanyspecies AT leacheadamd spectreoftoomanyspecies AT zhutianqi spectreoftoomanyspecies AT rannalabruce spectreoftoomanyspecies AT yangziheng spectreoftoomanyspecies |