Cargando…

The Spectre of Too Many Species

Recent simulation studies examining the performance of Bayesian species delimitation as implemented in the bpp program have suggested that bpp may detect population splits but not species divergences and that it tends to over-split when data of many loci are analyzed. Here, we confirm these results...

Descripción completa

Detalles Bibliográficos
Autores principales: Leaché, Adam D, Zhu, Tianqi, Rannala, Bruce, Yang, Ziheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6292489/
https://www.ncbi.nlm.nih.gov/pubmed/29982825
http://dx.doi.org/10.1093/sysbio/syy051
_version_ 1783380400929767424
author Leaché, Adam D
Zhu, Tianqi
Rannala, Bruce
Yang, Ziheng
author_facet Leaché, Adam D
Zhu, Tianqi
Rannala, Bruce
Yang, Ziheng
author_sort Leaché, Adam D
collection PubMed
description Recent simulation studies examining the performance of Bayesian species delimitation as implemented in the bpp program have suggested that bpp may detect population splits but not species divergences and that it tends to over-split when data of many loci are analyzed. Here, we confirm these results and provide the mathematical justifications. We point out that the distinction between population and species splits made in the protracted speciation model (PSM) has no influence on the generation of gene trees and sequence data, which explains why no method can use such data to distinguish between population splits and speciation. We suggest that the PSM is unrealistic as its mechanism for assigning species status assumes instantaneous speciation, contradicting prevailing taxonomic practice. We confirm the suggestion, based on simulation, that in the case of speciation with gene flow, Bayesian model selection as implemented in bpp tends to detect population splits when the amount of data (the number of loci) increases. We discuss the use of a recently proposed empirical genealogical divergence index (gdi) for species delimitation and illustrate that parameter estimates produced by a full likelihood analysis as implemented in bpp provide much more reliable inference under the gdi than the approximate method phrapl. We distinguish between Bayesian model selection and parameter estimation and suggest that the model selection approach is useful for identifying sympatric cryptic species, while the parameter estimation approach may be used to implement empirical criteria for determining species status among allopatric populations.
format Online
Article
Text
id pubmed-6292489
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-62924892018-12-19 The Spectre of Too Many Species Leaché, Adam D Zhu, Tianqi Rannala, Bruce Yang, Ziheng Syst Biol Points of View Recent simulation studies examining the performance of Bayesian species delimitation as implemented in the bpp program have suggested that bpp may detect population splits but not species divergences and that it tends to over-split when data of many loci are analyzed. Here, we confirm these results and provide the mathematical justifications. We point out that the distinction between population and species splits made in the protracted speciation model (PSM) has no influence on the generation of gene trees and sequence data, which explains why no method can use such data to distinguish between population splits and speciation. We suggest that the PSM is unrealistic as its mechanism for assigning species status assumes instantaneous speciation, contradicting prevailing taxonomic practice. We confirm the suggestion, based on simulation, that in the case of speciation with gene flow, Bayesian model selection as implemented in bpp tends to detect population splits when the amount of data (the number of loci) increases. We discuss the use of a recently proposed empirical genealogical divergence index (gdi) for species delimitation and illustrate that parameter estimates produced by a full likelihood analysis as implemented in bpp provide much more reliable inference under the gdi than the approximate method phrapl. We distinguish between Bayesian model selection and parameter estimation and suggest that the model selection approach is useful for identifying sympatric cryptic species, while the parameter estimation approach may be used to implement empirical criteria for determining species status among allopatric populations. Oxford University Press 2019-01 2018-07-05 /pmc/articles/PMC6292489/ /pubmed/29982825 http://dx.doi.org/10.1093/sysbio/syy051 Text en © The Author(s) 2018. Published by Oxford University Press, on behalf of the Society of Systematic Biologists http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For Permissions, please email: journals.permissions@oup.com
spellingShingle Points of View
Leaché, Adam D
Zhu, Tianqi
Rannala, Bruce
Yang, Ziheng
The Spectre of Too Many Species
title The Spectre of Too Many Species
title_full The Spectre of Too Many Species
title_fullStr The Spectre of Too Many Species
title_full_unstemmed The Spectre of Too Many Species
title_short The Spectre of Too Many Species
title_sort spectre of too many species
topic Points of View
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6292489/
https://www.ncbi.nlm.nih.gov/pubmed/29982825
http://dx.doi.org/10.1093/sysbio/syy051
work_keys_str_mv AT leacheadamd thespectreoftoomanyspecies
AT zhutianqi thespectreoftoomanyspecies
AT rannalabruce thespectreoftoomanyspecies
AT yangziheng thespectreoftoomanyspecies
AT leacheadamd spectreoftoomanyspecies
AT zhutianqi spectreoftoomanyspecies
AT rannalabruce spectreoftoomanyspecies
AT yangziheng spectreoftoomanyspecies