Cargando…

Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times

BACKGROUND: Homoplasy affects demographic inference estimates. This effect has been recognized and corrective methods have been developed. However, no studies so far have defined what homoplasy metrics best describe the effects on demographic inference, or have attempted to estimate such metrics in...

Descripción completa

Detalles Bibliográficos
Autores principales: Ortega-Del Vecchyo, Diego, Piñero, Daniel, Jardón-Barbolla, Lev, van Heerwaarden, Joost
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5594565/
https://www.ncbi.nlm.nih.gov/pubmed/28893173
http://dx.doi.org/10.1186/s12862-017-1046-4
_version_ 1783263228808134656
author Ortega-Del Vecchyo, Diego
Piñero, Daniel
Jardón-Barbolla, Lev
van Heerwaarden, Joost
author_facet Ortega-Del Vecchyo, Diego
Piñero, Daniel
Jardón-Barbolla, Lev
van Heerwaarden, Joost
author_sort Ortega-Del Vecchyo, Diego
collection PubMed
description BACKGROUND: Homoplasy affects demographic inference estimates. This effect has been recognized and corrective methods have been developed. However, no studies so far have defined what homoplasy metrics best describe the effects on demographic inference, or have attempted to estimate such metrics in real data. Here we study how homoplasy in chloroplast microsatellites (cpSSR) affects inference of population expansion time. cpSSRs are popular markers for inferring historical demography in plants due to their high mutation rate and limited recombination. RESULTS: In cpSSRs, homoplasy is usually quantified as the probability that two markers or haplotypes that are identical by state are not identical by descent (Homoplasy index, P). Here we propose a new measure of multi-locus homoplasy in linked SSR called Distance Homoplasy (DH), which measures the proportion of pairwise differences not observed due to homoplasy, and we compare it to P and its per cpSSR locus average, which we call Mean Size Homoplasy (MSH). We use simulations and analytical derivations to show that, out of the three homoplasy metrics analyzed, MSH and DH are more correlated to changes in the population expansion time and to the underestimation of that demographic parameter using cpSSR. We perform simulations to show that Approximate Bayesian Computation (ABC) can be used to obtain reasonable estimates of MSH and DH. Finally, we use ABC to estimate the expansion time, MSH and DH from a chloroplast SSR dataset in Pinus caribaea. To our knowledge, this is the first time that homoplasy has been estimated in population genetic data. CONCLUSIONS: We show that MSH and DH should be used to quantify how homoplasy affects estimates of population expansion time. We also demonstrate how ABC provides a methodology to estimate homoplasy in population genetic data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-017-1046-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5594565
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55945652017-09-14 Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times Ortega-Del Vecchyo, Diego Piñero, Daniel Jardón-Barbolla, Lev van Heerwaarden, Joost BMC Evol Biol Research Article BACKGROUND: Homoplasy affects demographic inference estimates. This effect has been recognized and corrective methods have been developed. However, no studies so far have defined what homoplasy metrics best describe the effects on demographic inference, or have attempted to estimate such metrics in real data. Here we study how homoplasy in chloroplast microsatellites (cpSSR) affects inference of population expansion time. cpSSRs are popular markers for inferring historical demography in plants due to their high mutation rate and limited recombination. RESULTS: In cpSSRs, homoplasy is usually quantified as the probability that two markers or haplotypes that are identical by state are not identical by descent (Homoplasy index, P). Here we propose a new measure of multi-locus homoplasy in linked SSR called Distance Homoplasy (DH), which measures the proportion of pairwise differences not observed due to homoplasy, and we compare it to P and its per cpSSR locus average, which we call Mean Size Homoplasy (MSH). We use simulations and analytical derivations to show that, out of the three homoplasy metrics analyzed, MSH and DH are more correlated to changes in the population expansion time and to the underestimation of that demographic parameter using cpSSR. We perform simulations to show that Approximate Bayesian Computation (ABC) can be used to obtain reasonable estimates of MSH and DH. Finally, we use ABC to estimate the expansion time, MSH and DH from a chloroplast SSR dataset in Pinus caribaea. To our knowledge, this is the first time that homoplasy has been estimated in population genetic data. CONCLUSIONS: We show that MSH and DH should be used to quantify how homoplasy affects estimates of population expansion time. We also demonstrate how ABC provides a methodology to estimate homoplasy in population genetic data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-017-1046-4) contains supplementary material, which is available to authorized users. BioMed Central 2017-09-11 /pmc/articles/PMC5594565/ /pubmed/28893173 http://dx.doi.org/10.1186/s12862-017-1046-4 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ortega-Del Vecchyo, Diego
Piñero, Daniel
Jardón-Barbolla, Lev
van Heerwaarden, Joost
Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times
title Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times
title_full Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times
title_fullStr Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times
title_full_unstemmed Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times
title_short Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times
title_sort appropriate homoplasy metrics in linked ssrs to predict an underestimation of demographic expansion times
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5594565/
https://www.ncbi.nlm.nih.gov/pubmed/28893173
http://dx.doi.org/10.1186/s12862-017-1046-4
work_keys_str_mv AT ortegadelvecchyodiego appropriatehomoplasymetricsinlinkedssrstopredictanunderestimationofdemographicexpansiontimes
AT pinerodaniel appropriatehomoplasymetricsinlinkedssrstopredictanunderestimationofdemographicexpansiontimes
AT jardonbarbollalev appropriatehomoplasymetricsinlinkedssrstopredictanunderestimationofdemographicexpansiontimes
AT vanheerwaardenjoost appropriatehomoplasymetricsinlinkedssrstopredictanunderestimationofdemographicexpansiontimes