Cargando…
Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times
BACKGROUND: Homoplasy affects demographic inference estimates. This effect has been recognized and corrective methods have been developed. However, no studies so far have defined what homoplasy metrics best describe the effects on demographic inference, or have attempted to estimate such metrics in...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5594565/ https://www.ncbi.nlm.nih.gov/pubmed/28893173 http://dx.doi.org/10.1186/s12862-017-1046-4 |
_version_ | 1783263228808134656 |
---|---|
author | Ortega-Del Vecchyo, Diego Piñero, Daniel Jardón-Barbolla, Lev van Heerwaarden, Joost |
author_facet | Ortega-Del Vecchyo, Diego Piñero, Daniel Jardón-Barbolla, Lev van Heerwaarden, Joost |
author_sort | Ortega-Del Vecchyo, Diego |
collection | PubMed |
description | BACKGROUND: Homoplasy affects demographic inference estimates. This effect has been recognized and corrective methods have been developed. However, no studies so far have defined what homoplasy metrics best describe the effects on demographic inference, or have attempted to estimate such metrics in real data. Here we study how homoplasy in chloroplast microsatellites (cpSSR) affects inference of population expansion time. cpSSRs are popular markers for inferring historical demography in plants due to their high mutation rate and limited recombination. RESULTS: In cpSSRs, homoplasy is usually quantified as the probability that two markers or haplotypes that are identical by state are not identical by descent (Homoplasy index, P). Here we propose a new measure of multi-locus homoplasy in linked SSR called Distance Homoplasy (DH), which measures the proportion of pairwise differences not observed due to homoplasy, and we compare it to P and its per cpSSR locus average, which we call Mean Size Homoplasy (MSH). We use simulations and analytical derivations to show that, out of the three homoplasy metrics analyzed, MSH and DH are more correlated to changes in the population expansion time and to the underestimation of that demographic parameter using cpSSR. We perform simulations to show that Approximate Bayesian Computation (ABC) can be used to obtain reasonable estimates of MSH and DH. Finally, we use ABC to estimate the expansion time, MSH and DH from a chloroplast SSR dataset in Pinus caribaea. To our knowledge, this is the first time that homoplasy has been estimated in population genetic data. CONCLUSIONS: We show that MSH and DH should be used to quantify how homoplasy affects estimates of population expansion time. We also demonstrate how ABC provides a methodology to estimate homoplasy in population genetic data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-017-1046-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5594565 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-55945652017-09-14 Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times Ortega-Del Vecchyo, Diego Piñero, Daniel Jardón-Barbolla, Lev van Heerwaarden, Joost BMC Evol Biol Research Article BACKGROUND: Homoplasy affects demographic inference estimates. This effect has been recognized and corrective methods have been developed. However, no studies so far have defined what homoplasy metrics best describe the effects on demographic inference, or have attempted to estimate such metrics in real data. Here we study how homoplasy in chloroplast microsatellites (cpSSR) affects inference of population expansion time. cpSSRs are popular markers for inferring historical demography in plants due to their high mutation rate and limited recombination. RESULTS: In cpSSRs, homoplasy is usually quantified as the probability that two markers or haplotypes that are identical by state are not identical by descent (Homoplasy index, P). Here we propose a new measure of multi-locus homoplasy in linked SSR called Distance Homoplasy (DH), which measures the proportion of pairwise differences not observed due to homoplasy, and we compare it to P and its per cpSSR locus average, which we call Mean Size Homoplasy (MSH). We use simulations and analytical derivations to show that, out of the three homoplasy metrics analyzed, MSH and DH are more correlated to changes in the population expansion time and to the underestimation of that demographic parameter using cpSSR. We perform simulations to show that Approximate Bayesian Computation (ABC) can be used to obtain reasonable estimates of MSH and DH. Finally, we use ABC to estimate the expansion time, MSH and DH from a chloroplast SSR dataset in Pinus caribaea. To our knowledge, this is the first time that homoplasy has been estimated in population genetic data. CONCLUSIONS: We show that MSH and DH should be used to quantify how homoplasy affects estimates of population expansion time. We also demonstrate how ABC provides a methodology to estimate homoplasy in population genetic data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-017-1046-4) contains supplementary material, which is available to authorized users. BioMed Central 2017-09-11 /pmc/articles/PMC5594565/ /pubmed/28893173 http://dx.doi.org/10.1186/s12862-017-1046-4 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Ortega-Del Vecchyo, Diego Piñero, Daniel Jardón-Barbolla, Lev van Heerwaarden, Joost Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times |
title | Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times |
title_full | Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times |
title_fullStr | Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times |
title_full_unstemmed | Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times |
title_short | Appropriate homoplasy metrics in linked SSRs to predict an underestimation of demographic expansion times |
title_sort | appropriate homoplasy metrics in linked ssrs to predict an underestimation of demographic expansion times |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5594565/ https://www.ncbi.nlm.nih.gov/pubmed/28893173 http://dx.doi.org/10.1186/s12862-017-1046-4 |
work_keys_str_mv | AT ortegadelvecchyodiego appropriatehomoplasymetricsinlinkedssrstopredictanunderestimationofdemographicexpansiontimes AT pinerodaniel appropriatehomoplasymetricsinlinkedssrstopredictanunderestimationofdemographicexpansiontimes AT jardonbarbollalev appropriatehomoplasymetricsinlinkedssrstopredictanunderestimationofdemographicexpansiontimes AT vanheerwaardenjoost appropriatehomoplasymetricsinlinkedssrstopredictanunderestimationofdemographicexpansiontimes |