Cargando…
Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure
Over the past decades, the use of molecular markers has revolutionized biology and led to the foundation of a new research discipline—phylogeography. Of particular interest has been the inference of population structure and biogeography. While initial studies focused on mtDNA as a molecular marker,...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Blackwell Publishing Ltd
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3084510/ https://www.ncbi.nlm.nih.gov/pubmed/21244537 http://dx.doi.org/10.1111/j.1365-294X.2010.04990.x |
_version_ | 1782202515336462336 |
---|---|
author | OROZCO-terWENGEL, PABLO CORANDER, JUKKA SCHLÖTTERER, CHRISTIAN |
author_facet | OROZCO-terWENGEL, PABLO CORANDER, JUKKA SCHLÖTTERER, CHRISTIAN |
author_sort | OROZCO-terWENGEL, PABLO |
collection | PubMed |
description | Over the past decades, the use of molecular markers has revolutionized biology and led to the foundation of a new research discipline—phylogeography. Of particular interest has been the inference of population structure and biogeography. While initial studies focused on mtDNA as a molecular marker, it has become apparent that selection and genealogical lineage sorting could lead to erroneous inferences. As it is not clear to what extent these forces affect a given marker, it has become common practice to use the combined evidence from a set of molecular markers as an attempt to recover the signals that approximate the true underlying demography. Typically, the number of markers used is determined by either budget constraints or by statistical power required to recognize significant population differentiation. Using microsatellite markers from Drosophila and humans, we show that even large numbers of loci (>50) can frequently result in statistically well-supported, but incorrect inference of population structure using the software baps. Most importantly, genomic features, such as chromosomal location, variability of the markers, or recombination rate, cannot explain this observation. Instead, it can be attributed to sampling variation among loci with different realizations of the stochastic lineage sorting. This phenomenon is particularly pronounced for low levels of population differentiation. Our results have important implications for ongoing studies of population differentiation, as we unambiguously demonstrate that statistical significance of population structure inferred from a random set of genetic markers cannot necessarily be taken as evidence for a reliable demographic inference. |
format | Text |
id | pubmed-3084510 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Blackwell Publishing Ltd |
record_format | MEDLINE/PubMed |
spelling | pubmed-30845102011-05-11 Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure OROZCO-terWENGEL, PABLO CORANDER, JUKKA SCHLÖTTERER, CHRISTIAN Mol Ecol Original Articles Over the past decades, the use of molecular markers has revolutionized biology and led to the foundation of a new research discipline—phylogeography. Of particular interest has been the inference of population structure and biogeography. While initial studies focused on mtDNA as a molecular marker, it has become apparent that selection and genealogical lineage sorting could lead to erroneous inferences. As it is not clear to what extent these forces affect a given marker, it has become common practice to use the combined evidence from a set of molecular markers as an attempt to recover the signals that approximate the true underlying demography. Typically, the number of markers used is determined by either budget constraints or by statistical power required to recognize significant population differentiation. Using microsatellite markers from Drosophila and humans, we show that even large numbers of loci (>50) can frequently result in statistically well-supported, but incorrect inference of population structure using the software baps. Most importantly, genomic features, such as chromosomal location, variability of the markers, or recombination rate, cannot explain this observation. Instead, it can be attributed to sampling variation among loci with different realizations of the stochastic lineage sorting. This phenomenon is particularly pronounced for low levels of population differentiation. Our results have important implications for ongoing studies of population differentiation, as we unambiguously demonstrate that statistical significance of population structure inferred from a random set of genetic markers cannot necessarily be taken as evidence for a reliable demographic inference. Blackwell Publishing Ltd 2011-03 /pmc/articles/PMC3084510/ /pubmed/21244537 http://dx.doi.org/10.1111/j.1365-294X.2010.04990.x Text en Copyright © 2011 Blackwell Publishing Ltd http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation. |
spellingShingle | Original Articles OROZCO-terWENGEL, PABLO CORANDER, JUKKA SCHLÖTTERER, CHRISTIAN Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure |
title | Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure |
title_full | Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure |
title_fullStr | Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure |
title_full_unstemmed | Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure |
title_short | Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure |
title_sort | genealogical lineage sorting leads to significant, but incorrect bayesian multilocus inference of population structure |
topic | Original Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3084510/ https://www.ncbi.nlm.nih.gov/pubmed/21244537 http://dx.doi.org/10.1111/j.1365-294X.2010.04990.x |
work_keys_str_mv | AT orozcoterwengelpablo genealogicallineagesortingleadstosignificantbutincorrectbayesianmultilocusinferenceofpopulationstructure AT coranderjukka genealogicallineagesortingleadstosignificantbutincorrectbayesianmultilocusinferenceofpopulationstructure AT schlottererchristian genealogicallineagesortingleadstosignificantbutincorrectbayesianmultilocusinferenceofpopulationstructure |