Cargando…

Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure

Over the past decades, the use of molecular markers has revolutionized biology and led to the foundation of a new research discipline—phylogeography. Of particular interest has been the inference of population structure and biogeography. While initial studies focused on mtDNA as a molecular marker,...

Descripción completa

Detalles Bibliográficos
Autores principales: OROZCO-terWENGEL, PABLO, CORANDER, JUKKA, SCHLÖTTERER, CHRISTIAN
Formato: Texto
Lenguaje:English
Publicado: Blackwell Publishing Ltd 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3084510/
https://www.ncbi.nlm.nih.gov/pubmed/21244537
http://dx.doi.org/10.1111/j.1365-294X.2010.04990.x
_version_ 1782202515336462336
author OROZCO-terWENGEL, PABLO
CORANDER, JUKKA
SCHLÖTTERER, CHRISTIAN
author_facet OROZCO-terWENGEL, PABLO
CORANDER, JUKKA
SCHLÖTTERER, CHRISTIAN
author_sort OROZCO-terWENGEL, PABLO
collection PubMed
description Over the past decades, the use of molecular markers has revolutionized biology and led to the foundation of a new research discipline—phylogeography. Of particular interest has been the inference of population structure and biogeography. While initial studies focused on mtDNA as a molecular marker, it has become apparent that selection and genealogical lineage sorting could lead to erroneous inferences. As it is not clear to what extent these forces affect a given marker, it has become common practice to use the combined evidence from a set of molecular markers as an attempt to recover the signals that approximate the true underlying demography. Typically, the number of markers used is determined by either budget constraints or by statistical power required to recognize significant population differentiation. Using microsatellite markers from Drosophila and humans, we show that even large numbers of loci (>50) can frequently result in statistically well-supported, but incorrect inference of population structure using the software baps. Most importantly, genomic features, such as chromosomal location, variability of the markers, or recombination rate, cannot explain this observation. Instead, it can be attributed to sampling variation among loci with different realizations of the stochastic lineage sorting. This phenomenon is particularly pronounced for low levels of population differentiation. Our results have important implications for ongoing studies of population differentiation, as we unambiguously demonstrate that statistical significance of population structure inferred from a random set of genetic markers cannot necessarily be taken as evidence for a reliable demographic inference.
format Text
id pubmed-3084510
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Blackwell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-30845102011-05-11 Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure OROZCO-terWENGEL, PABLO CORANDER, JUKKA SCHLÖTTERER, CHRISTIAN Mol Ecol Original Articles Over the past decades, the use of molecular markers has revolutionized biology and led to the foundation of a new research discipline—phylogeography. Of particular interest has been the inference of population structure and biogeography. While initial studies focused on mtDNA as a molecular marker, it has become apparent that selection and genealogical lineage sorting could lead to erroneous inferences. As it is not clear to what extent these forces affect a given marker, it has become common practice to use the combined evidence from a set of molecular markers as an attempt to recover the signals that approximate the true underlying demography. Typically, the number of markers used is determined by either budget constraints or by statistical power required to recognize significant population differentiation. Using microsatellite markers from Drosophila and humans, we show that even large numbers of loci (>50) can frequently result in statistically well-supported, but incorrect inference of population structure using the software baps. Most importantly, genomic features, such as chromosomal location, variability of the markers, or recombination rate, cannot explain this observation. Instead, it can be attributed to sampling variation among loci with different realizations of the stochastic lineage sorting. This phenomenon is particularly pronounced for low levels of population differentiation. Our results have important implications for ongoing studies of population differentiation, as we unambiguously demonstrate that statistical significance of population structure inferred from a random set of genetic markers cannot necessarily be taken as evidence for a reliable demographic inference. Blackwell Publishing Ltd 2011-03 /pmc/articles/PMC3084510/ /pubmed/21244537 http://dx.doi.org/10.1111/j.1365-294X.2010.04990.x Text en Copyright © 2011 Blackwell Publishing Ltd http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation.
spellingShingle Original Articles
OROZCO-terWENGEL, PABLO
CORANDER, JUKKA
SCHLÖTTERER, CHRISTIAN
Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure
title Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure
title_full Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure
title_fullStr Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure
title_full_unstemmed Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure
title_short Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure
title_sort genealogical lineage sorting leads to significant, but incorrect bayesian multilocus inference of population structure
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3084510/
https://www.ncbi.nlm.nih.gov/pubmed/21244537
http://dx.doi.org/10.1111/j.1365-294X.2010.04990.x
work_keys_str_mv AT orozcoterwengelpablo genealogicallineagesortingleadstosignificantbutincorrectbayesianmultilocusinferenceofpopulationstructure
AT coranderjukka genealogicallineagesortingleadstosignificantbutincorrectbayesianmultilocusinferenceofpopulationstructure
AT schlottererchristian genealogicallineagesortingleadstosignificantbutincorrectbayesianmultilocusinferenceofpopulationstructure