Cargando…

Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel

Although ascertainment bias in single nucleotide polymorphisms is a well-known problem, it is generally accepted that microsatellites have mutation rates too high for bias to be a concern. Here, we analyze in detail the large set of microsatellites typed for the Human Genetic Diversity Panel (HGDP)-...

Descripción completa

Detalles Bibliográficos
Autores principales: Eriksson, Anders, Manica, Andrea
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3276161/
https://www.ncbi.nlm.nih.gov/pubmed/22384358
http://dx.doi.org/10.1534/g3.111.001016
_version_ 1782223340332646400
author Eriksson, Anders
Manica, Andrea
author_facet Eriksson, Anders
Manica, Andrea
author_sort Eriksson, Anders
collection PubMed
description Although ascertainment bias in single nucleotide polymorphisms is a well-known problem, it is generally accepted that microsatellites have mutation rates too high for bias to be a concern. Here, we analyze in detail the large set of microsatellites typed for the Human Genetic Diversity Panel (HGDP)-CEPH panel. We develop a novel framework based on rarefaction to compare heterozygosity across markers with different mutation rates. We find that, whereas di- and tri-nucleotides show similar patterns of within- and between-population heterozygosity, tetra-nucleotides are inconsistent with the other two motifs. In addition, di- and tri-nucleotides are consistent with 16 unbiased tetra-nucleotide markers, whereas the HPGP-CEPH tetra-nucleotides are significantly different. This discrepancy is due to the HGDP-CEPH tetra-nucleotides being too homogeneous across Eurasia, even after their slower mutation rate is taken into account by rarefying the other markers. The most likely explanation for this pattern is ascertainment bias. We strongly advocate the exclusion of tetra-nucleotides from future population genetics analysis of this dataset, and we argue that other microsatellite datasets should be investigated for the presence of bias using the approach outlined in this article.
format Online
Article
Text
id pubmed-3276161
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-32761612012-03-01 Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel Eriksson, Anders Manica, Andrea G3 (Bethesda) Investigation Although ascertainment bias in single nucleotide polymorphisms is a well-known problem, it is generally accepted that microsatellites have mutation rates too high for bias to be a concern. Here, we analyze in detail the large set of microsatellites typed for the Human Genetic Diversity Panel (HGDP)-CEPH panel. We develop a novel framework based on rarefaction to compare heterozygosity across markers with different mutation rates. We find that, whereas di- and tri-nucleotides show similar patterns of within- and between-population heterozygosity, tetra-nucleotides are inconsistent with the other two motifs. In addition, di- and tri-nucleotides are consistent with 16 unbiased tetra-nucleotide markers, whereas the HPGP-CEPH tetra-nucleotides are significantly different. This discrepancy is due to the HGDP-CEPH tetra-nucleotides being too homogeneous across Eurasia, even after their slower mutation rate is taken into account by rarefying the other markers. The most likely explanation for this pattern is ascertainment bias. We strongly advocate the exclusion of tetra-nucleotides from future population genetics analysis of this dataset, and we argue that other microsatellite datasets should be investigated for the presence of bias using the approach outlined in this article. Genetics Society of America 2011-11-01 /pmc/articles/PMC3276161/ /pubmed/22384358 http://dx.doi.org/10.1534/g3.111.001016 Text en Copyright © 2011 Eriksson, Manica http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution Unported License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Eriksson, Anders
Manica, Andrea
Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel
title Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel
title_full Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel
title_fullStr Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel
title_full_unstemmed Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel
title_short Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel
title_sort detecting and removing ascertainment bias in microsatellites from the hgdp-ceph panel
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3276161/
https://www.ncbi.nlm.nih.gov/pubmed/22384358
http://dx.doi.org/10.1534/g3.111.001016
work_keys_str_mv AT erikssonanders detectingandremovingascertainmentbiasinmicrosatellitesfromthehgdpcephpanel
AT manicaandrea detectingandremovingascertainmentbiasinmicrosatellitesfromthehgdpcephpanel