Cargando…

Effects of single nucleotide polymorphism ascertainment on population structure inferences

Single nucleotide polymorphism (SNP) data are widely used in research on natural populations. Although they are useful, SNP genotyping data are known to contain bias, normally referred to as ascertainment bias, because they are conditioned by already confirmed variants. This bias is introduced durin...

Descripción completa

Detalles Bibliográficos
Autores principales: Dokan, Kotaro, Kawamura, Sayu, Teshima, Kosuke M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8496283/
https://www.ncbi.nlm.nih.gov/pubmed/33871576
http://dx.doi.org/10.1093/g3journal/jkab128
_version_ 1784579723962089472
author Dokan, Kotaro
Kawamura, Sayu
Teshima, Kosuke M
author_facet Dokan, Kotaro
Kawamura, Sayu
Teshima, Kosuke M
author_sort Dokan, Kotaro
collection PubMed
description Single nucleotide polymorphism (SNP) data are widely used in research on natural populations. Although they are useful, SNP genotyping data are known to contain bias, normally referred to as ascertainment bias, because they are conditioned by already confirmed variants. This bias is introduced during the genotyping process, including the selection of populations for novel SNP discovery and the number of individuals involved in the discovery panel and selection of SNP markers. It is widely recognized that ascertainment bias can cause inaccurate inferences in population genetics and several methods to address these bias issues have been proposed. However, especially in natural populations, it is not always possible to apply an ideal ascertainment scheme because natural populations tend to have complex structures and histories. In addition, it was not fully assessed if ascertainment bias has the same effect on different types of population structure. Here, we examine the effects of bias produced during the selection of population for SNP discovery and consequent SNP marker selection processes under three demographic models: the island, stepping-stone, and population split models. Results show that site frequency spectra and summary statistics contain biases that depend on the joint effect of population structure and ascertainment schemes. Additionally, population structure inferences are also affected by ascertainment bias. Based on these results, it is recommended to evaluate the validity of the ascertainment strategy prior to the actual typing process because the direction and extent of ascertainment bias vary depending on several factors.
format Online
Article
Text
id pubmed-8496283
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84962832021-10-07 Effects of single nucleotide polymorphism ascertainment on population structure inferences Dokan, Kotaro Kawamura, Sayu Teshima, Kosuke M G3 (Bethesda) Investigation Single nucleotide polymorphism (SNP) data are widely used in research on natural populations. Although they are useful, SNP genotyping data are known to contain bias, normally referred to as ascertainment bias, because they are conditioned by already confirmed variants. This bias is introduced during the genotyping process, including the selection of populations for novel SNP discovery and the number of individuals involved in the discovery panel and selection of SNP markers. It is widely recognized that ascertainment bias can cause inaccurate inferences in population genetics and several methods to address these bias issues have been proposed. However, especially in natural populations, it is not always possible to apply an ideal ascertainment scheme because natural populations tend to have complex structures and histories. In addition, it was not fully assessed if ascertainment bias has the same effect on different types of population structure. Here, we examine the effects of bias produced during the selection of population for SNP discovery and consequent SNP marker selection processes under three demographic models: the island, stepping-stone, and population split models. Results show that site frequency spectra and summary statistics contain biases that depend on the joint effect of population structure and ascertainment schemes. Additionally, population structure inferences are also affected by ascertainment bias. Based on these results, it is recommended to evaluate the validity of the ascertainment strategy prior to the actual typing process because the direction and extent of ascertainment bias vary depending on several factors. Oxford University Press 2021-04-19 /pmc/articles/PMC8496283/ /pubmed/33871576 http://dx.doi.org/10.1093/g3journal/jkab128 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Dokan, Kotaro
Kawamura, Sayu
Teshima, Kosuke M
Effects of single nucleotide polymorphism ascertainment on population structure inferences
title Effects of single nucleotide polymorphism ascertainment on population structure inferences
title_full Effects of single nucleotide polymorphism ascertainment on population structure inferences
title_fullStr Effects of single nucleotide polymorphism ascertainment on population structure inferences
title_full_unstemmed Effects of single nucleotide polymorphism ascertainment on population structure inferences
title_short Effects of single nucleotide polymorphism ascertainment on population structure inferences
title_sort effects of single nucleotide polymorphism ascertainment on population structure inferences
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8496283/
https://www.ncbi.nlm.nih.gov/pubmed/33871576
http://dx.doi.org/10.1093/g3journal/jkab128
work_keys_str_mv AT dokankotaro effectsofsinglenucleotidepolymorphismascertainmentonpopulationstructureinferences
AT kawamurasayu effectsofsinglenucleotidepolymorphismascertainmentonpopulationstructureinferences
AT teshimakosukem effectsofsinglenucleotidepolymorphismascertainmentonpopulationstructureinferences