Cargando…

Space is the Place: Effects of Continuous Spatial Structure on Analysis of Population Genetic Data

Real geography is continuous, but standard models in population genetics are based on discrete, well-mixed populations. As a result, many methods of analyzing genetic data assume that samples are a random draw from a well-mixed population, but are applied to clustered samples from populations that a...

Descripción completa

Detalles Bibliográficos
Autores principales: Battey, C. J., Ralph, Peter L., Kern, Andrew D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7198281/
https://www.ncbi.nlm.nih.gov/pubmed/32209569
http://dx.doi.org/10.1534/genetics.120.303143
_version_ 1783528967678984192
author Battey, C. J.
Ralph, Peter L.
Kern, Andrew D.
author_facet Battey, C. J.
Ralph, Peter L.
Kern, Andrew D.
author_sort Battey, C. J.
collection PubMed
description Real geography is continuous, but standard models in population genetics are based on discrete, well-mixed populations. As a result, many methods of analyzing genetic data assume that samples are a random draw from a well-mixed population, but are applied to clustered samples from populations that are structured clinally over space. Here, we use simulations of populations living in continuous geography to study the impacts of dispersal and sampling strategy on population genetic summary statistics, demographic inference, and genome-wide association studies (GWAS). We find that most common summary statistics have distributions that differ substantially from those seen in well-mixed populations, especially when Wright’s neighborhood size is < 100 and sampling is spatially clustered. “Stepping-stone” models reproduce some of these effects, but discretizing the landscape introduces artifacts that in some cases are exacerbated at higher resolutions. The combination of low dispersal and clustered sampling causes demographic inference from the site frequency spectrum to infer more turbulent demographic histories, but averaged results across multiple simulations revealed surprisingly little systematic bias. We also show that the combination of spatially autocorrelated environments and limited dispersal causes GWAS to identify spurious signals of genetic association with purely environmentally determined phenotypes, and that this bias is only partially corrected by regressing out principal components of ancestry. Last, we discuss the relevance of our simulation results for inference from genetic variation in real organisms.
format Online
Article
Text
id pubmed-7198281
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-71982812020-05-08 Space is the Place: Effects of Continuous Spatial Structure on Analysis of Population Genetic Data Battey, C. J. Ralph, Peter L. Kern, Andrew D. Genetics Investigations Real geography is continuous, but standard models in population genetics are based on discrete, well-mixed populations. As a result, many methods of analyzing genetic data assume that samples are a random draw from a well-mixed population, but are applied to clustered samples from populations that are structured clinally over space. Here, we use simulations of populations living in continuous geography to study the impacts of dispersal and sampling strategy on population genetic summary statistics, demographic inference, and genome-wide association studies (GWAS). We find that most common summary statistics have distributions that differ substantially from those seen in well-mixed populations, especially when Wright’s neighborhood size is < 100 and sampling is spatially clustered. “Stepping-stone” models reproduce some of these effects, but discretizing the landscape introduces artifacts that in some cases are exacerbated at higher resolutions. The combination of low dispersal and clustered sampling causes demographic inference from the site frequency spectrum to infer more turbulent demographic histories, but averaged results across multiple simulations revealed surprisingly little systematic bias. We also show that the combination of spatially autocorrelated environments and limited dispersal causes GWAS to identify spurious signals of genetic association with purely environmentally determined phenotypes, and that this bias is only partially corrected by regressing out principal components of ancestry. Last, we discuss the relevance of our simulation results for inference from genetic variation in real organisms. Genetics Society of America 2020-05 2020-03-24 /pmc/articles/PMC7198281/ /pubmed/32209569 http://dx.doi.org/10.1534/genetics.120.303143 Text en Copyright © 2020 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Battey, C. J.
Ralph, Peter L.
Kern, Andrew D.
Space is the Place: Effects of Continuous Spatial Structure on Analysis of Population Genetic Data
title Space is the Place: Effects of Continuous Spatial Structure on Analysis of Population Genetic Data
title_full Space is the Place: Effects of Continuous Spatial Structure on Analysis of Population Genetic Data
title_fullStr Space is the Place: Effects of Continuous Spatial Structure on Analysis of Population Genetic Data
title_full_unstemmed Space is the Place: Effects of Continuous Spatial Structure on Analysis of Population Genetic Data
title_short Space is the Place: Effects of Continuous Spatial Structure on Analysis of Population Genetic Data
title_sort space is the place: effects of continuous spatial structure on analysis of population genetic data
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7198281/
https://www.ncbi.nlm.nih.gov/pubmed/32209569
http://dx.doi.org/10.1534/genetics.120.303143
work_keys_str_mv AT batteycj spaceistheplaceeffectsofcontinuousspatialstructureonanalysisofpopulationgeneticdata
AT ralphpeterl spaceistheplaceeffectsofcontinuousspatialstructureonanalysisofpopulationgeneticdata
AT kernandrewd spaceistheplaceeffectsofcontinuousspatialstructureonanalysisofpopulationgeneticdata