Sampling strategy optimization to increase statistical power in landscape genomics: A simulation‐based approach

An increasing number of studies are using landscape genomics to investigate local adaptation in wild and domestic populations. Implementation of this approach requires the sampling phase to consider the complexity of environmental settings and the burden of logistical constraints. These important as...

Bibliographic Details
Main authors: Selmoni, Oliver; Vajana, Elia; Guillaume, Annie; Rochat, Estelle; Joost, Stéphane
Format: Online Article Text
Language: English
Published: John Wiley and Sons Inc. 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6972490/
https://www.ncbi.nlm.nih.gov/pubmed/31550072
http://dx.doi.org/10.1111/1755-0998.13095
_version_ 1783489842280136704
author Selmoni, Oliver
Vajana, Elia
Guillaume, Annie
Rochat, Estelle
Joost, Stéphane
collection PubMed
description An increasing number of studies are using landscape genomics to investigate local adaptation in wild and domestic populations. Implementation of this approach requires the sampling phase to consider the complexity of environmental settings and the burden of logistical constraints. These important aspects are often underestimated in the literature dedicated to sampling strategies. In this study, we computed simulated genomic data sets to run against actual environmental data in order to trial landscape genomics experiments under distinct sampling strategies. These strategies differed by design approach (to enhance environmental and/or geographical representativeness at study sites), number of sampling locations and sample sizes. We then evaluated how these elements affected statistical performance (power and false discoveries) under two antithetical demographic scenarios. Our results highlight the importance of selecting an appropriate sample size, which should be modified based on the demographic characteristics of the studied population. For species with limited dispersal, sample sizes above 200 units are generally sufficient to detect most adaptive signals, while in random mating populations this threshold should be increased to 400 units. Furthermore, we describe a design approach that maximizes both environmental and geographical representativeness of sampling sites and show how it systematically outperforms random or regular sampling schemes. Finally, we show that although having more sampling locations (between 40 and 50 sites) increases statistical power and reduces the false discovery rate, similar results can be achieved with a moderate number of sites (20 sites). Overall, this study provides valuable guidelines for optimizing sampling strategies for landscape genomics experiments.
format Online
Article
Text
id pubmed-6972490
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-6972490 2020-01-27 Sampling strategy optimization to increase statistical power in landscape genomics: A simulation‐based approach Selmoni, Oliver Vajana, Elia Guillaume, Annie Rochat, Estelle Joost, Stéphane Mol Ecol Resour RESOURCE ARTICLES An increasing number of studies are using landscape genomics to investigate local adaptation in wild and domestic populations. Implementation of this approach requires the sampling phase to consider the complexity of environmental settings and the burden of logistical constraints. These important aspects are often underestimated in the literature dedicated to sampling strategies. In this study, we computed simulated genomic data sets to run against actual environmental data in order to trial landscape genomics experiments under distinct sampling strategies. These strategies differed by design approach (to enhance environmental and/or geographical representativeness at study sites), number of sampling locations and sample sizes. We then evaluated how these elements affected statistical performance (power and false discoveries) under two antithetical demographic scenarios. Our results highlight the importance of selecting an appropriate sample size, which should be modified based on the demographic characteristics of the studied population. For species with limited dispersal, sample sizes above 200 units are generally sufficient to detect most adaptive signals, while in random mating populations this threshold should be increased to 400 units. Furthermore, we describe a design approach that maximizes both environmental and geographical representativeness of sampling sites and show how it systematically outperforms random or regular sampling schemes. Finally, we show that although having more sampling locations (between 40 and 50 sites) increases statistical power and reduces the false discovery rate, similar results can be achieved with a moderate number of sites (20 sites).
Overall, this study provides valuable guidelines for optimizing sampling strategies for landscape genomics experiments. John Wiley and Sons Inc. 2019-10-21 2020-01 /pmc/articles/PMC6972490/ /pubmed/31550072 http://dx.doi.org/10.1111/1755-0998.13095 Text en © 2019 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
title Sampling strategy optimization to increase statistical power in landscape genomics: A simulation‐based approach
topic RESOURCE ARTICLES