Cargando…

Dispersal inference from population genetic variation using a convolutional neural network

The geographic nature of biological dispersal shapes patterns of genetic variation over landscapes, making it possible to infer properties of dispersal from genetic variation data. Here, we present an inference tool that uses geographically distributed genotype data in combination with a convolution...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Chris C R, Tittes, Silas, Ralph, Peter L, Kern, Andrew D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10213498/
https://www.ncbi.nlm.nih.gov/pubmed/37052957
http://dx.doi.org/10.1093/genetics/iyad068
_version_ 1785047632869064704
author Smith, Chris C R
Tittes, Silas
Ralph, Peter L
Kern, Andrew D
author_facet Smith, Chris C R
Tittes, Silas
Ralph, Peter L
Kern, Andrew D
author_sort Smith, Chris C R
collection PubMed
description The geographic nature of biological dispersal shapes patterns of genetic variation over landscapes, making it possible to infer properties of dispersal from genetic variation data. Here, we present an inference tool that uses geographically distributed genotype data in combination with a convolutional neural network to estimate a critical population parameter: the mean per-generation dispersal distance. Using extensive simulation, we show that our deep learning approach is competitive with or outperforms state-of-the-art methods, particularly at small sample sizes. In addition, we evaluate varying nuisance parameters during training—including population density, demographic history, habitat size, and sampling area—and show that this strategy is effective for estimating dispersal distance when other model parameters are unknown. Whereas competing methods depend on information about local population density or accurate inference of identity-by-descent tracts, our method uses only single-nucleotide-polymorphism data and the spatial scale of sampling as input. Strikingly, and unlike other methods, our method does not use the geographic coordinates of the genotyped individuals. These features make our method, which we call “disperseNN,” a potentially valuable new tool for estimating dispersal distance in nonmodel systems with whole genome data or reduced representation data. We apply disperseNN to 12 different species with publicly available data, yielding reasonable estimates for most species. Importantly, our method estimated consistently larger dispersal distances than mark-recapture calculations in the same species, which may be due to the limited geographic sampling area covered by some mark-recapture studies. Thus genetic tools like ours complement direct methods for improving our understanding of dispersal.
format Online
Article
Text
id pubmed-10213498
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102134982023-05-27 Dispersal inference from population genetic variation using a convolutional neural network Smith, Chris C R Tittes, Silas Ralph, Peter L Kern, Andrew D Genetics Investigation The geographic nature of biological dispersal shapes patterns of genetic variation over landscapes, making it possible to infer properties of dispersal from genetic variation data. Here, we present an inference tool that uses geographically distributed genotype data in combination with a convolutional neural network to estimate a critical population parameter: the mean per-generation dispersal distance. Using extensive simulation, we show that our deep learning approach is competitive with or outperforms state-of-the-art methods, particularly at small sample sizes. In addition, we evaluate varying nuisance parameters during training—including population density, demographic history, habitat size, and sampling area—and show that this strategy is effective for estimating dispersal distance when other model parameters are unknown. Whereas competing methods depend on information about local population density or accurate inference of identity-by-descent tracts, our method uses only single-nucleotide-polymorphism data and the spatial scale of sampling as input. Strikingly, and unlike other methods, our method does not use the geographic coordinates of the genotyped individuals. These features make our method, which we call “disperseNN,” a potentially valuable new tool for estimating dispersal distance in nonmodel systems with whole genome data or reduced representation data. We apply disperseNN to 12 different species with publicly available data, yielding reasonable estimates for most species. Importantly, our method estimated consistently larger dispersal distances than mark-recapture calculations in the same species, which may be due to the limited geographic sampling area covered by some mark-recapture studies. Thus genetic tools like ours complement direct methods for improving our understanding of dispersal. Oxford University Press 2023-04-13 /pmc/articles/PMC10213498/ /pubmed/37052957 http://dx.doi.org/10.1093/genetics/iyad068 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of The Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Smith, Chris C R
Tittes, Silas
Ralph, Peter L
Kern, Andrew D
Dispersal inference from population genetic variation using a convolutional neural network
title Dispersal inference from population genetic variation using a convolutional neural network
title_full Dispersal inference from population genetic variation using a convolutional neural network
title_fullStr Dispersal inference from population genetic variation using a convolutional neural network
title_full_unstemmed Dispersal inference from population genetic variation using a convolutional neural network
title_short Dispersal inference from population genetic variation using a convolutional neural network
title_sort dispersal inference from population genetic variation using a convolutional neural network
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10213498/
https://www.ncbi.nlm.nih.gov/pubmed/37052957
http://dx.doi.org/10.1093/genetics/iyad068
work_keys_str_mv AT smithchriscr dispersalinferencefrompopulationgeneticvariationusingaconvolutionalneuralnetwork
AT tittessilas dispersalinferencefrompopulationgeneticvariationusingaconvolutionalneuralnetwork
AT ralphpeterl dispersalinferencefrompopulationgeneticvariationusingaconvolutionalneuralnetwork
AT kernandrewd dispersalinferencefrompopulationgeneticvariationusingaconvolutionalneuralnetwork