Cargando…

disperseNN2: a neural network for estimating dispersal distance from georeferenced polymorphism data

Spatial genetic variation is shaped in part by an organism’s dispersal ability. We present a deep learning tool, disperseNN2, for estimating the mean per-generation dispersal distance from georeferenced polymorphism data. Our neural network performs feature extraction on pairs of genotypes, and uses...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Chris C. R., Kern, Andrew D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10566146/
https://www.ncbi.nlm.nih.gov/pubmed/37817115
http://dx.doi.org/10.1186/s12859-023-05522-7
_version_ 1785118860302614528
author Smith, Chris C. R.
Kern, Andrew D.
author_facet Smith, Chris C. R.
Kern, Andrew D.
author_sort Smith, Chris C. R.
collection PubMed
description Spatial genetic variation is shaped in part by an organism’s dispersal ability. We present a deep learning tool, disperseNN2, for estimating the mean per-generation dispersal distance from georeferenced polymorphism data. Our neural network performs feature extraction on pairs of genotypes, and uses the geographic information that comes with each sample. These attributes led disperseNN2 to outperform a state-of-the-art deep learning method that does not use explicit spatial information: the mean relative absolute error was reduced by 33% and 48% using sample sizes of 10 and 100 individuals, respectively. disperseNN2 is particularly useful for non-model organisms or systems with sparse genomic resources, as it uses unphased, single nucleotide polymorphisms as its input. The software is open source and available from https://github.com/kr-colab/disperseNN2, with documentation located at https://dispersenn2.readthedocs.io/en/latest/.
format Online
Article
Text
id pubmed-10566146
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-105661462023-10-12 disperseNN2: a neural network for estimating dispersal distance from georeferenced polymorphism data Smith, Chris C. R. Kern, Andrew D. BMC Bioinformatics Software Spatial genetic variation is shaped in part by an organism’s dispersal ability. We present a deep learning tool, disperseNN2, for estimating the mean per-generation dispersal distance from georeferenced polymorphism data. Our neural network performs feature extraction on pairs of genotypes, and uses the geographic information that comes with each sample. These attributes led disperseNN2 to outperform a state-of-the-art deep learning method that does not use explicit spatial information: the mean relative absolute error was reduced by 33% and 48% using sample sizes of 10 and 100 individuals, respectively. disperseNN2 is particularly useful for non-model organisms or systems with sparse genomic resources, as it uses unphased, single nucleotide polymorphisms as its input. The software is open source and available from https://github.com/kr-colab/disperseNN2, with documentation located at https://dispersenn2.readthedocs.io/en/latest/. BioMed Central 2023-10-11 /pmc/articles/PMC10566146/ /pubmed/37817115 http://dx.doi.org/10.1186/s12859-023-05522-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Smith, Chris C. R.
Kern, Andrew D.
disperseNN2: a neural network for estimating dispersal distance from georeferenced polymorphism data
title disperseNN2: a neural network for estimating dispersal distance from georeferenced polymorphism data
title_full disperseNN2: a neural network for estimating dispersal distance from georeferenced polymorphism data
title_fullStr disperseNN2: a neural network for estimating dispersal distance from georeferenced polymorphism data
title_full_unstemmed disperseNN2: a neural network for estimating dispersal distance from georeferenced polymorphism data
title_short disperseNN2: a neural network for estimating dispersal distance from georeferenced polymorphism data
title_sort dispersenn2: a neural network for estimating dispersal distance from georeferenced polymorphism data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10566146/
https://www.ncbi.nlm.nih.gov/pubmed/37817115
http://dx.doi.org/10.1186/s12859-023-05522-7
work_keys_str_mv AT smithchriscr dispersenn2aneuralnetworkforestimatingdispersaldistancefromgeoreferencedpolymorphismdata
AT kernandrewd dispersenn2aneuralnetworkforestimatingdispersaldistancefromgeoreferencedpolymorphismdata