Cargando…

Deep Learning in Population Genetics

Population genetics is transitioning into a data-driven discipline thanks to the availability of large-scale genomic data and the need to study increasingly complex evolutionary scenarios. With likelihood and Bayesian approaches becoming either intractable or computationally unfeasible, machine lear...

Descripción completa

Detalles Bibliográficos
Autores principales: Korfmann, Kevin, Gaggiotti, Oscar E, Fumagalli, Matteo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9897193/
https://www.ncbi.nlm.nih.gov/pubmed/36683406
http://dx.doi.org/10.1093/gbe/evad008
_version_ 1784882201181028352
author Korfmann, Kevin
Gaggiotti, Oscar E
Fumagalli, Matteo
author_facet Korfmann, Kevin
Gaggiotti, Oscar E
Fumagalli, Matteo
author_sort Korfmann, Kevin
collection PubMed
description Population genetics is transitioning into a data-driven discipline thanks to the availability of large-scale genomic data and the need to study increasingly complex evolutionary scenarios. With likelihood and Bayesian approaches becoming either intractable or computationally unfeasible, machine learning, and in particular deep learning, algorithms are emerging as popular techniques for population genetic inferences. These approaches rely on algorithms that learn non-linear relationships between the input data and the model parameters being estimated through representation learning from training data sets. Deep learning algorithms currently employed in the field comprise discriminative and generative models with fully connected, convolutional, or recurrent layers. Additionally, a wide range of powerful simulators to generate training data under complex scenarios are now available. The application of deep learning to empirical data sets mostly replicates previous findings of demography reconstruction and signals of natural selection in model organisms. To showcase the feasibility of deep learning to tackle new challenges, we designed a branched architecture to detect signals of recent balancing selection from temporal haplotypic data, which exhibited good predictive performance on simulated data. Investigations on the interpretability of neural networks, their robustness to uncertain training data, and creative representation of population genetic data, will provide further opportunities for technological advancements in the field.
format Online
Article
Text
id pubmed-9897193
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98971932023-02-06 Deep Learning in Population Genetics Korfmann, Kevin Gaggiotti, Oscar E Fumagalli, Matteo Genome Biol Evol Review Population genetics is transitioning into a data-driven discipline thanks to the availability of large-scale genomic data and the need to study increasingly complex evolutionary scenarios. With likelihood and Bayesian approaches becoming either intractable or computationally unfeasible, machine learning, and in particular deep learning, algorithms are emerging as popular techniques for population genetic inferences. These approaches rely on algorithms that learn non-linear relationships between the input data and the model parameters being estimated through representation learning from training data sets. Deep learning algorithms currently employed in the field comprise discriminative and generative models with fully connected, convolutional, or recurrent layers. Additionally, a wide range of powerful simulators to generate training data under complex scenarios are now available. The application of deep learning to empirical data sets mostly replicates previous findings of demography reconstruction and signals of natural selection in model organisms. To showcase the feasibility of deep learning to tackle new challenges, we designed a branched architecture to detect signals of recent balancing selection from temporal haplotypic data, which exhibited good predictive performance on simulated data. Investigations on the interpretability of neural networks, their robustness to uncertain training data, and creative representation of population genetic data, will provide further opportunities for technological advancements in the field. Oxford University Press 2023-01-23 /pmc/articles/PMC9897193/ /pubmed/36683406 http://dx.doi.org/10.1093/gbe/evad008 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review
Korfmann, Kevin
Gaggiotti, Oscar E
Fumagalli, Matteo
Deep Learning in Population Genetics
title Deep Learning in Population Genetics
title_full Deep Learning in Population Genetics
title_fullStr Deep Learning in Population Genetics
title_full_unstemmed Deep Learning in Population Genetics
title_short Deep Learning in Population Genetics
title_sort deep learning in population genetics
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9897193/
https://www.ncbi.nlm.nih.gov/pubmed/36683406
http://dx.doi.org/10.1093/gbe/evad008
work_keys_str_mv AT korfmannkevin deeplearninginpopulationgenetics
AT gaggiottioscare deeplearninginpopulationgenetics
AT fumagallimatteo deeplearninginpopulationgenetics