Cargando…
Deep Learning in Population Genetics
Population genetics is transitioning into a data-driven discipline thanks to the availability of large-scale genomic data and the need to study increasingly complex evolutionary scenarios. With likelihood and Bayesian approaches becoming either intractable or computationally unfeasible, machine lear...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9897193/ https://www.ncbi.nlm.nih.gov/pubmed/36683406 http://dx.doi.org/10.1093/gbe/evad008 |
_version_ | 1784882201181028352 |
---|---|
author | Korfmann, Kevin Gaggiotti, Oscar E Fumagalli, Matteo |
author_facet | Korfmann, Kevin Gaggiotti, Oscar E Fumagalli, Matteo |
author_sort | Korfmann, Kevin |
collection | PubMed |
description | Population genetics is transitioning into a data-driven discipline thanks to the availability of large-scale genomic data and the need to study increasingly complex evolutionary scenarios. With likelihood and Bayesian approaches becoming either intractable or computationally unfeasible, machine learning, and in particular deep learning, algorithms are emerging as popular techniques for population genetic inferences. These approaches rely on algorithms that learn non-linear relationships between the input data and the model parameters being estimated through representation learning from training data sets. Deep learning algorithms currently employed in the field comprise discriminative and generative models with fully connected, convolutional, or recurrent layers. Additionally, a wide range of powerful simulators to generate training data under complex scenarios are now available. The application of deep learning to empirical data sets mostly replicates previous findings of demography reconstruction and signals of natural selection in model organisms. To showcase the feasibility of deep learning to tackle new challenges, we designed a branched architecture to detect signals of recent balancing selection from temporal haplotypic data, which exhibited good predictive performance on simulated data. Investigations on the interpretability of neural networks, their robustness to uncertain training data, and creative representation of population genetic data, will provide further opportunities for technological advancements in the field. |
format | Online Article Text |
id | pubmed-9897193 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98971932023-02-06 Deep Learning in Population Genetics Korfmann, Kevin Gaggiotti, Oscar E Fumagalli, Matteo Genome Biol Evol Review Population genetics is transitioning into a data-driven discipline thanks to the availability of large-scale genomic data and the need to study increasingly complex evolutionary scenarios. With likelihood and Bayesian approaches becoming either intractable or computationally unfeasible, machine learning, and in particular deep learning, algorithms are emerging as popular techniques for population genetic inferences. These approaches rely on algorithms that learn non-linear relationships between the input data and the model parameters being estimated through representation learning from training data sets. Deep learning algorithms currently employed in the field comprise discriminative and generative models with fully connected, convolutional, or recurrent layers. Additionally, a wide range of powerful simulators to generate training data under complex scenarios are now available. The application of deep learning to empirical data sets mostly replicates previous findings of demography reconstruction and signals of natural selection in model organisms. To showcase the feasibility of deep learning to tackle new challenges, we designed a branched architecture to detect signals of recent balancing selection from temporal haplotypic data, which exhibited good predictive performance on simulated data. Investigations on the interpretability of neural networks, their robustness to uncertain training data, and creative representation of population genetic data, will provide further opportunities for technological advancements in the field. Oxford University Press 2023-01-23 /pmc/articles/PMC9897193/ /pubmed/36683406 http://dx.doi.org/10.1093/gbe/evad008 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Review Korfmann, Kevin Gaggiotti, Oscar E Fumagalli, Matteo Deep Learning in Population Genetics |
title | Deep Learning in Population Genetics |
title_full | Deep Learning in Population Genetics |
title_fullStr | Deep Learning in Population Genetics |
title_full_unstemmed | Deep Learning in Population Genetics |
title_short | Deep Learning in Population Genetics |
title_sort | deep learning in population genetics |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9897193/ https://www.ncbi.nlm.nih.gov/pubmed/36683406 http://dx.doi.org/10.1093/gbe/evad008 |
work_keys_str_mv | AT korfmannkevin deeplearninginpopulationgenetics AT gaggiottioscare deeplearninginpopulationgenetics AT fumagallimatteo deeplearninginpopulationgenetics |