Cargando…

dnadna: a deep learning framework for population genetics inference

MOTIVATION: We present dnadna, a flexible python-based software for deep learning inference in population genetics. It is task-agnostic and aims at facilitating the development, reproducibility, dissemination and re-usability of neural networks designed for population genetic data. RESULTS: dnadna d...

Descripción completa

Detalles Bibliográficos
Autores principales: Sanchez, Théophile, Bray, Erik Madison, Jobic, Pierre, Guez, Jérémy, Letournel, Anne-Catherine, Charpiat, Guillaume, Cury, Jean, Jay, Flora
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825738/
https://www.ncbi.nlm.nih.gov/pubmed/36445000
http://dx.doi.org/10.1093/bioinformatics/btac765
_version_ 1784866687476039680
author Sanchez, Théophile
Bray, Erik Madison
Jobic, Pierre
Guez, Jérémy
Letournel, Anne-Catherine
Charpiat, Guillaume
Cury, Jean
Jay, Flora
author_facet Sanchez, Théophile
Bray, Erik Madison
Jobic, Pierre
Guez, Jérémy
Letournel, Anne-Catherine
Charpiat, Guillaume
Cury, Jean
Jay, Flora
author_sort Sanchez, Théophile
collection PubMed
description MOTIVATION: We present dnadna, a flexible python-based software for deep learning inference in population genetics. It is task-agnostic and aims at facilitating the development, reproducibility, dissemination and re-usability of neural networks designed for population genetic data. RESULTS: dnadna defines multiple user-friendly workflows. First, users can implement new architectures and tasks, while benefiting from dnadna utility functions, training procedure and test environment, which saves time and decreases the likelihood of bugs. Second, the implemented networks can be re-optimized based on user-specified training sets and/or tasks. Newly implemented architectures and pre-trained networks are easily shareable with the community for further benchmarking or other applications. Finally, users can apply pre-trained networks in order to predict evolutionary history from alternative real or simulated genetic datasets, without requiring extensive knowledge in deep learning or coding in general. dnadna comes with a peer-reviewed, exchangeable neural network, allowing demographic inference from SNP data, that can be used directly or retrained to solve other tasks. Toy networks are also available to ease the exploration of the software, and we expect that the range of available architectures will keep expanding thanks to community contributions. AVAILABILITY AND IMPLEMENTATION: dnadna is a Python (≥3.7) package, its repository is available at gitlab.com/mlgenetics/dnadna and its associated documentation at mlgenetics.gitlab.io/dnadna/.
format Online
Article
Text
id pubmed-9825738
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98257382023-01-10 dnadna: a deep learning framework for population genetics inference Sanchez, Théophile Bray, Erik Madison Jobic, Pierre Guez, Jérémy Letournel, Anne-Catherine Charpiat, Guillaume Cury, Jean Jay, Flora Bioinformatics Applications Note MOTIVATION: We present dnadna, a flexible python-based software for deep learning inference in population genetics. It is task-agnostic and aims at facilitating the development, reproducibility, dissemination and re-usability of neural networks designed for population genetic data. RESULTS: dnadna defines multiple user-friendly workflows. First, users can implement new architectures and tasks, while benefiting from dnadna utility functions, training procedure and test environment, which saves time and decreases the likelihood of bugs. Second, the implemented networks can be re-optimized based on user-specified training sets and/or tasks. Newly implemented architectures and pre-trained networks are easily shareable with the community for further benchmarking or other applications. Finally, users can apply pre-trained networks in order to predict evolutionary history from alternative real or simulated genetic datasets, without requiring extensive knowledge in deep learning or coding in general. dnadna comes with a peer-reviewed, exchangeable neural network, allowing demographic inference from SNP data, that can be used directly or retrained to solve other tasks. Toy networks are also available to ease the exploration of the software, and we expect that the range of available architectures will keep expanding thanks to community contributions. AVAILABILITY AND IMPLEMENTATION: dnadna is a Python (≥3.7) package, its repository is available at gitlab.com/mlgenetics/dnadna and its associated documentation at mlgenetics.gitlab.io/dnadna/. Oxford University Press 2022-11-29 /pmc/articles/PMC9825738/ /pubmed/36445000 http://dx.doi.org/10.1093/bioinformatics/btac765 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Sanchez, Théophile
Bray, Erik Madison
Jobic, Pierre
Guez, Jérémy
Letournel, Anne-Catherine
Charpiat, Guillaume
Cury, Jean
Jay, Flora
dnadna: a deep learning framework for population genetics inference
title dnadna: a deep learning framework for population genetics inference
title_full dnadna: a deep learning framework for population genetics inference
title_fullStr dnadna: a deep learning framework for population genetics inference
title_full_unstemmed dnadna: a deep learning framework for population genetics inference
title_short dnadna: a deep learning framework for population genetics inference
title_sort dnadna: a deep learning framework for population genetics inference
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825738/
https://www.ncbi.nlm.nih.gov/pubmed/36445000
http://dx.doi.org/10.1093/bioinformatics/btac765
work_keys_str_mv AT sancheztheophile dnadnaadeeplearningframeworkforpopulationgeneticsinference
AT brayerikmadison dnadnaadeeplearningframeworkforpopulationgeneticsinference
AT jobicpierre dnadnaadeeplearningframeworkforpopulationgeneticsinference
AT guezjeremy dnadnaadeeplearningframeworkforpopulationgeneticsinference
AT letournelannecatherine dnadnaadeeplearningframeworkforpopulationgeneticsinference
AT charpiatguillaume dnadnaadeeplearningframeworkforpopulationgeneticsinference
AT curyjean dnadnaadeeplearningframeworkforpopulationgeneticsinference
AT jayflora dnadnaadeeplearningframeworkforpopulationgeneticsinference