Cargando…

Genome-wide scans for selective sweeps using convolutional neural networks

MOTIVATION: Recent methods for selective sweep detection cast the problem as a classification task and use summary statistics as features to capture region characteristics that are indicative of a selective sweep, thereby being sensitive to confounding factors. Furthermore, they are not designed to...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Hanqing, Souilljee, Matthijs, Pavlidis, Pavlos, Alachiotis, Nikolaos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311404/
https://www.ncbi.nlm.nih.gov/pubmed/37387128
http://dx.doi.org/10.1093/bioinformatics/btad265
_version_ 1785066733347799040
author Zhao, Hanqing
Souilljee, Matthijs
Pavlidis, Pavlos
Alachiotis, Nikolaos
author_facet Zhao, Hanqing
Souilljee, Matthijs
Pavlidis, Pavlos
Alachiotis, Nikolaos
author_sort Zhao, Hanqing
collection PubMed
description MOTIVATION: Recent methods for selective sweep detection cast the problem as a classification task and use summary statistics as features to capture region characteristics that are indicative of a selective sweep, thereby being sensitive to confounding factors. Furthermore, they are not designed to perform whole-genome scans or to estimate the extent of the genomic region that was affected by positive selection; both are required for identifying candidate genes and the time and strength of selection. RESULTS: We present ASDEC (https://github.com/pephco/ASDEC), a neural-network-based framework that can scan whole genomes for selective sweeps. ASDEC achieves similar classification performance to other convolutional neural network-based classifiers that rely on summary statistics, but it is trained 10× faster and classifies genomic regions 5× faster by inferring region characteristics from the raw sequence data directly. Deploying ASDEC for genomic scans achieved up to 15.2× higher sensitivity, 19.4× higher success rates, and 4× higher detection accuracy than state-of-the-art methods. We used ASDEC to scan human chromosome 1 of the Yoruba population (1000Genomes project), identifying nine known candidate genes.
format Online
Article
Text
id pubmed-10311404
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103114042023-07-01 Genome-wide scans for selective sweeps using convolutional neural networks Zhao, Hanqing Souilljee, Matthijs Pavlidis, Pavlos Alachiotis, Nikolaos Bioinformatics Evolutionary, Comparative and Population Genomics MOTIVATION: Recent methods for selective sweep detection cast the problem as a classification task and use summary statistics as features to capture region characteristics that are indicative of a selective sweep, thereby being sensitive to confounding factors. Furthermore, they are not designed to perform whole-genome scans or to estimate the extent of the genomic region that was affected by positive selection; both are required for identifying candidate genes and the time and strength of selection. RESULTS: We present ASDEC (https://github.com/pephco/ASDEC), a neural-network-based framework that can scan whole genomes for selective sweeps. ASDEC achieves similar classification performance to other convolutional neural network-based classifiers that rely on summary statistics, but it is trained 10× faster and classifies genomic regions 5× faster by inferring region characteristics from the raw sequence data directly. Deploying ASDEC for genomic scans achieved up to 15.2× higher sensitivity, 19.4× higher success rates, and 4× higher detection accuracy than state-of-the-art methods. We used ASDEC to scan human chromosome 1 of the Yoruba population (1000Genomes project), identifying nine known candidate genes. Oxford University Press 2023-06-30 /pmc/articles/PMC10311404/ /pubmed/37387128 http://dx.doi.org/10.1093/bioinformatics/btad265 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Evolutionary, Comparative and Population Genomics
Zhao, Hanqing
Souilljee, Matthijs
Pavlidis, Pavlos
Alachiotis, Nikolaos
Genome-wide scans for selective sweeps using convolutional neural networks
title Genome-wide scans for selective sweeps using convolutional neural networks
title_full Genome-wide scans for selective sweeps using convolutional neural networks
title_fullStr Genome-wide scans for selective sweeps using convolutional neural networks
title_full_unstemmed Genome-wide scans for selective sweeps using convolutional neural networks
title_short Genome-wide scans for selective sweeps using convolutional neural networks
title_sort genome-wide scans for selective sweeps using convolutional neural networks
topic Evolutionary, Comparative and Population Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311404/
https://www.ncbi.nlm.nih.gov/pubmed/37387128
http://dx.doi.org/10.1093/bioinformatics/btad265
work_keys_str_mv AT zhaohanqing genomewidescansforselectivesweepsusingconvolutionalneuralnetworks
AT souilljeematthijs genomewidescansforselectivesweepsusingconvolutionalneuralnetworks
AT pavlidispavlos genomewidescansforselectivesweepsusingconvolutionalneuralnetworks
AT alachiotisnikolaos genomewidescansforselectivesweepsusingconvolutionalneuralnetworks