Cargando…
Genome-wide scans for selective sweeps using convolutional neural networks
MOTIVATION: Recent methods for selective sweep detection cast the problem as a classification task and use summary statistics as features to capture region characteristics that are indicative of a selective sweep, thereby being sensitive to confounding factors. Furthermore, they are not designed to...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311404/ https://www.ncbi.nlm.nih.gov/pubmed/37387128 http://dx.doi.org/10.1093/bioinformatics/btad265 |
_version_ | 1785066733347799040 |
---|---|
author | Zhao, Hanqing Souilljee, Matthijs Pavlidis, Pavlos Alachiotis, Nikolaos |
author_facet | Zhao, Hanqing Souilljee, Matthijs Pavlidis, Pavlos Alachiotis, Nikolaos |
author_sort | Zhao, Hanqing |
collection | PubMed |
description | MOTIVATION: Recent methods for selective sweep detection cast the problem as a classification task and use summary statistics as features to capture region characteristics that are indicative of a selective sweep, thereby being sensitive to confounding factors. Furthermore, they are not designed to perform whole-genome scans or to estimate the extent of the genomic region that was affected by positive selection; both are required for identifying candidate genes and the time and strength of selection. RESULTS: We present ASDEC (https://github.com/pephco/ASDEC), a neural-network-based framework that can scan whole genomes for selective sweeps. ASDEC achieves similar classification performance to other convolutional neural network-based classifiers that rely on summary statistics, but it is trained 10× faster and classifies genomic regions 5× faster by inferring region characteristics from the raw sequence data directly. Deploying ASDEC for genomic scans achieved up to 15.2× higher sensitivity, 19.4× higher success rates, and 4× higher detection accuracy than state-of-the-art methods. We used ASDEC to scan human chromosome 1 of the Yoruba population (1000Genomes project), identifying nine known candidate genes. |
format | Online Article Text |
id | pubmed-10311404 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-103114042023-07-01 Genome-wide scans for selective sweeps using convolutional neural networks Zhao, Hanqing Souilljee, Matthijs Pavlidis, Pavlos Alachiotis, Nikolaos Bioinformatics Evolutionary, Comparative and Population Genomics MOTIVATION: Recent methods for selective sweep detection cast the problem as a classification task and use summary statistics as features to capture region characteristics that are indicative of a selective sweep, thereby being sensitive to confounding factors. Furthermore, they are not designed to perform whole-genome scans or to estimate the extent of the genomic region that was affected by positive selection; both are required for identifying candidate genes and the time and strength of selection. RESULTS: We present ASDEC (https://github.com/pephco/ASDEC), a neural-network-based framework that can scan whole genomes for selective sweeps. ASDEC achieves similar classification performance to other convolutional neural network-based classifiers that rely on summary statistics, but it is trained 10× faster and classifies genomic regions 5× faster by inferring region characteristics from the raw sequence data directly. Deploying ASDEC for genomic scans achieved up to 15.2× higher sensitivity, 19.4× higher success rates, and 4× higher detection accuracy than state-of-the-art methods. We used ASDEC to scan human chromosome 1 of the Yoruba population (1000Genomes project), identifying nine known candidate genes. Oxford University Press 2023-06-30 /pmc/articles/PMC10311404/ /pubmed/37387128 http://dx.doi.org/10.1093/bioinformatics/btad265 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Evolutionary, Comparative and Population Genomics Zhao, Hanqing Souilljee, Matthijs Pavlidis, Pavlos Alachiotis, Nikolaos Genome-wide scans for selective sweeps using convolutional neural networks |
title | Genome-wide scans for selective sweeps using convolutional neural networks |
title_full | Genome-wide scans for selective sweeps using convolutional neural networks |
title_fullStr | Genome-wide scans for selective sweeps using convolutional neural networks |
title_full_unstemmed | Genome-wide scans for selective sweeps using convolutional neural networks |
title_short | Genome-wide scans for selective sweeps using convolutional neural networks |
title_sort | genome-wide scans for selective sweeps using convolutional neural networks |
topic | Evolutionary, Comparative and Population Genomics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311404/ https://www.ncbi.nlm.nih.gov/pubmed/37387128 http://dx.doi.org/10.1093/bioinformatics/btad265 |
work_keys_str_mv | AT zhaohanqing genomewidescansforselectivesweepsusingconvolutionalneuralnetworks AT souilljeematthijs genomewidescansforselectivesweepsusingconvolutionalneuralnetworks AT pavlidispavlos genomewidescansforselectivesweepsusingconvolutionalneuralnetworks AT alachiotisnikolaos genomewidescansforselectivesweepsusingconvolutionalneuralnetworks |