Cargando…

DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing

BACKGROUND: Nanopore sequencing allows selective sequencing, the ability to programmatically reject unwanted reads in a sample. Selective sequencing has many present and future applications in genomics research and the classification of species from a pool of species is an example. Existing methods...

Descripción completa

Detalles Bibliográficos
Autores principales: Senanayake, Anjana, Gamaarachchi, Hasindu, Herath, Damayanthi, Ragel, Roshan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9883605/
https://www.ncbi.nlm.nih.gov/pubmed/36709261
http://dx.doi.org/10.1186/s12859-023-05151-0
_version_ 1784879544173330432
author Senanayake, Anjana
Gamaarachchi, Hasindu
Herath, Damayanthi
Ragel, Roshan
author_facet Senanayake, Anjana
Gamaarachchi, Hasindu
Herath, Damayanthi
Ragel, Roshan
author_sort Senanayake, Anjana
collection PubMed
description BACKGROUND: Nanopore sequencing allows selective sequencing, the ability to programmatically reject unwanted reads in a sample. Selective sequencing has many present and future applications in genomics research and the classification of species from a pool of species is an example. Existing methods for selective sequencing for species classification are still immature and the accuracy highly varies depending on the datasets. For the five datasets we tested, the accuracy of existing methods varied in the range of [Formula: see text]  77 to 97% (average accuracy < 89%). Here we present DeepSelectNet, an accurate deep-learning-based method that can directly classify nanopore current signals belonging to a particular species. DeepSelectNet utilizes novel data preprocessing techniques and improved neural network architecture for regularization. RESULTS: For the five datasets tested, DeepSelectNet’s accuracy varied between [Formula: see text]  91 and 99% (average accuracy [Formula: see text]  95%). At its best performance, DeepSelectNet achieved a nearly 12% accuracy increase compared to its deep learning-based predecessor SquiggleNet. Furthermore, precision and recall evaluated for DeepSelectNet on average were always > 89% (average [Formula: see text]  95%). In terms of execution performance, DeepSelectNet outperformed SquiggleNet by [Formula: see text]  13% on average. Thus, DeepSelectNet is a practically viable method to improve the effectiveness of selective sequencing. CONCLUSIONS: Compared to base alignment and deep learning predecessors, DeepSelectNet can significantly improve the accuracy to enable real-time species classification using selective sequencing. The source code of DeepSelectNet is available at https://github.com/AnjanaSenanayake/DeepSelectNet.
format Online
Article
Text
id pubmed-9883605
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-98836052023-01-29 DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing Senanayake, Anjana Gamaarachchi, Hasindu Herath, Damayanthi Ragel, Roshan BMC Bioinformatics Research BACKGROUND: Nanopore sequencing allows selective sequencing, the ability to programmatically reject unwanted reads in a sample. Selective sequencing has many present and future applications in genomics research and the classification of species from a pool of species is an example. Existing methods for selective sequencing for species classification are still immature and the accuracy highly varies depending on the datasets. For the five datasets we tested, the accuracy of existing methods varied in the range of [Formula: see text]  77 to 97% (average accuracy < 89%). Here we present DeepSelectNet, an accurate deep-learning-based method that can directly classify nanopore current signals belonging to a particular species. DeepSelectNet utilizes novel data preprocessing techniques and improved neural network architecture for regularization. RESULTS: For the five datasets tested, DeepSelectNet’s accuracy varied between [Formula: see text]  91 and 99% (average accuracy [Formula: see text]  95%). At its best performance, DeepSelectNet achieved a nearly 12% accuracy increase compared to its deep learning-based predecessor SquiggleNet. Furthermore, precision and recall evaluated for DeepSelectNet on average were always > 89% (average [Formula: see text]  95%). In terms of execution performance, DeepSelectNet outperformed SquiggleNet by [Formula: see text]  13% on average. Thus, DeepSelectNet is a practically viable method to improve the effectiveness of selective sequencing. CONCLUSIONS: Compared to base alignment and deep learning predecessors, DeepSelectNet can significantly improve the accuracy to enable real-time species classification using selective sequencing. The source code of DeepSelectNet is available at https://github.com/AnjanaSenanayake/DeepSelectNet. BioMed Central 2023-01-28 /pmc/articles/PMC9883605/ /pubmed/36709261 http://dx.doi.org/10.1186/s12859-023-05151-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Senanayake, Anjana
Gamaarachchi, Hasindu
Herath, Damayanthi
Ragel, Roshan
DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing
title DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing
title_full DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing
title_fullStr DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing
title_full_unstemmed DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing
title_short DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing
title_sort deepselectnet: deep neural network based selective sequencing for oxford nanopore sequencing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9883605/
https://www.ncbi.nlm.nih.gov/pubmed/36709261
http://dx.doi.org/10.1186/s12859-023-05151-0
work_keys_str_mv AT senanayakeanjana deepselectnetdeepneuralnetworkbasedselectivesequencingforoxfordnanoporesequencing
AT gamaarachchihasindu deepselectnetdeepneuralnetworkbasedselectivesequencingforoxfordnanoporesequencing
AT herathdamayanthi deepselectnetdeepneuralnetworkbasedselectivesequencingforoxfordnanoporesequencing
AT ragelroshan deepselectnetdeepneuralnetworkbasedselectivesequencingforoxfordnanoporesequencing