Cargando…
DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing
BACKGROUND: Nanopore sequencing allows selective sequencing, the ability to programmatically reject unwanted reads in a sample. Selective sequencing has many present and future applications in genomics research and the classification of species from a pool of species is an example. Existing methods...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9883605/ https://www.ncbi.nlm.nih.gov/pubmed/36709261 http://dx.doi.org/10.1186/s12859-023-05151-0 |
_version_ | 1784879544173330432 |
---|---|
author | Senanayake, Anjana Gamaarachchi, Hasindu Herath, Damayanthi Ragel, Roshan |
author_facet | Senanayake, Anjana Gamaarachchi, Hasindu Herath, Damayanthi Ragel, Roshan |
author_sort | Senanayake, Anjana |
collection | PubMed |
description | BACKGROUND: Nanopore sequencing allows selective sequencing, the ability to programmatically reject unwanted reads in a sample. Selective sequencing has many present and future applications in genomics research and the classification of species from a pool of species is an example. Existing methods for selective sequencing for species classification are still immature and the accuracy highly varies depending on the datasets. For the five datasets we tested, the accuracy of existing methods varied in the range of [Formula: see text] 77 to 97% (average accuracy < 89%). Here we present DeepSelectNet, an accurate deep-learning-based method that can directly classify nanopore current signals belonging to a particular species. DeepSelectNet utilizes novel data preprocessing techniques and improved neural network architecture for regularization. RESULTS: For the five datasets tested, DeepSelectNet’s accuracy varied between [Formula: see text] 91 and 99% (average accuracy [Formula: see text] 95%). At its best performance, DeepSelectNet achieved a nearly 12% accuracy increase compared to its deep learning-based predecessor SquiggleNet. Furthermore, precision and recall evaluated for DeepSelectNet on average were always > 89% (average [Formula: see text] 95%). In terms of execution performance, DeepSelectNet outperformed SquiggleNet by [Formula: see text] 13% on average. Thus, DeepSelectNet is a practically viable method to improve the effectiveness of selective sequencing. CONCLUSIONS: Compared to base alignment and deep learning predecessors, DeepSelectNet can significantly improve the accuracy to enable real-time species classification using selective sequencing. The source code of DeepSelectNet is available at https://github.com/AnjanaSenanayake/DeepSelectNet. |
format | Online Article Text |
id | pubmed-9883605 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-98836052023-01-29 DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing Senanayake, Anjana Gamaarachchi, Hasindu Herath, Damayanthi Ragel, Roshan BMC Bioinformatics Research BACKGROUND: Nanopore sequencing allows selective sequencing, the ability to programmatically reject unwanted reads in a sample. Selective sequencing has many present and future applications in genomics research and the classification of species from a pool of species is an example. Existing methods for selective sequencing for species classification are still immature and the accuracy highly varies depending on the datasets. For the five datasets we tested, the accuracy of existing methods varied in the range of [Formula: see text] 77 to 97% (average accuracy < 89%). Here we present DeepSelectNet, an accurate deep-learning-based method that can directly classify nanopore current signals belonging to a particular species. DeepSelectNet utilizes novel data preprocessing techniques and improved neural network architecture for regularization. RESULTS: For the five datasets tested, DeepSelectNet’s accuracy varied between [Formula: see text] 91 and 99% (average accuracy [Formula: see text] 95%). At its best performance, DeepSelectNet achieved a nearly 12% accuracy increase compared to its deep learning-based predecessor SquiggleNet. Furthermore, precision and recall evaluated for DeepSelectNet on average were always > 89% (average [Formula: see text] 95%). In terms of execution performance, DeepSelectNet outperformed SquiggleNet by [Formula: see text] 13% on average. Thus, DeepSelectNet is a practically viable method to improve the effectiveness of selective sequencing. CONCLUSIONS: Compared to base alignment and deep learning predecessors, DeepSelectNet can significantly improve the accuracy to enable real-time species classification using selective sequencing. The source code of DeepSelectNet is available at https://github.com/AnjanaSenanayake/DeepSelectNet. BioMed Central 2023-01-28 /pmc/articles/PMC9883605/ /pubmed/36709261 http://dx.doi.org/10.1186/s12859-023-05151-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Senanayake, Anjana Gamaarachchi, Hasindu Herath, Damayanthi Ragel, Roshan DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing |
title | DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing |
title_full | DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing |
title_fullStr | DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing |
title_full_unstemmed | DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing |
title_short | DeepSelectNet: deep neural network based selective sequencing for oxford nanopore sequencing |
title_sort | deepselectnet: deep neural network based selective sequencing for oxford nanopore sequencing |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9883605/ https://www.ncbi.nlm.nih.gov/pubmed/36709261 http://dx.doi.org/10.1186/s12859-023-05151-0 |
work_keys_str_mv | AT senanayakeanjana deepselectnetdeepneuralnetworkbasedselectivesequencingforoxfordnanoporesequencing AT gamaarachchihasindu deepselectnetdeepneuralnetworkbasedselectivesequencingforoxfordnanoporesequencing AT herathdamayanthi deepselectnetdeepneuralnetworkbasedselectivesequencingforoxfordnanoporesequencing AT ragelroshan deepselectnetdeepneuralnetworkbasedselectivesequencingforoxfordnanoporesequencing |