Cargando…

baseLess: lightweight detection of sequences in raw MinION data

SUMMARY: With its candybar form factor and low initial investment cost, the MinION brought affordable portable nucleic acid analysis within reach. However, translating the electrical signal it outputs into a sequence of bases still requires mid-tier computer hardware, which remains a caveat when aim...

Descripción completa

Detalles Bibliográficos
Autores principales: Noordijk, Ben, Nijland, Reindert, Carrion, Victor J, Raaijmakers, Jos M, de Ridder, Dick, de Lannoy, Carlos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9936955/
https://www.ncbi.nlm.nih.gov/pubmed/36818730
http://dx.doi.org/10.1093/bioadv/vbad017
_version_ 1784890331712454656
author Noordijk, Ben
Nijland, Reindert
Carrion, Victor J
Raaijmakers, Jos M
de Ridder, Dick
de Lannoy, Carlos
author_facet Noordijk, Ben
Nijland, Reindert
Carrion, Victor J
Raaijmakers, Jos M
de Ridder, Dick
de Lannoy, Carlos
author_sort Noordijk, Ben
collection PubMed
description SUMMARY: With its candybar form factor and low initial investment cost, the MinION brought affordable portable nucleic acid analysis within reach. However, translating the electrical signal it outputs into a sequence of bases still requires mid-tier computer hardware, which remains a caveat when aiming for deployment of many devices at once or usage in remote areas. For applications focusing on detection of a target sequence, such as infectious disease monitoring or species identification, the computational cost of analysis may be reduced by directly detecting the target sequence in the electrical signal instead. Here, we present baseLess, a computational tool that enables such target-detection-only analysis. BaseLess makes use of an array of small neural networks, each of which efficiently detects a fixed-size subsequence of the target sequence directly from the electrical signal. We show that baseLess can accurately determine the identity of reads between three closely related fish species and can classify sequences in mixtures of 20 bacterial species, on an inexpensive single-board computer. AVAILABILITY AND IMPLEMENTATION: baseLess and all code used in data preparation and validation are available on Github at https://github.com/cvdelannoy/baseLess, under an MIT license. Used validation data and scripts can be found at https://doi.org/10.4121/20261392, under an MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format Online
Article
Text
id pubmed-9936955
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99369552023-02-18 baseLess: lightweight detection of sequences in raw MinION data Noordijk, Ben Nijland, Reindert Carrion, Victor J Raaijmakers, Jos M de Ridder, Dick de Lannoy, Carlos Bioinform Adv Original Paper SUMMARY: With its candybar form factor and low initial investment cost, the MinION brought affordable portable nucleic acid analysis within reach. However, translating the electrical signal it outputs into a sequence of bases still requires mid-tier computer hardware, which remains a caveat when aiming for deployment of many devices at once or usage in remote areas. For applications focusing on detection of a target sequence, such as infectious disease monitoring or species identification, the computational cost of analysis may be reduced by directly detecting the target sequence in the electrical signal instead. Here, we present baseLess, a computational tool that enables such target-detection-only analysis. BaseLess makes use of an array of small neural networks, each of which efficiently detects a fixed-size subsequence of the target sequence directly from the electrical signal. We show that baseLess can accurately determine the identity of reads between three closely related fish species and can classify sequences in mixtures of 20 bacterial species, on an inexpensive single-board computer. AVAILABILITY AND IMPLEMENTATION: baseLess and all code used in data preparation and validation are available on Github at https://github.com/cvdelannoy/baseLess, under an MIT license. Used validation data and scripts can be found at https://doi.org/10.4121/20261392, under an MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2023-02-15 /pmc/articles/PMC9936955/ /pubmed/36818730 http://dx.doi.org/10.1093/bioadv/vbad017 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Noordijk, Ben
Nijland, Reindert
Carrion, Victor J
Raaijmakers, Jos M
de Ridder, Dick
de Lannoy, Carlos
baseLess: lightweight detection of sequences in raw MinION data
title baseLess: lightweight detection of sequences in raw MinION data
title_full baseLess: lightweight detection of sequences in raw MinION data
title_fullStr baseLess: lightweight detection of sequences in raw MinION data
title_full_unstemmed baseLess: lightweight detection of sequences in raw MinION data
title_short baseLess: lightweight detection of sequences in raw MinION data
title_sort baseless: lightweight detection of sequences in raw minion data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9936955/
https://www.ncbi.nlm.nih.gov/pubmed/36818730
http://dx.doi.org/10.1093/bioadv/vbad017
work_keys_str_mv AT noordijkben baselesslightweightdetectionofsequencesinrawminiondata
AT nijlandreindert baselesslightweightdetectionofsequencesinrawminiondata
AT carrionvictorj baselesslightweightdetectionofsequencesinrawminiondata
AT raaijmakersjosm baselesslightweightdetectionofsequencesinrawminiondata
AT deridderdick baselesslightweightdetectionofsequencesinrawminiondata
AT delannoycarlos baselesslightweightdetectionofsequencesinrawminiondata