Cargando…

SquiggleNet: real-time, direct classification of nanopore signals

We present SquiggleNet, the first deep-learning model that can classify nanopore reads directly from their electrical signals. SquiggleNet operates faster than DNA passes through the pore, allowing real-time classification and read ejection. Using 1 s of sequencing data, the classifier achieves sign...

Descripción completa

Detalles Bibliográficos
Autores principales: Bao, Yuwei, Wadden, Jack, Erb-Downward, John R., Ranjan, Piyush, Zhou, Weichen, McDonald, Torrin L., Mills, Ryan E., Boyle, Alan P., Dickson, Robert P., Blaauw, David, Welch, Joshua D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8548853/
https://www.ncbi.nlm.nih.gov/pubmed/34706748
http://dx.doi.org/10.1186/s13059-021-02511-y
Descripción
Sumario:We present SquiggleNet, the first deep-learning model that can classify nanopore reads directly from their electrical signals. SquiggleNet operates faster than DNA passes through the pore, allowing real-time classification and read ejection. Using 1 s of sequencing data, the classifier achieves significantly higher accuracy than base calling followed by sequence alignment. Our approach is also faster and requires an order of magnitude less memory than alignment-based approaches. SquiggleNet distinguished human from bacterial DNA with over 90% accuracy, generalized to unseen bacterial species in a human respiratory meta genome sample, and accurately classified sequences containing human long interspersed repeat elements. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s13059-021-02511-y).