Cargando…

Molecular barcoding of native RNAs using nanopore sequencing and deep learning

Nanopore sequencing enables direct measurement of RNA molecules without conversion to cDNA, thus opening the gates to a new era for RNA biology. However, the lack of molecular barcoding of direct RNA nanopore sequencing data sets severely affects the applicability of this technology to biological sa...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Martin A., Ersavas, Tansel, Ferguson, James M., Liu, Huanle, Lucas, Morghan C., Begik, Oguzhan, Bojarski, Lilly, Barton, Kirston, Novoa, Eva Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7545146/
https://www.ncbi.nlm.nih.gov/pubmed/32907883
http://dx.doi.org/10.1101/gr.260836.120
_version_ 1783591973890818048
author Smith, Martin A.
Ersavas, Tansel
Ferguson, James M.
Liu, Huanle
Lucas, Morghan C.
Begik, Oguzhan
Bojarski, Lilly
Barton, Kirston
Novoa, Eva Maria
author_facet Smith, Martin A.
Ersavas, Tansel
Ferguson, James M.
Liu, Huanle
Lucas, Morghan C.
Begik, Oguzhan
Bojarski, Lilly
Barton, Kirston
Novoa, Eva Maria
author_sort Smith, Martin A.
collection PubMed
description Nanopore sequencing enables direct measurement of RNA molecules without conversion to cDNA, thus opening the gates to a new era for RNA biology. However, the lack of molecular barcoding of direct RNA nanopore sequencing data sets severely affects the applicability of this technology to biological samples, where RNA availability is often limited. Here, we provide the first experimental protocol and associated algorithm to barcode and demultiplex direct RNA nanopore sequencing data sets. Specifically, we present a novel and robust approach to accurately classify raw nanopore signal data by transforming current intensities into images or arrays of pixels, followed by classification using a deep learning algorithm. We demonstrate the power of this strategy by developing the first experimental protocol for barcoding and demultiplexing direct RNA sequencing libraries. Our method, DeePlexiCon, can classify 93% of reads with 95.1% accuracy or 60% of reads with 99.9% accuracy. The availability of an efficient and simple multiplexing strategy for native RNA sequencing will improve the cost-effectiveness of this technology, as well as facilitate the analysis of lower-input biological samples. Overall, our work exemplifies the power, simplicity, and robustness of signal-to-image conversion for nanopore data analysis using deep learning.
format Online
Article
Text
id pubmed-7545146
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-75451462021-03-01 Molecular barcoding of native RNAs using nanopore sequencing and deep learning Smith, Martin A. Ersavas, Tansel Ferguson, James M. Liu, Huanle Lucas, Morghan C. Begik, Oguzhan Bojarski, Lilly Barton, Kirston Novoa, Eva Maria Genome Res Method Nanopore sequencing enables direct measurement of RNA molecules without conversion to cDNA, thus opening the gates to a new era for RNA biology. However, the lack of molecular barcoding of direct RNA nanopore sequencing data sets severely affects the applicability of this technology to biological samples, where RNA availability is often limited. Here, we provide the first experimental protocol and associated algorithm to barcode and demultiplex direct RNA nanopore sequencing data sets. Specifically, we present a novel and robust approach to accurately classify raw nanopore signal data by transforming current intensities into images or arrays of pixels, followed by classification using a deep learning algorithm. We demonstrate the power of this strategy by developing the first experimental protocol for barcoding and demultiplexing direct RNA sequencing libraries. Our method, DeePlexiCon, can classify 93% of reads with 95.1% accuracy or 60% of reads with 99.9% accuracy. The availability of an efficient and simple multiplexing strategy for native RNA sequencing will improve the cost-effectiveness of this technology, as well as facilitate the analysis of lower-input biological samples. Overall, our work exemplifies the power, simplicity, and robustness of signal-to-image conversion for nanopore data analysis using deep learning. Cold Spring Harbor Laboratory Press 2020-09 /pmc/articles/PMC7545146/ /pubmed/32907883 http://dx.doi.org/10.1101/gr.260836.120 Text en © 2020 Smith et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Smith, Martin A.
Ersavas, Tansel
Ferguson, James M.
Liu, Huanle
Lucas, Morghan C.
Begik, Oguzhan
Bojarski, Lilly
Barton, Kirston
Novoa, Eva Maria
Molecular barcoding of native RNAs using nanopore sequencing and deep learning
title Molecular barcoding of native RNAs using nanopore sequencing and deep learning
title_full Molecular barcoding of native RNAs using nanopore sequencing and deep learning
title_fullStr Molecular barcoding of native RNAs using nanopore sequencing and deep learning
title_full_unstemmed Molecular barcoding of native RNAs using nanopore sequencing and deep learning
title_short Molecular barcoding of native RNAs using nanopore sequencing and deep learning
title_sort molecular barcoding of native rnas using nanopore sequencing and deep learning
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7545146/
https://www.ncbi.nlm.nih.gov/pubmed/32907883
http://dx.doi.org/10.1101/gr.260836.120
work_keys_str_mv AT smithmartina molecularbarcodingofnativernasusingnanoporesequencinganddeeplearning
AT ersavastansel molecularbarcodingofnativernasusingnanoporesequencinganddeeplearning
AT fergusonjamesm molecularbarcodingofnativernasusingnanoporesequencinganddeeplearning
AT liuhuanle molecularbarcodingofnativernasusingnanoporesequencinganddeeplearning
AT lucasmorghanc molecularbarcodingofnativernasusingnanoporesequencinganddeeplearning
AT begikoguzhan molecularbarcodingofnativernasusingnanoporesequencinganddeeplearning
AT bojarskililly molecularbarcodingofnativernasusingnanoporesequencinganddeeplearning
AT bartonkirston molecularbarcodingofnativernasusingnanoporesequencinganddeeplearning
AT novoaevamaria molecularbarcodingofnativernasusingnanoporesequencinganddeeplearning