Cargando…

Deepbinner: Demultiplexing barcoded Oxford Nanopore reads with deep convolutional neural networks

Multiplexing, the simultaneous sequencing of multiple barcoded DNA samples on a single flow cell, has made Oxford Nanopore sequencing cost-effective for small genomes. However, it depends on the ability to sort the resulting sequencing reads by barcode, and current demultiplexing tools fail to class...

Descripción completa

Detalles Bibliográficos
Autores principales: Wick, Ryan R., Judd, Louise M., Holt, Kathryn E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245502/
https://www.ncbi.nlm.nih.gov/pubmed/30458005
http://dx.doi.org/10.1371/journal.pcbi.1006583
_version_ 1783372255672139776
author Wick, Ryan R.
Judd, Louise M.
Holt, Kathryn E.
author_facet Wick, Ryan R.
Judd, Louise M.
Holt, Kathryn E.
author_sort Wick, Ryan R.
collection PubMed
description Multiplexing, the simultaneous sequencing of multiple barcoded DNA samples on a single flow cell, has made Oxford Nanopore sequencing cost-effective for small genomes. However, it depends on the ability to sort the resulting sequencing reads by barcode, and current demultiplexing tools fail to classify many reads. Here we present Deepbinner, a tool for Oxford Nanopore demultiplexing that uses a deep neural network to classify reads based on the raw electrical read signal. This ‘signal-space’ approach allows for greater accuracy than existing ‘base-space’ tools (Albacore and Porechop) for which signals must first be converted to DNA base calls, itself a complex problem that can introduce noise into the barcode sequence. To assess Deepbinner and existing tools, we performed multiplex sequencing on 12 amplicons chosen for their distinguishability. This allowed us to establish a ground truth classification for each read based on internal sequence alone. Deepbinner had the lowest rate of unclassified reads (7.8%) and the highest demultiplexing precision (98.5% of classified reads were correctly assigned). It can be used alone (to maximise the number of classified reads) or in conjunction with other demultiplexers (to maximise precision and minimise false positive classifications). We also found cross-sample chimeric reads (0.3%) and evidence of barcode switching (0.3%) in our dataset, which likely arise during library preparation and may be detrimental for quantitative studies that use multiplexing. Deepbinner is open source (GPLv3) and available at https://github.com/rrwick/Deepbinner.
format Online
Article
Text
id pubmed-6245502
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62455022018-12-01 Deepbinner: Demultiplexing barcoded Oxford Nanopore reads with deep convolutional neural networks Wick, Ryan R. Judd, Louise M. Holt, Kathryn E. PLoS Comput Biol Research Article Multiplexing, the simultaneous sequencing of multiple barcoded DNA samples on a single flow cell, has made Oxford Nanopore sequencing cost-effective for small genomes. However, it depends on the ability to sort the resulting sequencing reads by barcode, and current demultiplexing tools fail to classify many reads. Here we present Deepbinner, a tool for Oxford Nanopore demultiplexing that uses a deep neural network to classify reads based on the raw electrical read signal. This ‘signal-space’ approach allows for greater accuracy than existing ‘base-space’ tools (Albacore and Porechop) for which signals must first be converted to DNA base calls, itself a complex problem that can introduce noise into the barcode sequence. To assess Deepbinner and existing tools, we performed multiplex sequencing on 12 amplicons chosen for their distinguishability. This allowed us to establish a ground truth classification for each read based on internal sequence alone. Deepbinner had the lowest rate of unclassified reads (7.8%) and the highest demultiplexing precision (98.5% of classified reads were correctly assigned). It can be used alone (to maximise the number of classified reads) or in conjunction with other demultiplexers (to maximise precision and minimise false positive classifications). We also found cross-sample chimeric reads (0.3%) and evidence of barcode switching (0.3%) in our dataset, which likely arise during library preparation and may be detrimental for quantitative studies that use multiplexing. Deepbinner is open source (GPLv3) and available at https://github.com/rrwick/Deepbinner. Public Library of Science 2018-11-20 /pmc/articles/PMC6245502/ /pubmed/30458005 http://dx.doi.org/10.1371/journal.pcbi.1006583 Text en © 2018 Wick et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wick, Ryan R.
Judd, Louise M.
Holt, Kathryn E.
Deepbinner: Demultiplexing barcoded Oxford Nanopore reads with deep convolutional neural networks
title Deepbinner: Demultiplexing barcoded Oxford Nanopore reads with deep convolutional neural networks
title_full Deepbinner: Demultiplexing barcoded Oxford Nanopore reads with deep convolutional neural networks
title_fullStr Deepbinner: Demultiplexing barcoded Oxford Nanopore reads with deep convolutional neural networks
title_full_unstemmed Deepbinner: Demultiplexing barcoded Oxford Nanopore reads with deep convolutional neural networks
title_short Deepbinner: Demultiplexing barcoded Oxford Nanopore reads with deep convolutional neural networks
title_sort deepbinner: demultiplexing barcoded oxford nanopore reads with deep convolutional neural networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245502/
https://www.ncbi.nlm.nih.gov/pubmed/30458005
http://dx.doi.org/10.1371/journal.pcbi.1006583
work_keys_str_mv AT wickryanr deepbinnerdemultiplexingbarcodedoxfordnanoporereadswithdeepconvolutionalneuralnetworks
AT juddlouisem deepbinnerdemultiplexingbarcodedoxfordnanoporereadswithdeepconvolutionalneuralnetworks
AT holtkathryne deepbinnerdemultiplexingbarcodedoxfordnanoporereadswithdeepconvolutionalneuralnetworks