Cargando…

Barcode-free multiplex plasmid sequencing using Bayesian analysis and nanopore sequencing

Plasmid construction is central to life science research, and sequence verification is arguably its costliest step. Long-read sequencing has emerged as a competitor to Sanger sequencing, with the principal benefit that whole plasmids can be sequenced in a single run. Nevertheless, the current cost o...

Descripción completa

Detalles Bibliográficos
Autores principales: Uematsu, Masaaki, Baskin, Jeremy M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10120676/
https://www.ncbi.nlm.nih.gov/pubmed/37090656
http://dx.doi.org/10.1101/2023.04.12.536413
_version_ 1785029222282035200
author Uematsu, Masaaki
Baskin, Jeremy M.
author_facet Uematsu, Masaaki
Baskin, Jeremy M.
author_sort Uematsu, Masaaki
collection PubMed
description Plasmid construction is central to life science research, and sequence verification is arguably its costliest step. Long-read sequencing has emerged as a competitor to Sanger sequencing, with the principal benefit that whole plasmids can be sequenced in a single run. Nevertheless, the current cost of nanopore sequencing is still prohibitive for routine sequencing during plasmid construction. We develop a computational approach termed Simple Algorithm for Very Efficient Multiplexing of Oxford Nanopore Experiments for You (SAVEMONEY) that guides researchers to mix multiple plasmids and subsequently computationally de-mixes the resultant sequences. SAVEMONEY defines optimal mixtures in a pre-survey step, and following sequencing, executes a post-analysis workflow involving sequence classification, alignment, and consensus determination. By using Bayesian analysis with prior probability of expected plasmid construction error rate, high-confidence sequences can be obtained for each plasmid in the mixture. Plasmids differing by as little as two bases can be mixed for submission as a single sample for nanopore sequencing, and routine multiplexing of even six plasmids can still maintain high accuracy of consensus sequencing. SAVEMONEY should further democratize whole-plasmid sequencing by nanopore and related technologies, driving down the effective cost of whole-plasmid sequencing to lower than that of a single Sanger sequencing run.
format Online
Article
Text
id pubmed-10120676
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-101206762023-04-22 Barcode-free multiplex plasmid sequencing using Bayesian analysis and nanopore sequencing Uematsu, Masaaki Baskin, Jeremy M. bioRxiv Article Plasmid construction is central to life science research, and sequence verification is arguably its costliest step. Long-read sequencing has emerged as a competitor to Sanger sequencing, with the principal benefit that whole plasmids can be sequenced in a single run. Nevertheless, the current cost of nanopore sequencing is still prohibitive for routine sequencing during plasmid construction. We develop a computational approach termed Simple Algorithm for Very Efficient Multiplexing of Oxford Nanopore Experiments for You (SAVEMONEY) that guides researchers to mix multiple plasmids and subsequently computationally de-mixes the resultant sequences. SAVEMONEY defines optimal mixtures in a pre-survey step, and following sequencing, executes a post-analysis workflow involving sequence classification, alignment, and consensus determination. By using Bayesian analysis with prior probability of expected plasmid construction error rate, high-confidence sequences can be obtained for each plasmid in the mixture. Plasmids differing by as little as two bases can be mixed for submission as a single sample for nanopore sequencing, and routine multiplexing of even six plasmids can still maintain high accuracy of consensus sequencing. SAVEMONEY should further democratize whole-plasmid sequencing by nanopore and related technologies, driving down the effective cost of whole-plasmid sequencing to lower than that of a single Sanger sequencing run. Cold Spring Harbor Laboratory 2023-04-21 /pmc/articles/PMC10120676/ /pubmed/37090656 http://dx.doi.org/10.1101/2023.04.12.536413 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Uematsu, Masaaki
Baskin, Jeremy M.
Barcode-free multiplex plasmid sequencing using Bayesian analysis and nanopore sequencing
title Barcode-free multiplex plasmid sequencing using Bayesian analysis and nanopore sequencing
title_full Barcode-free multiplex plasmid sequencing using Bayesian analysis and nanopore sequencing
title_fullStr Barcode-free multiplex plasmid sequencing using Bayesian analysis and nanopore sequencing
title_full_unstemmed Barcode-free multiplex plasmid sequencing using Bayesian analysis and nanopore sequencing
title_short Barcode-free multiplex plasmid sequencing using Bayesian analysis and nanopore sequencing
title_sort barcode-free multiplex plasmid sequencing using bayesian analysis and nanopore sequencing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10120676/
https://www.ncbi.nlm.nih.gov/pubmed/37090656
http://dx.doi.org/10.1101/2023.04.12.536413
work_keys_str_mv AT uematsumasaaki barcodefreemultiplexplasmidsequencingusingbayesiananalysisandnanoporesequencing
AT baskinjeremym barcodefreemultiplexplasmidsequencingusingbayesiananalysisandnanoporesequencing