Cargando…

Dynamic, adaptive sampling during nanopore sequencing using Bayesian experimental design

Nanopore sequencers can select which DNA molecules to sequence, rejecting a molecule after analysis of a small initial part. Currently, selection is based on predetermined regions of interest that remain constant throughout an experiment. Sequencing efforts, thus, cannot be re-focused on molecules l...

Descripción completa

Detalles Bibliográficos
Autores principales: Weilguny, Lukas, De Maio, Nicola, Munro, Rory, Manser, Charlotte, Birney, Ewan, Loose, Matthew, Goldman, Nick
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10344778/
https://www.ncbi.nlm.nih.gov/pubmed/36593407
http://dx.doi.org/10.1038/s41587-022-01580-z
_version_ 1785072935126433792
author Weilguny, Lukas
De Maio, Nicola
Munro, Rory
Manser, Charlotte
Birney, Ewan
Loose, Matthew
Goldman, Nick
author_facet Weilguny, Lukas
De Maio, Nicola
Munro, Rory
Manser, Charlotte
Birney, Ewan
Loose, Matthew
Goldman, Nick
author_sort Weilguny, Lukas
collection PubMed
description Nanopore sequencers can select which DNA molecules to sequence, rejecting a molecule after analysis of a small initial part. Currently, selection is based on predetermined regions of interest that remain constant throughout an experiment. Sequencing efforts, thus, cannot be re-focused on molecules likely contributing most to experimental success. Here we present BOSS-RUNS, an algorithmic framework and software to generate dynamically updated decision strategies. We quantify uncertainty at each genome position with real-time updates from data already observed. For each DNA fragment, we decide whether the expected decrease in uncertainty that it would provide warrants fully sequencing it, thus optimizing information gain. BOSS-RUNS mitigates coverage bias between and within members of a microbial community, leading to improved variant calling; for example, low-coverage sites of a species at 1% abundance were reduced by 87.5%, with 12.5% more single-nucleotide polymorphisms detected. Such data-driven updates to molecule selection are applicable to many sequencing scenarios, such as enriching for regions with increased divergence or low coverage, reducing time-to-answer.
format Online
Article
Text
id pubmed-10344778
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-103447782023-07-15 Dynamic, adaptive sampling during nanopore sequencing using Bayesian experimental design Weilguny, Lukas De Maio, Nicola Munro, Rory Manser, Charlotte Birney, Ewan Loose, Matthew Goldman, Nick Nat Biotechnol Article Nanopore sequencers can select which DNA molecules to sequence, rejecting a molecule after analysis of a small initial part. Currently, selection is based on predetermined regions of interest that remain constant throughout an experiment. Sequencing efforts, thus, cannot be re-focused on molecules likely contributing most to experimental success. Here we present BOSS-RUNS, an algorithmic framework and software to generate dynamically updated decision strategies. We quantify uncertainty at each genome position with real-time updates from data already observed. For each DNA fragment, we decide whether the expected decrease in uncertainty that it would provide warrants fully sequencing it, thus optimizing information gain. BOSS-RUNS mitigates coverage bias between and within members of a microbial community, leading to improved variant calling; for example, low-coverage sites of a species at 1% abundance were reduced by 87.5%, with 12.5% more single-nucleotide polymorphisms detected. Such data-driven updates to molecule selection are applicable to many sequencing scenarios, such as enriching for regions with increased divergence or low coverage, reducing time-to-answer. Nature Publishing Group US 2023-01-02 2023 /pmc/articles/PMC10344778/ /pubmed/36593407 http://dx.doi.org/10.1038/s41587-022-01580-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Weilguny, Lukas
De Maio, Nicola
Munro, Rory
Manser, Charlotte
Birney, Ewan
Loose, Matthew
Goldman, Nick
Dynamic, adaptive sampling during nanopore sequencing using Bayesian experimental design
title Dynamic, adaptive sampling during nanopore sequencing using Bayesian experimental design
title_full Dynamic, adaptive sampling during nanopore sequencing using Bayesian experimental design
title_fullStr Dynamic, adaptive sampling during nanopore sequencing using Bayesian experimental design
title_full_unstemmed Dynamic, adaptive sampling during nanopore sequencing using Bayesian experimental design
title_short Dynamic, adaptive sampling during nanopore sequencing using Bayesian experimental design
title_sort dynamic, adaptive sampling during nanopore sequencing using bayesian experimental design
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10344778/
https://www.ncbi.nlm.nih.gov/pubmed/36593407
http://dx.doi.org/10.1038/s41587-022-01580-z
work_keys_str_mv AT weilgunylukas dynamicadaptivesamplingduringnanoporesequencingusingbayesianexperimentaldesign
AT demaionicola dynamicadaptivesamplingduringnanoporesequencingusingbayesianexperimentaldesign
AT munrorory dynamicadaptivesamplingduringnanoporesequencingusingbayesianexperimentaldesign
AT mansercharlotte dynamicadaptivesamplingduringnanoporesequencingusingbayesianexperimentaldesign
AT birneyewan dynamicadaptivesamplingduringnanoporesequencingusingbayesianexperimentaldesign
AT loosematthew dynamicadaptivesamplingduringnanoporesequencingusingbayesianexperimentaldesign
AT goldmannick dynamicadaptivesamplingduringnanoporesequencingusingbayesianexperimentaldesign