Cargando…

ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing

MOTIVATION: Nanopore sequencers allow targeted sequencing of interesting nucleotide sequences by rejecting other sequences from individual pores. This feature facilitates the enrichment of low-abundant sequences by depleting overrepresented ones in-silico. Existing tools for adaptive sampling either...

Descripción completa

Detalles Bibliográficos
Autores principales: Ulrich, Jens-Uwe, Lutfi, Ahmad, Rutzen, Kilian, Renard, Bernhard Y
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9235500/
https://www.ncbi.nlm.nih.gov/pubmed/35758774
http://dx.doi.org/10.1093/bioinformatics/btac223
_version_ 1784736325321097216
author Ulrich, Jens-Uwe
Lutfi, Ahmad
Rutzen, Kilian
Renard, Bernhard Y
author_facet Ulrich, Jens-Uwe
Lutfi, Ahmad
Rutzen, Kilian
Renard, Bernhard Y
author_sort Ulrich, Jens-Uwe
collection PubMed
description MOTIVATION: Nanopore sequencers allow targeted sequencing of interesting nucleotide sequences by rejecting other sequences from individual pores. This feature facilitates the enrichment of low-abundant sequences by depleting overrepresented ones in-silico. Existing tools for adaptive sampling either apply signal alignment, which cannot handle human-sized reference sequences, or apply read mapping in sequence space relying on fast graphical processing units (GPU) base callers for real-time read rejection. Using nanopore long-read mapping tools is also not optimal when mapping shorter reads as usually analyzed in adaptive sampling applications. RESULTS: Here, we present a new approach for nanopore adaptive sampling that combines fast CPU and GPU base calling with read classification based on Interleaved Bloom Filters. ReadBouncer improves the potential enrichment of low abundance sequences by its high read classification sensitivity and specificity, outperforming existing tools in the field. It robustly removes even reads belonging to large reference sequences while running on commodity hardware without GPUs, making adaptive sampling accessible for in-field researchers. Readbouncer also provides a user-friendly interface and installer files for end-users without a bioinformatics background. AVAILABILITY AND IMPLEMENTATION: The C++ source code is available at https://gitlab.com/dacs-hpi/readbouncer. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9235500
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92355002022-06-29 ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing Ulrich, Jens-Uwe Lutfi, Ahmad Rutzen, Kilian Renard, Bernhard Y Bioinformatics ISCB/Ismb 2022 MOTIVATION: Nanopore sequencers allow targeted sequencing of interesting nucleotide sequences by rejecting other sequences from individual pores. This feature facilitates the enrichment of low-abundant sequences by depleting overrepresented ones in-silico. Existing tools for adaptive sampling either apply signal alignment, which cannot handle human-sized reference sequences, or apply read mapping in sequence space relying on fast graphical processing units (GPU) base callers for real-time read rejection. Using nanopore long-read mapping tools is also not optimal when mapping shorter reads as usually analyzed in adaptive sampling applications. RESULTS: Here, we present a new approach for nanopore adaptive sampling that combines fast CPU and GPU base calling with read classification based on Interleaved Bloom Filters. ReadBouncer improves the potential enrichment of low abundance sequences by its high read classification sensitivity and specificity, outperforming existing tools in the field. It robustly removes even reads belonging to large reference sequences while running on commodity hardware without GPUs, making adaptive sampling accessible for in-field researchers. Readbouncer also provides a user-friendly interface and installer files for end-users without a bioinformatics background. AVAILABILITY AND IMPLEMENTATION: The C++ source code is available at https://gitlab.com/dacs-hpi/readbouncer. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-06-27 /pmc/articles/PMC9235500/ /pubmed/35758774 http://dx.doi.org/10.1093/bioinformatics/btac223 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle ISCB/Ismb 2022
Ulrich, Jens-Uwe
Lutfi, Ahmad
Rutzen, Kilian
Renard, Bernhard Y
ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing
title ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing
title_full ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing
title_fullStr ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing
title_full_unstemmed ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing
title_short ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing
title_sort readbouncer: precise and scalable adaptive sampling for nanopore sequencing
topic ISCB/Ismb 2022
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9235500/
https://www.ncbi.nlm.nih.gov/pubmed/35758774
http://dx.doi.org/10.1093/bioinformatics/btac223
work_keys_str_mv AT ulrichjensuwe readbouncerpreciseandscalableadaptivesamplingfornanoporesequencing
AT lutfiahmad readbouncerpreciseandscalableadaptivesamplingfornanoporesequencing
AT rutzenkilian readbouncerpreciseandscalableadaptivesamplingfornanoporesequencing
AT renardbernhardy readbouncerpreciseandscalableadaptivesamplingfornanoporesequencing