Cargando…

Real-time mapping of nanopore raw signals

MOTIVATION: Oxford Nanopore Technologies sequencing devices support adaptive sequencing, in which undesired reads can be ejected from a pore in real time. This feature allows targeted sequencing aided by computational methods for mapping partial reads, rather than complex library preparation protoco...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Haowen, Li, Haoran, Jain, Chirag, Cheng, Haoyu, Au, Kin Fai, Li, Heng, Aluru, Srinivas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8336444/
https://www.ncbi.nlm.nih.gov/pubmed/34252938
http://dx.doi.org/10.1093/bioinformatics/btab264
_version_ 1783733321158623232
author Zhang, Haowen
Li, Haoran
Jain, Chirag
Cheng, Haoyu
Au, Kin Fai
Li, Heng
Aluru, Srinivas
author_facet Zhang, Haowen
Li, Haoran
Jain, Chirag
Cheng, Haoyu
Au, Kin Fai
Li, Heng
Aluru, Srinivas
author_sort Zhang, Haowen
collection PubMed
description MOTIVATION: Oxford Nanopore Technologies sequencing devices support adaptive sequencing, in which undesired reads can be ejected from a pore in real time. This feature allows targeted sequencing aided by computational methods for mapping partial reads, rather than complex library preparation protocols. However, existing mapping methods either require a computationally expensive base-calling procedure before using aligners to map partial reads or work well only on small genomes. RESULTS: In this work, we present a new streaming method that can map nanopore raw signals for real-time selective sequencing. Rather than converting read signals to bases, we propose to convert reference genomes to signals and fully operate in the signal space. Our method features a new way to index reference genomes using k-d trees, a novel seed selection strategy and a seed chaining algorithm tailored toward the current signal characteristics. We implemented the method as a tool Sigmap. Then we evaluated it on both simulated and real data and compared it to the state-of-the-art nanopore raw signal mapper Uncalled. Our results show that Sigmap yields comparable performance on mapping yeast simulated raw signals, and better mapping accuracy on mapping yeast real raw signals with a 4.4× speedup. Moreover, our method performed well on mapping raw signals to genomes of size >100 Mbp and correctly mapped 11.49% more real raw signals of green algae, which leads to a significantly higher F(1)-score (0.9354 versus 0.8660). AVAILABILITY AND IMPLEMENTATION: Sigmap code is accessible at https://github.com/haowenz/sigmap. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8336444
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-83364442021-08-09 Real-time mapping of nanopore raw signals Zhang, Haowen Li, Haoran Jain, Chirag Cheng, Haoyu Au, Kin Fai Li, Heng Aluru, Srinivas Bioinformatics General Computational Biology MOTIVATION: Oxford Nanopore Technologies sequencing devices support adaptive sequencing, in which undesired reads can be ejected from a pore in real time. This feature allows targeted sequencing aided by computational methods for mapping partial reads, rather than complex library preparation protocols. However, existing mapping methods either require a computationally expensive base-calling procedure before using aligners to map partial reads or work well only on small genomes. RESULTS: In this work, we present a new streaming method that can map nanopore raw signals for real-time selective sequencing. Rather than converting read signals to bases, we propose to convert reference genomes to signals and fully operate in the signal space. Our method features a new way to index reference genomes using k-d trees, a novel seed selection strategy and a seed chaining algorithm tailored toward the current signal characteristics. We implemented the method as a tool Sigmap. Then we evaluated it on both simulated and real data and compared it to the state-of-the-art nanopore raw signal mapper Uncalled. Our results show that Sigmap yields comparable performance on mapping yeast simulated raw signals, and better mapping accuracy on mapping yeast real raw signals with a 4.4× speedup. Moreover, our method performed well on mapping raw signals to genomes of size >100 Mbp and correctly mapped 11.49% more real raw signals of green algae, which leads to a significantly higher F(1)-score (0.9354 versus 0.8660). AVAILABILITY AND IMPLEMENTATION: Sigmap code is accessible at https://github.com/haowenz/sigmap. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-07-12 /pmc/articles/PMC8336444/ /pubmed/34252938 http://dx.doi.org/10.1093/bioinformatics/btab264 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle General Computational Biology
Zhang, Haowen
Li, Haoran
Jain, Chirag
Cheng, Haoyu
Au, Kin Fai
Li, Heng
Aluru, Srinivas
Real-time mapping of nanopore raw signals
title Real-time mapping of nanopore raw signals
title_full Real-time mapping of nanopore raw signals
title_fullStr Real-time mapping of nanopore raw signals
title_full_unstemmed Real-time mapping of nanopore raw signals
title_short Real-time mapping of nanopore raw signals
title_sort real-time mapping of nanopore raw signals
topic General Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8336444/
https://www.ncbi.nlm.nih.gov/pubmed/34252938
http://dx.doi.org/10.1093/bioinformatics/btab264
work_keys_str_mv AT zhanghaowen realtimemappingofnanoporerawsignals
AT lihaoran realtimemappingofnanoporerawsignals
AT jainchirag realtimemappingofnanoporerawsignals
AT chenghaoyu realtimemappingofnanoporerawsignals
AT aukinfai realtimemappingofnanoporerawsignals
AT liheng realtimemappingofnanoporerawsignals
AT alurusrinivas realtimemappingofnanoporerawsignals