Cargando…
Real-time mapping of nanopore raw signals
MOTIVATION: Oxford Nanopore Technologies sequencing devices support adaptive sequencing, in which undesired reads can be ejected from a pore in real time. This feature allows targeted sequencing aided by computational methods for mapping partial reads, rather than complex library preparation protoco...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8336444/ https://www.ncbi.nlm.nih.gov/pubmed/34252938 http://dx.doi.org/10.1093/bioinformatics/btab264 |
_version_ | 1783733321158623232 |
---|---|
author | Zhang, Haowen Li, Haoran Jain, Chirag Cheng, Haoyu Au, Kin Fai Li, Heng Aluru, Srinivas |
author_facet | Zhang, Haowen Li, Haoran Jain, Chirag Cheng, Haoyu Au, Kin Fai Li, Heng Aluru, Srinivas |
author_sort | Zhang, Haowen |
collection | PubMed |
description | MOTIVATION: Oxford Nanopore Technologies sequencing devices support adaptive sequencing, in which undesired reads can be ejected from a pore in real time. This feature allows targeted sequencing aided by computational methods for mapping partial reads, rather than complex library preparation protocols. However, existing mapping methods either require a computationally expensive base-calling procedure before using aligners to map partial reads or work well only on small genomes. RESULTS: In this work, we present a new streaming method that can map nanopore raw signals for real-time selective sequencing. Rather than converting read signals to bases, we propose to convert reference genomes to signals and fully operate in the signal space. Our method features a new way to index reference genomes using k-d trees, a novel seed selection strategy and a seed chaining algorithm tailored toward the current signal characteristics. We implemented the method as a tool Sigmap. Then we evaluated it on both simulated and real data and compared it to the state-of-the-art nanopore raw signal mapper Uncalled. Our results show that Sigmap yields comparable performance on mapping yeast simulated raw signals, and better mapping accuracy on mapping yeast real raw signals with a 4.4× speedup. Moreover, our method performed well on mapping raw signals to genomes of size >100 Mbp and correctly mapped 11.49% more real raw signals of green algae, which leads to a significantly higher F(1)-score (0.9354 versus 0.8660). AVAILABILITY AND IMPLEMENTATION: Sigmap code is accessible at https://github.com/haowenz/sigmap. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8336444 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-83364442021-08-09 Real-time mapping of nanopore raw signals Zhang, Haowen Li, Haoran Jain, Chirag Cheng, Haoyu Au, Kin Fai Li, Heng Aluru, Srinivas Bioinformatics General Computational Biology MOTIVATION: Oxford Nanopore Technologies sequencing devices support adaptive sequencing, in which undesired reads can be ejected from a pore in real time. This feature allows targeted sequencing aided by computational methods for mapping partial reads, rather than complex library preparation protocols. However, existing mapping methods either require a computationally expensive base-calling procedure before using aligners to map partial reads or work well only on small genomes. RESULTS: In this work, we present a new streaming method that can map nanopore raw signals for real-time selective sequencing. Rather than converting read signals to bases, we propose to convert reference genomes to signals and fully operate in the signal space. Our method features a new way to index reference genomes using k-d trees, a novel seed selection strategy and a seed chaining algorithm tailored toward the current signal characteristics. We implemented the method as a tool Sigmap. Then we evaluated it on both simulated and real data and compared it to the state-of-the-art nanopore raw signal mapper Uncalled. Our results show that Sigmap yields comparable performance on mapping yeast simulated raw signals, and better mapping accuracy on mapping yeast real raw signals with a 4.4× speedup. Moreover, our method performed well on mapping raw signals to genomes of size >100 Mbp and correctly mapped 11.49% more real raw signals of green algae, which leads to a significantly higher F(1)-score (0.9354 versus 0.8660). AVAILABILITY AND IMPLEMENTATION: Sigmap code is accessible at https://github.com/haowenz/sigmap. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-07-12 /pmc/articles/PMC8336444/ /pubmed/34252938 http://dx.doi.org/10.1093/bioinformatics/btab264 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | General Computational Biology Zhang, Haowen Li, Haoran Jain, Chirag Cheng, Haoyu Au, Kin Fai Li, Heng Aluru, Srinivas Real-time mapping of nanopore raw signals |
title | Real-time mapping of nanopore raw signals |
title_full | Real-time mapping of nanopore raw signals |
title_fullStr | Real-time mapping of nanopore raw signals |
title_full_unstemmed | Real-time mapping of nanopore raw signals |
title_short | Real-time mapping of nanopore raw signals |
title_sort | real-time mapping of nanopore raw signals |
topic | General Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8336444/ https://www.ncbi.nlm.nih.gov/pubmed/34252938 http://dx.doi.org/10.1093/bioinformatics/btab264 |
work_keys_str_mv | AT zhanghaowen realtimemappingofnanoporerawsignals AT lihaoran realtimemappingofnanoporerawsignals AT jainchirag realtimemappingofnanoporerawsignals AT chenghaoyu realtimemappingofnanoporerawsignals AT aukinfai realtimemappingofnanoporerawsignals AT liheng realtimemappingofnanoporerawsignals AT alurusrinivas realtimemappingofnanoporerawsignals |