Cargando…

Nanopore basecalling from a perspective of instance segmentation

BACKGROUND: Nanopore sequencing is a rapidly developing third-generation sequencing technology, which can generate long nucleotide reads of molecules within a portable device in real-time. Through detecting the change of ion currency signals during a DNA/RNA fragment’s pass through a nanopore, genot...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yao-zhong, Akdemir, Arda, Tremmel, Georg, Imoto, Seiya, Miyano, Satoru, Shibuya, Tetsuo, Yamaguchi, Rui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7178565/
https://www.ncbi.nlm.nih.gov/pubmed/32321433
http://dx.doi.org/10.1186/s12859-020-3459-0
_version_ 1783525484518178816
author Zhang, Yao-zhong
Akdemir, Arda
Tremmel, Georg
Imoto, Seiya
Miyano, Satoru
Shibuya, Tetsuo
Yamaguchi, Rui
author_facet Zhang, Yao-zhong
Akdemir, Arda
Tremmel, Georg
Imoto, Seiya
Miyano, Satoru
Shibuya, Tetsuo
Yamaguchi, Rui
author_sort Zhang, Yao-zhong
collection PubMed
description BACKGROUND: Nanopore sequencing is a rapidly developing third-generation sequencing technology, which can generate long nucleotide reads of molecules within a portable device in real-time. Through detecting the change of ion currency signals during a DNA/RNA fragment’s pass through a nanopore, genotypes are determined. Currently, the accuracy of nanopore basecalling has a higher error rate than the basecalling of short-read sequencing. Through utilizing deep neural networks, the-state-of-the art nanopore basecallers achieve basecalling accuracy in a range from 85% to 95%. RESULT: In this work, we proposed a novel basecalling approach from a perspective of instance segmentation. Different from previous approaches of doing typical sequence labeling, we formulated the basecalling problem as a multi-label segmentation task. Meanwhile, we proposed a refined U-net model which we call UR-net that can model sequential dependencies for a one-dimensional segmentation task. The experiment results show that the proposed basecaller URnano achieves competitive results on the in-species data, compared to the recently proposed CTC-featured basecallers. CONCLUSION: Our results show that formulating the basecalling problem as a one-dimensional segmentation task is a promising approach, which does basecalling and segmentation jointly.
format Online
Article
Text
id pubmed-7178565
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71785652020-04-24 Nanopore basecalling from a perspective of instance segmentation Zhang, Yao-zhong Akdemir, Arda Tremmel, Georg Imoto, Seiya Miyano, Satoru Shibuya, Tetsuo Yamaguchi, Rui BMC Bioinformatics Methodology BACKGROUND: Nanopore sequencing is a rapidly developing third-generation sequencing technology, which can generate long nucleotide reads of molecules within a portable device in real-time. Through detecting the change of ion currency signals during a DNA/RNA fragment’s pass through a nanopore, genotypes are determined. Currently, the accuracy of nanopore basecalling has a higher error rate than the basecalling of short-read sequencing. Through utilizing deep neural networks, the-state-of-the art nanopore basecallers achieve basecalling accuracy in a range from 85% to 95%. RESULT: In this work, we proposed a novel basecalling approach from a perspective of instance segmentation. Different from previous approaches of doing typical sequence labeling, we formulated the basecalling problem as a multi-label segmentation task. Meanwhile, we proposed a refined U-net model which we call UR-net that can model sequential dependencies for a one-dimensional segmentation task. The experiment results show that the proposed basecaller URnano achieves competitive results on the in-species data, compared to the recently proposed CTC-featured basecallers. CONCLUSION: Our results show that formulating the basecalling problem as a one-dimensional segmentation task is a promising approach, which does basecalling and segmentation jointly. BioMed Central 2020-04-23 /pmc/articles/PMC7178565/ /pubmed/32321433 http://dx.doi.org/10.1186/s12859-020-3459-0 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Zhang, Yao-zhong
Akdemir, Arda
Tremmel, Georg
Imoto, Seiya
Miyano, Satoru
Shibuya, Tetsuo
Yamaguchi, Rui
Nanopore basecalling from a perspective of instance segmentation
title Nanopore basecalling from a perspective of instance segmentation
title_full Nanopore basecalling from a perspective of instance segmentation
title_fullStr Nanopore basecalling from a perspective of instance segmentation
title_full_unstemmed Nanopore basecalling from a perspective of instance segmentation
title_short Nanopore basecalling from a perspective of instance segmentation
title_sort nanopore basecalling from a perspective of instance segmentation
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7178565/
https://www.ncbi.nlm.nih.gov/pubmed/32321433
http://dx.doi.org/10.1186/s12859-020-3459-0
work_keys_str_mv AT zhangyaozhong nanoporebasecallingfromaperspectiveofinstancesegmentation
AT akdemirarda nanoporebasecallingfromaperspectiveofinstancesegmentation
AT tremmelgeorg nanoporebasecallingfromaperspectiveofinstancesegmentation
AT imotoseiya nanoporebasecallingfromaperspectiveofinstancesegmentation
AT miyanosatoru nanoporebasecallingfromaperspectiveofinstancesegmentation
AT shibuyatetsuo nanoporebasecallingfromaperspectiveofinstancesegmentation
AT yamaguchirui nanoporebasecallingfromaperspectiveofinstancesegmentation