Cargando…
Nanopore basecalling from a perspective of instance segmentation
BACKGROUND: Nanopore sequencing is a rapidly developing third-generation sequencing technology, which can generate long nucleotide reads of molecules within a portable device in real-time. Through detecting the change of ion currency signals during a DNA/RNA fragment’s pass through a nanopore, genot...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7178565/ https://www.ncbi.nlm.nih.gov/pubmed/32321433 http://dx.doi.org/10.1186/s12859-020-3459-0 |
_version_ | 1783525484518178816 |
---|---|
author | Zhang, Yao-zhong Akdemir, Arda Tremmel, Georg Imoto, Seiya Miyano, Satoru Shibuya, Tetsuo Yamaguchi, Rui |
author_facet | Zhang, Yao-zhong Akdemir, Arda Tremmel, Georg Imoto, Seiya Miyano, Satoru Shibuya, Tetsuo Yamaguchi, Rui |
author_sort | Zhang, Yao-zhong |
collection | PubMed |
description | BACKGROUND: Nanopore sequencing is a rapidly developing third-generation sequencing technology, which can generate long nucleotide reads of molecules within a portable device in real-time. Through detecting the change of ion currency signals during a DNA/RNA fragment’s pass through a nanopore, genotypes are determined. Currently, the accuracy of nanopore basecalling has a higher error rate than the basecalling of short-read sequencing. Through utilizing deep neural networks, the-state-of-the art nanopore basecallers achieve basecalling accuracy in a range from 85% to 95%. RESULT: In this work, we proposed a novel basecalling approach from a perspective of instance segmentation. Different from previous approaches of doing typical sequence labeling, we formulated the basecalling problem as a multi-label segmentation task. Meanwhile, we proposed a refined U-net model which we call UR-net that can model sequential dependencies for a one-dimensional segmentation task. The experiment results show that the proposed basecaller URnano achieves competitive results on the in-species data, compared to the recently proposed CTC-featured basecallers. CONCLUSION: Our results show that formulating the basecalling problem as a one-dimensional segmentation task is a promising approach, which does basecalling and segmentation jointly. |
format | Online Article Text |
id | pubmed-7178565 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-71785652020-04-24 Nanopore basecalling from a perspective of instance segmentation Zhang, Yao-zhong Akdemir, Arda Tremmel, Georg Imoto, Seiya Miyano, Satoru Shibuya, Tetsuo Yamaguchi, Rui BMC Bioinformatics Methodology BACKGROUND: Nanopore sequencing is a rapidly developing third-generation sequencing technology, which can generate long nucleotide reads of molecules within a portable device in real-time. Through detecting the change of ion currency signals during a DNA/RNA fragment’s pass through a nanopore, genotypes are determined. Currently, the accuracy of nanopore basecalling has a higher error rate than the basecalling of short-read sequencing. Through utilizing deep neural networks, the-state-of-the art nanopore basecallers achieve basecalling accuracy in a range from 85% to 95%. RESULT: In this work, we proposed a novel basecalling approach from a perspective of instance segmentation. Different from previous approaches of doing typical sequence labeling, we formulated the basecalling problem as a multi-label segmentation task. Meanwhile, we proposed a refined U-net model which we call UR-net that can model sequential dependencies for a one-dimensional segmentation task. The experiment results show that the proposed basecaller URnano achieves competitive results on the in-species data, compared to the recently proposed CTC-featured basecallers. CONCLUSION: Our results show that formulating the basecalling problem as a one-dimensional segmentation task is a promising approach, which does basecalling and segmentation jointly. BioMed Central 2020-04-23 /pmc/articles/PMC7178565/ /pubmed/32321433 http://dx.doi.org/10.1186/s12859-020-3459-0 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Zhang, Yao-zhong Akdemir, Arda Tremmel, Georg Imoto, Seiya Miyano, Satoru Shibuya, Tetsuo Yamaguchi, Rui Nanopore basecalling from a perspective of instance segmentation |
title | Nanopore basecalling from a perspective of instance segmentation |
title_full | Nanopore basecalling from a perspective of instance segmentation |
title_fullStr | Nanopore basecalling from a perspective of instance segmentation |
title_full_unstemmed | Nanopore basecalling from a perspective of instance segmentation |
title_short | Nanopore basecalling from a perspective of instance segmentation |
title_sort | nanopore basecalling from a perspective of instance segmentation |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7178565/ https://www.ncbi.nlm.nih.gov/pubmed/32321433 http://dx.doi.org/10.1186/s12859-020-3459-0 |
work_keys_str_mv | AT zhangyaozhong nanoporebasecallingfromaperspectiveofinstancesegmentation AT akdemirarda nanoporebasecallingfromaperspectiveofinstancesegmentation AT tremmelgeorg nanoporebasecallingfromaperspectiveofinstancesegmentation AT imotoseiya nanoporebasecallingfromaperspectiveofinstancesegmentation AT miyanosatoru nanoporebasecallingfromaperspectiveofinstancesegmentation AT shibuyatetsuo nanoporebasecallingfromaperspectiveofinstancesegmentation AT yamaguchirui nanoporebasecallingfromaperspectiveofinstancesegmentation |