Cargando…
Accelerated nanopore basecalling with SLOW5 data format
MOTIVATION: Nanopore sequencing is emerging as a key pillar in the genomic technology landscape but computational constraints limiting its scalability remain to be overcome. The translation of raw current signal data into DNA or RNA sequence reads, known as ‘basecalling’, is a major friction in any...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10261880/ https://www.ncbi.nlm.nih.gov/pubmed/37252813 http://dx.doi.org/10.1093/bioinformatics/btad352 |
_version_ | 1785057964549210112 |
---|---|
author | Samarakoon, Hiruna Ferguson, James M Gamaarachchi, Hasindu Deveson, Ira W |
author_facet | Samarakoon, Hiruna Ferguson, James M Gamaarachchi, Hasindu Deveson, Ira W |
author_sort | Samarakoon, Hiruna |
collection | PubMed |
description | MOTIVATION: Nanopore sequencing is emerging as a key pillar in the genomic technology landscape but computational constraints limiting its scalability remain to be overcome. The translation of raw current signal data into DNA or RNA sequence reads, known as ‘basecalling’, is a major friction in any nanopore sequencing workflow. Here, we exploit the advantages of the recently developed signal data format ‘SLOW5’ to streamline and accelerate nanopore basecalling on high-performance computing (HPC) and cloud environments. RESULTS: SLOW5 permits highly efficient sequential data access, eliminating a potential analysis bottleneck. To take advantage of this, we introduce Buttery-eel, an open-source wrapper for Oxford Nanopore’s Guppy basecaller that enables SLOW5 data access, resulting in performance improvements that are essential for scalable, affordable basecalling. AVAILABILITY AND IMPLEMENTATION: Buttery-eel is available at https://github.com/Psy-Fer/buttery-eel. |
format | Online Article Text |
id | pubmed-10261880 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-102618802023-06-15 Accelerated nanopore basecalling with SLOW5 data format Samarakoon, Hiruna Ferguson, James M Gamaarachchi, Hasindu Deveson, Ira W Bioinformatics Original Paper MOTIVATION: Nanopore sequencing is emerging as a key pillar in the genomic technology landscape but computational constraints limiting its scalability remain to be overcome. The translation of raw current signal data into DNA or RNA sequence reads, known as ‘basecalling’, is a major friction in any nanopore sequencing workflow. Here, we exploit the advantages of the recently developed signal data format ‘SLOW5’ to streamline and accelerate nanopore basecalling on high-performance computing (HPC) and cloud environments. RESULTS: SLOW5 permits highly efficient sequential data access, eliminating a potential analysis bottleneck. To take advantage of this, we introduce Buttery-eel, an open-source wrapper for Oxford Nanopore’s Guppy basecaller that enables SLOW5 data access, resulting in performance improvements that are essential for scalable, affordable basecalling. AVAILABILITY AND IMPLEMENTATION: Buttery-eel is available at https://github.com/Psy-Fer/buttery-eel. Oxford University Press 2023-05-30 /pmc/articles/PMC10261880/ /pubmed/37252813 http://dx.doi.org/10.1093/bioinformatics/btad352 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Samarakoon, Hiruna Ferguson, James M Gamaarachchi, Hasindu Deveson, Ira W Accelerated nanopore basecalling with SLOW5 data format |
title | Accelerated nanopore basecalling with SLOW5 data format |
title_full | Accelerated nanopore basecalling with SLOW5 data format |
title_fullStr | Accelerated nanopore basecalling with SLOW5 data format |
title_full_unstemmed | Accelerated nanopore basecalling with SLOW5 data format |
title_short | Accelerated nanopore basecalling with SLOW5 data format |
title_sort | accelerated nanopore basecalling with slow5 data format |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10261880/ https://www.ncbi.nlm.nih.gov/pubmed/37252813 http://dx.doi.org/10.1093/bioinformatics/btad352 |
work_keys_str_mv | AT samarakoonhiruna acceleratednanoporebasecallingwithslow5dataformat AT fergusonjamesm acceleratednanoporebasecallingwithslow5dataformat AT gamaarachchihasindu acceleratednanoporebasecallingwithslow5dataformat AT devesoniraw acceleratednanoporebasecallingwithslow5dataformat |