Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA

There is still a lack of fast and accurate classification tools to identify the taxonomies of noisy long reads, which is a bottleneck to the use of the promising long-read metagenomic sequencing technologies. Herein, we propose de Bruijn graph-based Sparse Approximate Match Block Analyzer (deSAMBA),...

Bibliographic Details
Main Authors: Li, Gaoyang, Liu, Yongzhuang, Li, Deying, Liu, Bo, Li, Junyi, Hu, Yang, Wang, Yadong
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8127778/
https://www.ncbi.nlm.nih.gov/pubmed/34012962
http://dx.doi.org/10.3389/fcell.2021.643645
_version_ 1783694012707766272
author Li, Gaoyang
Liu, Yongzhuang
Li, Deying
Liu, Bo
Li, Junyi
Hu, Yang
Wang, Yadong
author_facet Li, Gaoyang
Liu, Yongzhuang
Li, Deying
Liu, Bo
Li, Junyi
Hu, Yang
Wang, Yadong
author_sort Li, Gaoyang
collection PubMed
description There is still a lack of fast and accurate classification tools to identify the taxonomies of noisy long reads, which is a bottleneck to the use of the promising long-read metagenomic sequencing technologies. Herein, we propose de Bruijn graph-based Sparse Approximate Match Block Analyzer (deSAMBA), a tailored long-read classification approach that uses a novel pseudo alignment algorithm based on sparse approximate match block (SAMB). Benchmarks on real sequencing datasets demonstrate that deSAMBA achieves high yields and fast speed simultaneously, outperforms state-of-the-art tools, and has considerable potential for cutting-edge metagenomics studies.
format Online
Article
Text
id pubmed-8127778
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-8127778 2021-05-18 Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA Li, Gaoyang Liu, Yongzhuang Li, Deying Liu, Bo Li, Junyi Hu, Yang Wang, Yadong Front Cell Dev Biol Cell and Developmental Biology There is still a lack of fast and accurate classification tools to identify the taxonomies of noisy long reads, which is a bottleneck to the use of the promising long-read metagenomic sequencing technologies. Herein, we propose de Bruijn graph-based Sparse Approximate Match Block Analyzer (deSAMBA), a tailored long-read classification approach that uses a novel pseudo alignment algorithm based on sparse approximate match block (SAMB). Benchmarks on real sequencing datasets demonstrate that deSAMBA achieves high yields and fast speed simultaneously, outperforms state-of-the-art tools, and has considerable potential for cutting-edge metagenomics studies. Frontiers Media S.A. 2021-04-28 /pmc/articles/PMC8127778/ /pubmed/34012962 http://dx.doi.org/10.3389/fcell.2021.643645 Text en Copyright © 2021 Li, Liu, Li, Liu, Li, Hu and Wang. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cell and Developmental Biology
Li, Gaoyang
Liu, Yongzhuang
Li, Deying
Liu, Bo
Li, Junyi
Hu, Yang
Wang, Yadong
Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA
title Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA
title_full Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA
title_fullStr Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA
title_full_unstemmed Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA
title_short Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA
title_sort fast and accurate classification of meta-genomics long reads with desamba
topic Cell and Developmental Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8127778/
https://www.ncbi.nlm.nih.gov/pubmed/34012962
http://dx.doi.org/10.3389/fcell.2021.643645
work_keys_str_mv AT ligaoyang fastandaccurateclassificationofmetagenomicslongreadswithdesamba
AT liuyongzhuang fastandaccurateclassificationofmetagenomicslongreadswithdesamba
AT lideying fastandaccurateclassificationofmetagenomicslongreadswithdesamba
AT liubo fastandaccurateclassificationofmetagenomicslongreadswithdesamba
AT lijunyi fastandaccurateclassificationofmetagenomicslongreadswithdesamba
AT huyang fastandaccurateclassificationofmetagenomicslongreadswithdesamba
AT wangyadong fastandaccurateclassificationofmetagenomicslongreadswithdesamba