Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA
There is still a lack of fast and accurate classification tools to identify the taxonomies of noisy long reads, which is a bottleneck to the use of the promising long-read metagenomic sequencing technologies. Herein, we propose de Bruijn graph-based Sparse Approximate Match Block Analyzer (deSAMBA),...
Main authors: | Li, Gaoyang; Liu, Yongzhuang; Li, Deying; Liu, Bo; Li, Junyi; Hu, Yang; Wang, Yadong |
---|---|
Format: | Online Article Text |
Language: | English |
Published: |
Frontiers Media S.A.
2021
|
Subjects: | Cell and Developmental Biology |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8127778/ https://www.ncbi.nlm.nih.gov/pubmed/34012962 http://dx.doi.org/10.3389/fcell.2021.643645 |
_version_ | 1783694012707766272 |
---|---|
author | Li, Gaoyang Liu, Yongzhuang Li, Deying Liu, Bo Li, Junyi Hu, Yang Wang, Yadong |
author_facet | Li, Gaoyang Liu, Yongzhuang Li, Deying Liu, Bo Li, Junyi Hu, Yang Wang, Yadong |
author_sort | Li, Gaoyang |
collection | PubMed |
description | There is still a lack of fast and accurate classification tools to identify the taxonomies of noisy long reads, which is a bottleneck to the use of the promising long-read metagenomic sequencing technologies. Herein, we propose de Bruijn graph-based Sparse Approximate Match Block Analyzer (deSAMBA), a tailored long-read classification approach that uses a novel pseudo alignment algorithm based on sparse approximate match block (SAMB). Benchmarks on real sequencing datasets demonstrate that deSAMBA achieves high yields and fast speed simultaneously, outperforming state-of-the-art tools and showing considerable potential for cutting-edge metagenomics studies. |
format | Online Article Text |
id | pubmed-8127778 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81277782021-05-18 Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA Li, Gaoyang Liu, Yongzhuang Li, Deying Liu, Bo Li, Junyi Hu, Yang Wang, Yadong Front Cell Dev Biol Cell and Developmental Biology There is still a lack of fast and accurate classification tools to identify the taxonomies of noisy long reads, which is a bottleneck to the use of the promising long-read metagenomic sequencing technologies. Herein, we propose de Bruijn graph-based Sparse Approximate Match Block Analyzer (deSAMBA), a tailored long-read classification approach that uses a novel pseudo alignment algorithm based on sparse approximate match block (SAMB). Benchmarks on real sequencing datasets demonstrate that deSAMBA enables to achieve high yields and fast speed simultaneously, which outperforms state-of-the-art tools and has many potentials to cutting-edge metagenomics studies. Frontiers Media S.A. 2021-04-28 /pmc/articles/PMC8127778/ /pubmed/34012962 http://dx.doi.org/10.3389/fcell.2021.643645 Text en Copyright © 2021 Li, Liu, Li, Liu, Li, Hu and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cell and Developmental Biology Li, Gaoyang Liu, Yongzhuang Li, Deying Liu, Bo Li, Junyi Hu, Yang Wang, Yadong Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA |
title | Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA |
title_sort | fast and accurate classification of meta-genomics long reads with desamba |
topic | Cell and Developmental Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8127778/ https://www.ncbi.nlm.nih.gov/pubmed/34012962 http://dx.doi.org/10.3389/fcell.2021.643645 |