Cargando…

Taxonomic analysis of metagenomic data with kASA

The taxonomic analysis of sequencing data has become important in many areas of life sciences. However, currently available tools for that purpose either consume large amounts of RAM or yield insufficient quality and robustness. Here, we present kASA, a k-mer based tool capable of identifying and pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Weging, Silvio, Gogol-Döring, Andreas, Grosse, Ivo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8266618/
https://www.ncbi.nlm.nih.gov/pubmed/33784400
http://dx.doi.org/10.1093/nar/gkab200
_version_ 1783719980840255488
author Weging, Silvio
Gogol-Döring, Andreas
Grosse, Ivo
author_facet Weging, Silvio
Gogol-Döring, Andreas
Grosse, Ivo
author_sort Weging, Silvio
collection PubMed
description The taxonomic analysis of sequencing data has become important in many areas of life sciences. However, currently available tools for that purpose either consume large amounts of RAM or yield insufficient quality and robustness. Here, we present kASA, a k-mer based tool capable of identifying and profiling metagenomic DNA or protein sequences with high computational efficiency and a user-definable memory footprint. We ensure both high sensitivity and precision by using an amino acid-like encoding of k-mers together with a range of multiple k’s. Custom algorithms and data structures optimized for external memory storage enable a full-scale taxonomic analysis without compromise on laptop, desktop, and HPCC.
format Online
Article
Text
id pubmed-8266618
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-82666182021-07-09 Taxonomic analysis of metagenomic data with kASA Weging, Silvio Gogol-Döring, Andreas Grosse, Ivo Nucleic Acids Res Methods Online The taxonomic analysis of sequencing data has become important in many areas of life sciences. However, currently available tools for that purpose either consume large amounts of RAM or yield insufficient quality and robustness. Here, we present kASA, a k-mer based tool capable of identifying and profiling metagenomic DNA or protein sequences with high computational efficiency and a user-definable memory footprint. We ensure both high sensitivity and precision by using an amino acid-like encoding of k-mers together with a range of multiple k’s. Custom algorithms and data structures optimized for external memory storage enable a full-scale taxonomic analysis without compromise on laptop, desktop, and HPCC. Oxford University Press 2021-03-30 /pmc/articles/PMC8266618/ /pubmed/33784400 http://dx.doi.org/10.1093/nar/gkab200 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Weging, Silvio
Gogol-Döring, Andreas
Grosse, Ivo
Taxonomic analysis of metagenomic data with kASA
title Taxonomic analysis of metagenomic data with kASA
title_full Taxonomic analysis of metagenomic data with kASA
title_fullStr Taxonomic analysis of metagenomic data with kASA
title_full_unstemmed Taxonomic analysis of metagenomic data with kASA
title_short Taxonomic analysis of metagenomic data with kASA
title_sort taxonomic analysis of metagenomic data with kasa
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8266618/
https://www.ncbi.nlm.nih.gov/pubmed/33784400
http://dx.doi.org/10.1093/nar/gkab200
work_keys_str_mv AT wegingsilvio taxonomicanalysisofmetagenomicdatawithkasa
AT gogoldoringandreas taxonomicanalysisofmetagenomicdatawithkasa
AT grosseivo taxonomicanalysisofmetagenomicdatawithkasa