Cargando…
AKE - the Accelerated k-mer Exploration web-tool for rapid taxonomic classification and visualization
BACKGROUND: With the advent of low cost, fast sequencing technologies metagenomic analyses are made possible. The large data volumes gathered by these techniques and the unpredictable diversity captured in them are still, however, a challenge for computational biology. RESULTS: In this paper we addr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4307196/ https://www.ncbi.nlm.nih.gov/pubmed/25495116 http://dx.doi.org/10.1186/s12859-014-0384-0 |
_version_ | 1782354419196624896 |
---|---|
author | Langenkämper, Daniel Goesmann, Alexander Nattkemper, Tim Wilhelm |
author_facet | Langenkämper, Daniel Goesmann, Alexander Nattkemper, Tim Wilhelm |
author_sort | Langenkämper, Daniel |
collection | PubMed |
description | BACKGROUND: With the advent of low cost, fast sequencing technologies metagenomic analyses are made possible. The large data volumes gathered by these techniques and the unpredictable diversity captured in them are still, however, a challenge for computational biology. RESULTS: In this paper we address the problem of rapid taxonomic assignment with small and adaptive data models (< 5 MB) and present the accelerated k-mer explorer (AKE). Acceleration in AKE’s taxonomic assignments is achieved by a special machine learning architecture, which is well suited to model data collections that are intrinsically hierarchical. We report classification accuracy reasonably well for ranks down to order, observed on a study on real world data (Acid Mine Drainage, Cow Rumen). CONCLUSION: We show that the execution time of this approach is orders of magnitude shorter than competitive approaches and that accuracy is comparable. The tool is presented to the public as a web application (url: https://ani.cebitec.uni-bielefeld.de/ake/, username: bmc, password: bmcbioinfo). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0384-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4307196 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43071962015-02-03 AKE - the Accelerated k-mer Exploration web-tool for rapid taxonomic classification and visualization Langenkämper, Daniel Goesmann, Alexander Nattkemper, Tim Wilhelm BMC Bioinformatics Methodology Article BACKGROUND: With the advent of low cost, fast sequencing technologies metagenomic analyses are made possible. The large data volumes gathered by these techniques and the unpredictable diversity captured in them are still, however, a challenge for computational biology. RESULTS: In this paper we address the problem of rapid taxonomic assignment with small and adaptive data models (< 5 MB) and present the accelerated k-mer explorer (AKE). Acceleration in AKE’s taxonomic assignments is achieved by a special machine learning architecture, which is well suited to model data collections that are intrinsically hierarchical. We report classification accuracy reasonably well for ranks down to order, observed on a study on real world data (Acid Mine Drainage, Cow Rumen). CONCLUSION: We show that the execution time of this approach is orders of magnitude shorter than competitive approaches and that accuracy is comparable. The tool is presented to the public as a web application (url: https://ani.cebitec.uni-bielefeld.de/ake/, username: bmc, password: bmcbioinfo). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0384-0) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-13 /pmc/articles/PMC4307196/ /pubmed/25495116 http://dx.doi.org/10.1186/s12859-014-0384-0 Text en © Langenkämper et al.; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Langenkämper, Daniel Goesmann, Alexander Nattkemper, Tim Wilhelm AKE - the Accelerated k-mer Exploration web-tool for rapid taxonomic classification and visualization |
title | AKE - the Accelerated k-mer Exploration web-tool for rapid taxonomic classification and visualization |
title_full | AKE - the Accelerated k-mer Exploration web-tool for rapid taxonomic classification and visualization |
title_fullStr | AKE - the Accelerated k-mer Exploration web-tool for rapid taxonomic classification and visualization |
title_full_unstemmed | AKE - the Accelerated k-mer Exploration web-tool for rapid taxonomic classification and visualization |
title_short | AKE - the Accelerated k-mer Exploration web-tool for rapid taxonomic classification and visualization |
title_sort | ake - the accelerated k-mer exploration web-tool for rapid taxonomic classification and visualization |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4307196/ https://www.ncbi.nlm.nih.gov/pubmed/25495116 http://dx.doi.org/10.1186/s12859-014-0384-0 |
work_keys_str_mv | AT langenkamperdaniel aketheacceleratedkmerexplorationwebtoolforrapidtaxonomicclassificationandvisualization AT goesmannalexander aketheacceleratedkmerexplorationwebtoolforrapidtaxonomicclassificationandvisualization AT nattkempertimwilhelm aketheacceleratedkmerexplorationwebtoolforrapidtaxonomicclassificationandvisualization |