Cargando…
Accelerating metagenomic read classification on CUDA-enabled GPUs
BACKGROUND: Metagenomic sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification; i.e. the assignment of each read to a taxonomic lab...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209836/ https://www.ncbi.nlm.nih.gov/pubmed/28049411 http://dx.doi.org/10.1186/s12859-016-1434-6 |
_version_ | 1782490802827558912 |
---|---|
author | Kobus, Robin Hundt, Christian Müller, André Schmidt, Bertil |
author_facet | Kobus, Robin Hundt, Christian Müller, André Schmidt, Bertil |
author_sort | Kobus, Robin |
collection | PubMed |
description | BACKGROUND: Metagenomic sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification; i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes software tools for fast and accurate metagenomic read classification are urgently needed. RESULTS: We present cuCLARK, a read-level classifier for CUDA-enabled GPUs, based on the fast and accurate classification of metagenomic sequences using reduced k-mers (CLARK) method. Using the processing power of a single Titan X GPU, cuCLARK can reach classification speeds of up to 50 million reads per minute. Corresponding speedups for species- (genus-)level classification range between 3.2 and 6.6 (3.7 and 6.4) compared to multi-threaded CLARK executed on a 16-core Xeon CPU workstation. CONCLUSION: cuCLARK can perform metagenomic read classification at superior speeds on CUDA-enabled GPUs. It is free software licensed under GPL and can be downloaded at https://github.com/funatiq/cuclark free of charge. |
format | Online Article Text |
id | pubmed-5209836 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52098362017-01-04 Accelerating metagenomic read classification on CUDA-enabled GPUs Kobus, Robin Hundt, Christian Müller, André Schmidt, Bertil BMC Bioinformatics Software BACKGROUND: Metagenomic sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification; i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes software tools for fast and accurate metagenomic read classification are urgently needed. RESULTS: We present cuCLARK, a read-level classifier for CUDA-enabled GPUs, based on the fast and accurate classification of metagenomic sequences using reduced k-mers (CLARK) method. Using the processing power of a single Titan X GPU, cuCLARK can reach classification speeds of up to 50 million reads per minute. Corresponding speedups for species- (genus-)level classification range between 3.2 and 6.6 (3.7 and 6.4) compared to multi-threaded CLARK executed on a 16-core Xeon CPU workstation. CONCLUSION: cuCLARK can perform metagenomic read classification at superior speeds on CUDA-enabled GPUs. It is free software licensed under GPL and can be downloaded at https://github.com/funatiq/cuclark free of charge. BioMed Central 2017-01-03 /pmc/articles/PMC5209836/ /pubmed/28049411 http://dx.doi.org/10.1186/s12859-016-1434-6 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Kobus, Robin Hundt, Christian Müller, André Schmidt, Bertil Accelerating metagenomic read classification on CUDA-enabled GPUs |
title | Accelerating metagenomic read classification on CUDA-enabled GPUs |
title_full | Accelerating metagenomic read classification on CUDA-enabled GPUs |
title_fullStr | Accelerating metagenomic read classification on CUDA-enabled GPUs |
title_full_unstemmed | Accelerating metagenomic read classification on CUDA-enabled GPUs |
title_short | Accelerating metagenomic read classification on CUDA-enabled GPUs |
title_sort | accelerating metagenomic read classification on cuda-enabled gpus |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209836/ https://www.ncbi.nlm.nih.gov/pubmed/28049411 http://dx.doi.org/10.1186/s12859-016-1434-6 |
work_keys_str_mv | AT kobusrobin acceleratingmetagenomicreadclassificationoncudaenabledgpus AT hundtchristian acceleratingmetagenomicreadclassificationoncudaenabledgpus AT mullerandre acceleratingmetagenomicreadclassificationoncudaenabledgpus AT schmidtbertil acceleratingmetagenomicreadclassificationoncudaenabledgpus |