Cargando…

Accelerating metagenomic read classification on CUDA-enabled GPUs

BACKGROUND: Metagenomic sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification; i.e. the assignment of each read to a taxonomic lab...

Descripción completa

Detalles Bibliográficos
Autores principales: Kobus, Robin, Hundt, Christian, Müller, André, Schmidt, Bertil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209836/
https://www.ncbi.nlm.nih.gov/pubmed/28049411
http://dx.doi.org/10.1186/s12859-016-1434-6
_version_ 1782490802827558912
author Kobus, Robin
Hundt, Christian
Müller, André
Schmidt, Bertil
author_facet Kobus, Robin
Hundt, Christian
Müller, André
Schmidt, Bertil
author_sort Kobus, Robin
collection PubMed
description BACKGROUND: Metagenomic sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification; i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes software tools for fast and accurate metagenomic read classification are urgently needed. RESULTS: We present cuCLARK, a read-level classifier for CUDA-enabled GPUs, based on the fast and accurate classification of metagenomic sequences using reduced k-mers (CLARK) method. Using the processing power of a single Titan X GPU, cuCLARK can reach classification speeds of up to 50 million reads per minute. Corresponding speedups for species- (genus-)level classification range between 3.2 and 6.6 (3.7 and 6.4) compared to multi-threaded CLARK executed on a 16-core Xeon CPU workstation. CONCLUSION: cuCLARK can perform metagenomic read classification at superior speeds on CUDA-enabled GPUs. It is free software licensed under GPL and can be downloaded at https://github.com/funatiq/cuclark free of charge.
format Online
Article
Text
id pubmed-5209836
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52098362017-01-04 Accelerating metagenomic read classification on CUDA-enabled GPUs Kobus, Robin Hundt, Christian Müller, André Schmidt, Bertil BMC Bioinformatics Software BACKGROUND: Metagenomic sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification; i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes software tools for fast and accurate metagenomic read classification are urgently needed. RESULTS: We present cuCLARK, a read-level classifier for CUDA-enabled GPUs, based on the fast and accurate classification of metagenomic sequences using reduced k-mers (CLARK) method. Using the processing power of a single Titan X GPU, cuCLARK can reach classification speeds of up to 50 million reads per minute. Corresponding speedups for species- (genus-)level classification range between 3.2 and 6.6 (3.7 and 6.4) compared to multi-threaded CLARK executed on a 16-core Xeon CPU workstation. CONCLUSION: cuCLARK can perform metagenomic read classification at superior speeds on CUDA-enabled GPUs. It is free software licensed under GPL and can be downloaded at https://github.com/funatiq/cuclark free of charge. BioMed Central 2017-01-03 /pmc/articles/PMC5209836/ /pubmed/28049411 http://dx.doi.org/10.1186/s12859-016-1434-6 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Kobus, Robin
Hundt, Christian
Müller, André
Schmidt, Bertil
Accelerating metagenomic read classification on CUDA-enabled GPUs
title Accelerating metagenomic read classification on CUDA-enabled GPUs
title_full Accelerating metagenomic read classification on CUDA-enabled GPUs
title_fullStr Accelerating metagenomic read classification on CUDA-enabled GPUs
title_full_unstemmed Accelerating metagenomic read classification on CUDA-enabled GPUs
title_short Accelerating metagenomic read classification on CUDA-enabled GPUs
title_sort accelerating metagenomic read classification on cuda-enabled gpus
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209836/
https://www.ncbi.nlm.nih.gov/pubmed/28049411
http://dx.doi.org/10.1186/s12859-016-1434-6
work_keys_str_mv AT kobusrobin acceleratingmetagenomicreadclassificationoncudaenabledgpus
AT hundtchristian acceleratingmetagenomicreadclassificationoncudaenabledgpus
AT mullerandre acceleratingmetagenomicreadclassificationoncudaenabledgpus
AT schmidtbertil acceleratingmetagenomicreadclassificationoncudaenabledgpus