Cargando…

GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs

BACKGROUND: The analysis of biological networks has become a major challenge due to the recent development of high-throughput techniques that are rapidly producing very large data sets. The exploding volumes of biological data are craving for extreme computational power and special computing facilit...

Descripción completa

Detalles Bibliográficos
Autores principales: Arefin, Ahmed Shamsul, Riveros, Carlos, Berretta, Regina, Moscato, Pablo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3429408/
https://www.ncbi.nlm.nih.gov/pubmed/22937144
http://dx.doi.org/10.1371/journal.pone.0044000
_version_ 1782241787859959808
author Arefin, Ahmed Shamsul
Riveros, Carlos
Berretta, Regina
Moscato, Pablo
author_facet Arefin, Ahmed Shamsul
Riveros, Carlos
Berretta, Regina
Moscato, Pablo
author_sort Arefin, Ahmed Shamsul
collection PubMed
description BACKGROUND: The analysis of biological networks has become a major challenge due to the recent development of high-throughput techniques that are rapidly producing very large data sets. The exploding volumes of biological data are craving for extreme computational power and special computing facilities (i.e. super-computers). An inexpensive solution, such as General Purpose computation based on Graphics Processing Units (GPGPU), can be adapted to tackle this challenge, but the limitation of the device internal memory can pose a new problem of scalability. An efficient data and computational parallelism with partitioning is required to provide a fast and scalable solution to this problem. RESULTS: We propose an efficient parallel formulation of the k-Nearest Neighbour (kNN) search problem, which is a popular method for classifying objects in several fields of research, such as pattern recognition, machine learning and bioinformatics. Being very simple and straightforward, the performance of the kNN search degrades dramatically for large data sets, since the task is computationally intensive. The proposed approach is not only fast but also scalable to large-scale instances. Based on our approach, we implemented a software tool GPU-FS-kNN (GPU-based Fast and Scalable k-Nearest Neighbour) for CUDA enabled GPUs. The basic approach is simple and adaptable to other available GPU architectures. We observed speed-ups of 50–60 times compared with CPU implementation on a well-known breast microarray study and its associated data sets. CONCLUSION: Our GPU-based Fast and Scalable k-Nearest Neighbour search technique (GPU-FS-kNN) provides a significant performance improvement for nearest neighbour computation in large-scale networks. Source code and the software tool is available under GNU Public License (GPL) at https://sourceforge.net/p/gpufsknn/.
format Online
Article
Text
id pubmed-3429408
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34294082012-08-30 GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs Arefin, Ahmed Shamsul Riveros, Carlos Berretta, Regina Moscato, Pablo PLoS One Research Article BACKGROUND: The analysis of biological networks has become a major challenge due to the recent development of high-throughput techniques that are rapidly producing very large data sets. The exploding volumes of biological data are craving for extreme computational power and special computing facilities (i.e. super-computers). An inexpensive solution, such as General Purpose computation based on Graphics Processing Units (GPGPU), can be adapted to tackle this challenge, but the limitation of the device internal memory can pose a new problem of scalability. An efficient data and computational parallelism with partitioning is required to provide a fast and scalable solution to this problem. RESULTS: We propose an efficient parallel formulation of the k-Nearest Neighbour (kNN) search problem, which is a popular method for classifying objects in several fields of research, such as pattern recognition, machine learning and bioinformatics. Being very simple and straightforward, the performance of the kNN search degrades dramatically for large data sets, since the task is computationally intensive. The proposed approach is not only fast but also scalable to large-scale instances. Based on our approach, we implemented a software tool GPU-FS-kNN (GPU-based Fast and Scalable k-Nearest Neighbour) for CUDA enabled GPUs. The basic approach is simple and adaptable to other available GPU architectures. We observed speed-ups of 50–60 times compared with CPU implementation on a well-known breast microarray study and its associated data sets. CONCLUSION: Our GPU-based Fast and Scalable k-Nearest Neighbour search technique (GPU-FS-kNN) provides a significant performance improvement for nearest neighbour computation in large-scale networks. Source code and the software tool is available under GNU Public License (GPL) at https://sourceforge.net/p/gpufsknn/. Public Library of Science 2012-08-28 /pmc/articles/PMC3429408/ /pubmed/22937144 http://dx.doi.org/10.1371/journal.pone.0044000 Text en © 2012 Arefin et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Arefin, Ahmed Shamsul
Riveros, Carlos
Berretta, Regina
Moscato, Pablo
GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs
title GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs
title_full GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs
title_fullStr GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs
title_full_unstemmed GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs
title_short GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs
title_sort gpu-fs-knn: a software tool for fast and scalable knn computation using gpus
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3429408/
https://www.ncbi.nlm.nih.gov/pubmed/22937144
http://dx.doi.org/10.1371/journal.pone.0044000
work_keys_str_mv AT arefinahmedshamsul gpufsknnasoftwaretoolforfastandscalableknncomputationusinggpus
AT riveroscarlos gpufsknnasoftwaretoolforfastandscalableknncomputationusinggpus
AT berrettaregina gpufsknnasoftwaretoolforfastandscalableknncomputationusinggpus
AT moscatopablo gpufsknnasoftwaretoolforfastandscalableknncomputationusinggpus