Cargando…
CUDT: A CUDA Based Decision Tree Algorithm
Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be impr...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4130321/ https://www.ncbi.nlm.nih.gov/pubmed/25140346 http://dx.doi.org/10.1155/2014/745640 |
_version_ | 1782330314861838336 |
---|---|
author | Lo, Win-Tsung Chang, Yue-Shan Sheu, Ruey-Kai Chiu, Chun-Chieh Yuan, Shyan-Ming |
author_facet | Lo, Win-Tsung Chang, Yue-Shan Sheu, Ruey-Kai Chiu, Chun-Chieh Yuan, Shyan-Ming |
author_sort | Lo, Win-Tsung |
collection | PubMed |
description | Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be improved while processing huge data generated by ubiquitous sensing node in the era without new technology help. In order to improve data processing latency in huge data mining, in this paper, we design and implement a new parallelized decision tree algorithm on a CUDA (compute unified device architecture), which is a GPGPU solution provided by NVIDIA. In the proposed system, CPU is responsible for flow control while the GPU is responsible for computation. We have conducted many experiments to evaluate system performance of CUDT and made a comparison with traditional CPU version. The results show that CUDT is 5∼55 times faster than Weka-j48 and is 18 times speedup than SPRINT for large data set. |
format | Online Article Text |
id | pubmed-4130321 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-41303212014-08-19 CUDT: A CUDA Based Decision Tree Algorithm Lo, Win-Tsung Chang, Yue-Shan Sheu, Ruey-Kai Chiu, Chun-Chieh Yuan, Shyan-Ming ScientificWorldJournal Research Article Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be improved while processing huge data generated by ubiquitous sensing node in the era without new technology help. In order to improve data processing latency in huge data mining, in this paper, we design and implement a new parallelized decision tree algorithm on a CUDA (compute unified device architecture), which is a GPGPU solution provided by NVIDIA. In the proposed system, CPU is responsible for flow control while the GPU is responsible for computation. We have conducted many experiments to evaluate system performance of CUDT and made a comparison with traditional CPU version. The results show that CUDT is 5∼55 times faster than Weka-j48 and is 18 times speedup than SPRINT for large data set. Hindawi Publishing Corporation 2014 2014-07-22 /pmc/articles/PMC4130321/ /pubmed/25140346 http://dx.doi.org/10.1155/2014/745640 Text en Copyright © 2014 Win-Tsung Lo et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Lo, Win-Tsung Chang, Yue-Shan Sheu, Ruey-Kai Chiu, Chun-Chieh Yuan, Shyan-Ming CUDT: A CUDA Based Decision Tree Algorithm |
title | CUDT: A CUDA Based Decision Tree Algorithm |
title_full | CUDT: A CUDA Based Decision Tree Algorithm |
title_fullStr | CUDT: A CUDA Based Decision Tree Algorithm |
title_full_unstemmed | CUDT: A CUDA Based Decision Tree Algorithm |
title_short | CUDT: A CUDA Based Decision Tree Algorithm |
title_sort | cudt: a cuda based decision tree algorithm |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4130321/ https://www.ncbi.nlm.nih.gov/pubmed/25140346 http://dx.doi.org/10.1155/2014/745640 |
work_keys_str_mv | AT lowintsung cudtacudabaseddecisiontreealgorithm AT changyueshan cudtacudabaseddecisiontreealgorithm AT sheurueykai cudtacudabaseddecisiontreealgorithm AT chiuchunchieh cudtacudabaseddecisiontreealgorithm AT yuanshyanming cudtacudabaseddecisiontreealgorithm |