Cargando…

CUDT: A CUDA Based Decision Tree Algorithm

Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be impr...

Descripción completa

Detalles Bibliográficos
Autores principales: Lo, Win-Tsung, Chang, Yue-Shan, Sheu, Ruey-Kai, Chiu, Chun-Chieh, Yuan, Shyan-Ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4130321/
https://www.ncbi.nlm.nih.gov/pubmed/25140346
http://dx.doi.org/10.1155/2014/745640
_version_ 1782330314861838336
author Lo, Win-Tsung
Chang, Yue-Shan
Sheu, Ruey-Kai
Chiu, Chun-Chieh
Yuan, Shyan-Ming
author_facet Lo, Win-Tsung
Chang, Yue-Shan
Sheu, Ruey-Kai
Chiu, Chun-Chieh
Yuan, Shyan-Ming
author_sort Lo, Win-Tsung
collection PubMed
description Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be improved while processing huge data generated by ubiquitous sensing node in the era without new technology help. In order to improve data processing latency in huge data mining, in this paper, we design and implement a new parallelized decision tree algorithm on a CUDA (compute unified device architecture), which is a GPGPU solution provided by NVIDIA. In the proposed system, CPU is responsible for flow control while the GPU is responsible for computation. We have conducted many experiments to evaluate system performance of CUDT and made a comparison with traditional CPU version. The results show that CUDT is 5∼55 times faster than Weka-j48 and is 18 times speedup than SPRINT for large data set.
format Online
Article
Text
id pubmed-4130321
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-41303212014-08-19 CUDT: A CUDA Based Decision Tree Algorithm Lo, Win-Tsung Chang, Yue-Shan Sheu, Ruey-Kai Chiu, Chun-Chieh Yuan, Shyan-Ming ScientificWorldJournal Research Article Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be improved while processing huge data generated by ubiquitous sensing node in the era without new technology help. In order to improve data processing latency in huge data mining, in this paper, we design and implement a new parallelized decision tree algorithm on a CUDA (compute unified device architecture), which is a GPGPU solution provided by NVIDIA. In the proposed system, CPU is responsible for flow control while the GPU is responsible for computation. We have conducted many experiments to evaluate system performance of CUDT and made a comparison with traditional CPU version. The results show that CUDT is 5∼55 times faster than Weka-j48 and is 18 times speedup than SPRINT for large data set. Hindawi Publishing Corporation 2014 2014-07-22 /pmc/articles/PMC4130321/ /pubmed/25140346 http://dx.doi.org/10.1155/2014/745640 Text en Copyright © 2014 Win-Tsung Lo et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lo, Win-Tsung
Chang, Yue-Shan
Sheu, Ruey-Kai
Chiu, Chun-Chieh
Yuan, Shyan-Ming
CUDT: A CUDA Based Decision Tree Algorithm
title CUDT: A CUDA Based Decision Tree Algorithm
title_full CUDT: A CUDA Based Decision Tree Algorithm
title_fullStr CUDT: A CUDA Based Decision Tree Algorithm
title_full_unstemmed CUDT: A CUDA Based Decision Tree Algorithm
title_short CUDT: A CUDA Based Decision Tree Algorithm
title_sort cudt: a cuda based decision tree algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4130321/
https://www.ncbi.nlm.nih.gov/pubmed/25140346
http://dx.doi.org/10.1155/2014/745640
work_keys_str_mv AT lowintsung cudtacudabaseddecisiontreealgorithm
AT changyueshan cudtacudabaseddecisiontreealgorithm
AT sheurueykai cudtacudabaseddecisiontreealgorithm
AT chiuchunchieh cudtacudabaseddecisiontreealgorithm
AT yuanshyanming cudtacudabaseddecisiontreealgorithm