Cargando…

Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler

BACKGROUND: In healthcare area, big data, if integrated with machine learning, enables health practitioners to predict the result of a disorder or disease more accurately. In Autistic Spectrum Disorder (ASD), it is important to screen the patients to enable them to undergo proper treatments as early...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohd Radzi, Siti Fairuz, Hassan, Mohd Sayuti, Mohd Radzi, Muhammad Abdul Hadi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9700876/
https://www.ncbi.nlm.nih.gov/pubmed/36434656
http://dx.doi.org/10.1186/s12911-022-02050-x
_version_ 1784839409364893696
author Mohd Radzi, Siti Fairuz
Hassan, Mohd Sayuti
Mohd Radzi, Muhammad Abdul Hadi
author_facet Mohd Radzi, Siti Fairuz
Hassan, Mohd Sayuti
Mohd Radzi, Muhammad Abdul Hadi
author_sort Mohd Radzi, Siti Fairuz
collection PubMed
description BACKGROUND: In healthcare area, big data, if integrated with machine learning, enables health practitioners to predict the result of a disorder or disease more accurately. In Autistic Spectrum Disorder (ASD), it is important to screen the patients to enable them to undergo proper treatments as early as possible. However, difficulties may arise in predicting ASD occurrences accurately, mainly caused by human errors. Data mining, if embedded into health screening practice, can help to overcome the difficulties. This study attempts to evaluate the performance of six best classifiers, taken from existing works, at analysing ASD screening training dataset. RESULT: We tested Naive Bayes, Logistic Regression, KNN, J48, Random Forest, SVM, and Deep Neural Network algorithms to ASD screening dataset and compared the classifiers’ based on significant parameters; sensitivity, specificity, accuracy, receiver operating characteristic, area under the curve, and runtime, in predicting ASD occurrences. We also found that most of previous studies focused on classifying health-related dataset while ignoring the missing values which may contribute to significant impacts to the classification result which in turn may impact the life of the patients. Thus, we addressed the missing values by implementing imputation method where they are replaced with the mean of the available records found in the dataset. CONCLUSION: We found that J48 produced promising results as compared to other classifiers when tested in both circumstances, with and without missing values. Our findings also suggested that SVM does not necessarily perform well for small and simple datasets. The outcome is hoped to assist health practitioners in making accurate diagnosis of ASD occurrences in patients.
format Online
Article
Text
id pubmed-9700876
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-97008762022-11-27 Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler Mohd Radzi, Siti Fairuz Hassan, Mohd Sayuti Mohd Radzi, Muhammad Abdul Hadi BMC Med Inform Decis Mak Article BACKGROUND: In healthcare area, big data, if integrated with machine learning, enables health practitioners to predict the result of a disorder or disease more accurately. In Autistic Spectrum Disorder (ASD), it is important to screen the patients to enable them to undergo proper treatments as early as possible. However, difficulties may arise in predicting ASD occurrences accurately, mainly caused by human errors. Data mining, if embedded into health screening practice, can help to overcome the difficulties. This study attempts to evaluate the performance of six best classifiers, taken from existing works, at analysing ASD screening training dataset. RESULT: We tested Naive Bayes, Logistic Regression, KNN, J48, Random Forest, SVM, and Deep Neural Network algorithms to ASD screening dataset and compared the classifiers’ based on significant parameters; sensitivity, specificity, accuracy, receiver operating characteristic, area under the curve, and runtime, in predicting ASD occurrences. We also found that most of previous studies focused on classifying health-related dataset while ignoring the missing values which may contribute to significant impacts to the classification result which in turn may impact the life of the patients. Thus, we addressed the missing values by implementing imputation method where they are replaced with the mean of the available records found in the dataset. CONCLUSION: We found that J48 produced promising results as compared to other classifiers when tested in both circumstances, with and without missing values. Our findings also suggested that SVM does not necessarily perform well for small and simple datasets. The outcome is hoped to assist health practitioners in making accurate diagnosis of ASD occurrences in patients. BioMed Central 2022-11-24 /pmc/articles/PMC9700876/ /pubmed/36434656 http://dx.doi.org/10.1186/s12911-022-02050-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Article
Mohd Radzi, Siti Fairuz
Hassan, Mohd Sayuti
Mohd Radzi, Muhammad Abdul Hadi
Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler
title Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler
title_full Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler
title_fullStr Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler
title_full_unstemmed Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler
title_short Comparison of classification algorithms for predicting autistic spectrum disorder using WEKA modeler
title_sort comparison of classification algorithms for predicting autistic spectrum disorder using weka modeler
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9700876/
https://www.ncbi.nlm.nih.gov/pubmed/36434656
http://dx.doi.org/10.1186/s12911-022-02050-x
work_keys_str_mv AT mohdradzisitifairuz comparisonofclassificationalgorithmsforpredictingautisticspectrumdisorderusingwekamodeler
AT hassanmohdsayuti comparisonofclassificationalgorithmsforpredictingautisticspectrumdisorderusingwekamodeler
AT mohdradzimuhammadabdulhadi comparisonofclassificationalgorithmsforpredictingautisticspectrumdisorderusingwekamodeler