Cargando…

Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques

Software defect prediction (SDP) in the initial period of the software development life cycle (SDLC) remains a critical and important assignment. SDP is essentially studied during few last decades as it leads to assure the quality of software systems. The quick forecast of defective or imperfect art...

Descripción completa

Detalles Bibliográficos
Autores principales:	Khan, Bilal, Naseem, Rashid, Shah, Muhammad Arif, Wakil, Karzan, Khan, Atif, Uddin, M. Irfan, Mahmoud, Marwan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7987450/ https://www.ncbi.nlm.nih.gov/pubmed/33815733 http://dx.doi.org/10.1155/2021/8899263

_version_	1783668613485428736
author	Khan, Bilal Naseem, Rashid Shah, Muhammad Arif Wakil, Karzan Khan, Atif Uddin, M. Irfan Mahmoud, Marwan
author_facet	Khan, Bilal Naseem, Rashid Shah, Muhammad Arif Wakil, Karzan Khan, Atif Uddin, M. Irfan Mahmoud, Marwan
author_sort	Khan, Bilal
collection	PubMed
description	Software defect prediction (SDP) in the initial period of the software development life cycle (SDLC) remains a critical and important assignment. SDP is essentially studied during few last decades as it leads to assure the quality of software systems. The quick forecast of defective or imperfect artifacts in software development may serve the development team to use the existing assets competently and more effectively to provide extraordinary software products in the given or narrow time. Previously, several canvassers have industrialized models for defect prediction utilizing machine learning (ML) and statistical techniques. ML methods are considered as an operative and operational approach to pinpoint the defective modules, in which moving parts through mining concealed patterns amid software metrics (attributes). ML techniques are also utilized by several researchers on healthcare datasets. This study utilizes different ML techniques software defect prediction using seven broadly used datasets. The ML techniques include the multilayer perceptron (MLP), support vector machine (SVM), decision tree (J48), radial basis function (RBF), random forest (RF), hidden Markov model (HMM), credal decision tree (CDT), K-nearest neighbor (KNN), average one dependency estimator (A1DE), and Naïve Bayes (NB). The performance of each technique is evaluated using different measures, for instance, relative absolute error (RAE), mean absolute error (MAE), root mean squared error (RMSE), root relative squared error (RRSE), recall, and accuracy. The inclusive outcome shows the best performance of RF with 88.32% average accuracy and 2.96 rank value, second-best performance is achieved by SVM with 87.99% average accuracy and 3.83 rank values. Moreover, CDT also shows 87.88% average accuracy and 3.62 rank values, placed on the third position. The comprehensive outcomes of research can be utilized as a reference point for new research in the SDP domain, and therefore, any assertion concerning the enhancement in prediction over any new technique or model can be benchmarked and proved.
format	Online Article Text
id	pubmed-7987450
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-79874502021-04-02 Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques Khan, Bilal Naseem, Rashid Shah, Muhammad Arif Wakil, Karzan Khan, Atif Uddin, M. Irfan Mahmoud, Marwan J Healthc Eng Research Article Software defect prediction (SDP) in the initial period of the software development life cycle (SDLC) remains a critical and important assignment. SDP is essentially studied during few last decades as it leads to assure the quality of software systems. The quick forecast of defective or imperfect artifacts in software development may serve the development team to use the existing assets competently and more effectively to provide extraordinary software products in the given or narrow time. Previously, several canvassers have industrialized models for defect prediction utilizing machine learning (ML) and statistical techniques. ML methods are considered as an operative and operational approach to pinpoint the defective modules, in which moving parts through mining concealed patterns amid software metrics (attributes). ML techniques are also utilized by several researchers on healthcare datasets. This study utilizes different ML techniques software defect prediction using seven broadly used datasets. The ML techniques include the multilayer perceptron (MLP), support vector machine (SVM), decision tree (J48), radial basis function (RBF), random forest (RF), hidden Markov model (HMM), credal decision tree (CDT), K-nearest neighbor (KNN), average one dependency estimator (A1DE), and Naïve Bayes (NB). The performance of each technique is evaluated using different measures, for instance, relative absolute error (RAE), mean absolute error (MAE), root mean squared error (RMSE), root relative squared error (RRSE), recall, and accuracy. The inclusive outcome shows the best performance of RF with 88.32% average accuracy and 2.96 rank value, second-best performance is achieved by SVM with 87.99% average accuracy and 3.83 rank values. Moreover, CDT also shows 87.88% average accuracy and 3.62 rank values, placed on the third position. The comprehensive outcomes of research can be utilized as a reference point for new research in the SDP domain, and therefore, any assertion concerning the enhancement in prediction over any new technique or model can be benchmarked and proved. Hindawi 2021-03-15 /pmc/articles/PMC7987450/ /pubmed/33815733 http://dx.doi.org/10.1155/2021/8899263 Text en Copyright © 2021 Bilal Khan et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Khan, Bilal Naseem, Rashid Shah, Muhammad Arif Wakil, Karzan Khan, Atif Uddin, M. Irfan Mahmoud, Marwan Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques
title	Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques
title_full	Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques
title_fullStr	Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques
title_full_unstemmed	Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques
title_short	Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques
title_sort	software defect prediction for healthcare big data: an empirical evaluation of machine learning techniques
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7987450/ https://www.ncbi.nlm.nih.gov/pubmed/33815733 http://dx.doi.org/10.1155/2021/8899263
work_keys_str_mv	AT khanbilal softwaredefectpredictionforhealthcarebigdataanempiricalevaluationofmachinelearningtechniques AT naseemrashid softwaredefectpredictionforhealthcarebigdataanempiricalevaluationofmachinelearningtechniques AT shahmuhammadarif softwaredefectpredictionforhealthcarebigdataanempiricalevaluationofmachinelearningtechniques AT wakilkarzan softwaredefectpredictionforhealthcarebigdataanempiricalevaluationofmachinelearningtechniques AT khanatif softwaredefectpredictionforhealthcarebigdataanempiricalevaluationofmachinelearningtechniques AT uddinmirfan softwaredefectpredictionforhealthcarebigdataanempiricalevaluationofmachinelearningtechniques AT mahmoudmarwan softwaredefectpredictionforhealthcarebigdataanempiricalevaluationofmachinelearningtechniques

Software Defect Prediction for Healthcare Big Data: An Empirical Evaluation of Machine Learning Techniques

Ejemplares similares