Cargando…

A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications

Software Defect Prediction (SDP) is an integral aspect of the Software Development Life-Cycle (SDLC). As the prevalence of software systems increases and becomes more integrated into our daily lives, so the complexity of these systems increases the risks of widespread defects. With reliance on these...

Descripción completa

Detalles Bibliográficos
Autores principales: Mcmurray, Samuel, Sodhro, Ali Hassan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10098870/
https://www.ncbi.nlm.nih.gov/pubmed/37050529
http://dx.doi.org/10.3390/s23073470
_version_ 1785024918058958848
author Mcmurray, Samuel
Sodhro, Ali Hassan
author_facet Mcmurray, Samuel
Sodhro, Ali Hassan
author_sort Mcmurray, Samuel
collection PubMed
description Software Defect Prediction (SDP) is an integral aspect of the Software Development Life-Cycle (SDLC). As the prevalence of software systems increases and becomes more integrated into our daily lives, so the complexity of these systems increases the risks of widespread defects. With reliance on these systems increasing, the ability to accurately identify a defective model using Machine Learning (ML) has been overlooked and less addressed. Thus, this article contributes an investigation of various ML techniques for SDP. An investigation, comparative analysis and recommendation of appropriate Feature Extraction (FE) techniques, Principal Component Analysis (PCA), Partial Least Squares Regression (PLS), Feature Selection (FS) techniques, Fisher score, Recursive Feature Elimination (RFE), and Elastic Net are presented. Validation of the following techniques, both separately and in combination with ML algorithms, is performed: Support Vector Machine (SVM), Logistic Regression (LR), Naïve Bayes (NB), K-Nearest Neighbour (KNN), Multilayer Perceptron (MLP), Decision Tree (DT), and ensemble learning methods Bootstrap Aggregation (Bagging), Adaptive Boosting (AdaBoost), Extreme Gradient Boosting (XGBoost), Random Forest(RF), and Generalized Stacking (Stacking). Extensive experimental setup was built and the results of the experiments revealed that FE and FS can both positively and negatively affect performance over the base model or Baseline. PLS, both separately and in combination with FS techniques, provides impressive, and the most consistent, improvements, while PCA, in combination with Elastic-Net, shows acceptable improvement.
format Online
Article
Text
id pubmed-10098870
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100988702023-04-14 A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications Mcmurray, Samuel Sodhro, Ali Hassan Sensors (Basel) Article Software Defect Prediction (SDP) is an integral aspect of the Software Development Life-Cycle (SDLC). As the prevalence of software systems increases and becomes more integrated into our daily lives, so the complexity of these systems increases the risks of widespread defects. With reliance on these systems increasing, the ability to accurately identify a defective model using Machine Learning (ML) has been overlooked and less addressed. Thus, this article contributes an investigation of various ML techniques for SDP. An investigation, comparative analysis and recommendation of appropriate Feature Extraction (FE) techniques, Principal Component Analysis (PCA), Partial Least Squares Regression (PLS), Feature Selection (FS) techniques, Fisher score, Recursive Feature Elimination (RFE), and Elastic Net are presented. Validation of the following techniques, both separately and in combination with ML algorithms, is performed: Support Vector Machine (SVM), Logistic Regression (LR), Naïve Bayes (NB), K-Nearest Neighbour (KNN), Multilayer Perceptron (MLP), Decision Tree (DT), and ensemble learning methods Bootstrap Aggregation (Bagging), Adaptive Boosting (AdaBoost), Extreme Gradient Boosting (XGBoost), Random Forest(RF), and Generalized Stacking (Stacking). Extensive experimental setup was built and the results of the experiments revealed that FE and FS can both positively and negatively affect performance over the base model or Baseline. PLS, both separately and in combination with FS techniques, provides impressive, and the most consistent, improvements, while PCA, in combination with Elastic-Net, shows acceptable improvement. MDPI 2023-03-26 /pmc/articles/PMC10098870/ /pubmed/37050529 http://dx.doi.org/10.3390/s23073470 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mcmurray, Samuel
Sodhro, Ali Hassan
A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications
title A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications
title_full A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications
title_fullStr A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications
title_full_unstemmed A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications
title_short A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications
title_sort study on ml-based software defect detection for security traceability in smart healthcare applications
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10098870/
https://www.ncbi.nlm.nih.gov/pubmed/37050529
http://dx.doi.org/10.3390/s23073470
work_keys_str_mv AT mcmurraysamuel astudyonmlbasedsoftwaredefectdetectionforsecuritytraceabilityinsmarthealthcareapplications
AT sodhroalihassan astudyonmlbasedsoftwaredefectdetectionforsecuritytraceabilityinsmarthealthcareapplications
AT mcmurraysamuel studyonmlbasedsoftwaredefectdetectionforsecuritytraceabilityinsmarthealthcareapplications
AT sodhroalihassan studyonmlbasedsoftwaredefectdetectionforsecuritytraceabilityinsmarthealthcareapplications