Cargando…
A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications
Software Defect Prediction (SDP) is an integral aspect of the Software Development Life-Cycle (SDLC). As the prevalence of software systems increases and becomes more integrated into our daily lives, so the complexity of these systems increases the risks of widespread defects. With reliance on these...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10098870/ https://www.ncbi.nlm.nih.gov/pubmed/37050529 http://dx.doi.org/10.3390/s23073470 |
_version_ | 1785024918058958848 |
---|---|
author | Mcmurray, Samuel Sodhro, Ali Hassan |
author_facet | Mcmurray, Samuel Sodhro, Ali Hassan |
author_sort | Mcmurray, Samuel |
collection | PubMed |
description | Software Defect Prediction (SDP) is an integral aspect of the Software Development Life-Cycle (SDLC). As the prevalence of software systems increases and becomes more integrated into our daily lives, so the complexity of these systems increases the risks of widespread defects. With reliance on these systems increasing, the ability to accurately identify a defective model using Machine Learning (ML) has been overlooked and less addressed. Thus, this article contributes an investigation of various ML techniques for SDP. An investigation, comparative analysis and recommendation of appropriate Feature Extraction (FE) techniques, Principal Component Analysis (PCA), Partial Least Squares Regression (PLS), Feature Selection (FS) techniques, Fisher score, Recursive Feature Elimination (RFE), and Elastic Net are presented. Validation of the following techniques, both separately and in combination with ML algorithms, is performed: Support Vector Machine (SVM), Logistic Regression (LR), Naïve Bayes (NB), K-Nearest Neighbour (KNN), Multilayer Perceptron (MLP), Decision Tree (DT), and ensemble learning methods Bootstrap Aggregation (Bagging), Adaptive Boosting (AdaBoost), Extreme Gradient Boosting (XGBoost), Random Forest(RF), and Generalized Stacking (Stacking). Extensive experimental setup was built and the results of the experiments revealed that FE and FS can both positively and negatively affect performance over the base model or Baseline. PLS, both separately and in combination with FS techniques, provides impressive, and the most consistent, improvements, while PCA, in combination with Elastic-Net, shows acceptable improvement. |
format | Online Article Text |
id | pubmed-10098870 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100988702023-04-14 A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications Mcmurray, Samuel Sodhro, Ali Hassan Sensors (Basel) Article Software Defect Prediction (SDP) is an integral aspect of the Software Development Life-Cycle (SDLC). As the prevalence of software systems increases and becomes more integrated into our daily lives, so the complexity of these systems increases the risks of widespread defects. With reliance on these systems increasing, the ability to accurately identify a defective model using Machine Learning (ML) has been overlooked and less addressed. Thus, this article contributes an investigation of various ML techniques for SDP. An investigation, comparative analysis and recommendation of appropriate Feature Extraction (FE) techniques, Principal Component Analysis (PCA), Partial Least Squares Regression (PLS), Feature Selection (FS) techniques, Fisher score, Recursive Feature Elimination (RFE), and Elastic Net are presented. Validation of the following techniques, both separately and in combination with ML algorithms, is performed: Support Vector Machine (SVM), Logistic Regression (LR), Naïve Bayes (NB), K-Nearest Neighbour (KNN), Multilayer Perceptron (MLP), Decision Tree (DT), and ensemble learning methods Bootstrap Aggregation (Bagging), Adaptive Boosting (AdaBoost), Extreme Gradient Boosting (XGBoost), Random Forest(RF), and Generalized Stacking (Stacking). Extensive experimental setup was built and the results of the experiments revealed that FE and FS can both positively and negatively affect performance over the base model or Baseline. PLS, both separately and in combination with FS techniques, provides impressive, and the most consistent, improvements, while PCA, in combination with Elastic-Net, shows acceptable improvement. MDPI 2023-03-26 /pmc/articles/PMC10098870/ /pubmed/37050529 http://dx.doi.org/10.3390/s23073470 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Mcmurray, Samuel Sodhro, Ali Hassan A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications |
title | A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications |
title_full | A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications |
title_fullStr | A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications |
title_full_unstemmed | A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications |
title_short | A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications |
title_sort | study on ml-based software defect detection for security traceability in smart healthcare applications |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10098870/ https://www.ncbi.nlm.nih.gov/pubmed/37050529 http://dx.doi.org/10.3390/s23073470 |
work_keys_str_mv | AT mcmurraysamuel astudyonmlbasedsoftwaredefectdetectionforsecuritytraceabilityinsmarthealthcareapplications AT sodhroalihassan astudyonmlbasedsoftwaredefectdetectionforsecuritytraceabilityinsmarthealthcareapplications AT mcmurraysamuel studyonmlbasedsoftwaredefectdetectionforsecuritytraceabilityinsmarthealthcareapplications AT sodhroalihassan studyonmlbasedsoftwaredefectdetectionforsecuritytraceabilityinsmarthealthcareapplications |