Cargando…
BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning
Following the recent advances in wireless communication leading to increased Internet of Things (IoT) systems, many security threats are currently ravaging IoT systems, causing harm to information. Considering the vast application areas of IoT systems, ensuring that cyberattacks are holistically det...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572777/ https://www.ncbi.nlm.nih.gov/pubmed/36236506 http://dx.doi.org/10.3390/s22197409 |
_version_ | 1784810700780077056 |
---|---|
author | Okey, Ogobuchi Daniel Maidin, Siti Sarah Adasme, Pablo Lopes Rosa, Renata Saadi, Muhammad Carrillo Melgarejo, Dick Zegarra Rodríguez, Demóstenes |
author_facet | Okey, Ogobuchi Daniel Maidin, Siti Sarah Adasme, Pablo Lopes Rosa, Renata Saadi, Muhammad Carrillo Melgarejo, Dick Zegarra Rodríguez, Demóstenes |
author_sort | Okey, Ogobuchi Daniel |
collection | PubMed |
description | Following the recent advances in wireless communication leading to increased Internet of Things (IoT) systems, many security threats are currently ravaging IoT systems, causing harm to information. Considering the vast application areas of IoT systems, ensuring that cyberattacks are holistically detected to avoid harm is paramount. Machine learning (ML) algorithms have demonstrated high capacity in helping to mitigate attacks on IoT devices and other edge systems with reasonable accuracy. However, the dynamics of operation of intruders in IoT networks require more improved IDS models capable of detecting multiple attacks with a higher detection rate and lower computational resource requirement, which is one of the challenges of IoT systems. Many ensemble methods have been used with different ML classifiers, including decision trees and random forests, to propose IDS models for IoT environments. The boosting method is one of the approaches used to design an ensemble classifier. This paper proposes an efficient method for detecting cyberattacks and network intrusions based on boosted ML classifiers. Our proposed model is named BoostedEnML. First, we train six different ML classifiers (DT, RF, ET, LGBM, AD, and XGB) and obtain an ensemble using the stacking method and another with a majority voting approach. Two different datasets containing high-profile attacks, including distributed denial of service (DDoS), denial of service (DoS), botnets, infiltration, web attacks, heartbleed, portscan, and botnets, were used to train, evaluate, and test the IDS model. To ensure that we obtained a holistic and efficient model, we performed data balancing with synthetic minority oversampling technique (SMOTE) and adaptive synthetic (ADASYN) techniques; after that, we used stratified K-fold to split the data into training, validation, and testing sets. Based on the best two models, we construct our proposed BoostedEnsML model using LightGBM and XGBoost, as the combination of the two classifiers gives a lightweight yet efficient model, which is part of the target of this research. Experimental results show that BoostedEnsML outperformed existing ensemble models in terms of accuracy, precision, recall, F-score, and area under the curve (AUC), reaching 100% in each case on the selected datasets for multiclass classification. |
format | Online Article Text |
id | pubmed-9572777 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95727772022-10-17 BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning Okey, Ogobuchi Daniel Maidin, Siti Sarah Adasme, Pablo Lopes Rosa, Renata Saadi, Muhammad Carrillo Melgarejo, Dick Zegarra Rodríguez, Demóstenes Sensors (Basel) Article Following the recent advances in wireless communication leading to increased Internet of Things (IoT) systems, many security threats are currently ravaging IoT systems, causing harm to information. Considering the vast application areas of IoT systems, ensuring that cyberattacks are holistically detected to avoid harm is paramount. Machine learning (ML) algorithms have demonstrated high capacity in helping to mitigate attacks on IoT devices and other edge systems with reasonable accuracy. However, the dynamics of operation of intruders in IoT networks require more improved IDS models capable of detecting multiple attacks with a higher detection rate and lower computational resource requirement, which is one of the challenges of IoT systems. Many ensemble methods have been used with different ML classifiers, including decision trees and random forests, to propose IDS models for IoT environments. The boosting method is one of the approaches used to design an ensemble classifier. This paper proposes an efficient method for detecting cyberattacks and network intrusions based on boosted ML classifiers. Our proposed model is named BoostedEnML. First, we train six different ML classifiers (DT, RF, ET, LGBM, AD, and XGB) and obtain an ensemble using the stacking method and another with a majority voting approach. Two different datasets containing high-profile attacks, including distributed denial of service (DDoS), denial of service (DoS), botnets, infiltration, web attacks, heartbleed, portscan, and botnets, were used to train, evaluate, and test the IDS model. To ensure that we obtained a holistic and efficient model, we performed data balancing with synthetic minority oversampling technique (SMOTE) and adaptive synthetic (ADASYN) techniques; after that, we used stratified K-fold to split the data into training, validation, and testing sets. Based on the best two models, we construct our proposed BoostedEnsML model using LightGBM and XGBoost, as the combination of the two classifiers gives a lightweight yet efficient model, which is part of the target of this research. Experimental results show that BoostedEnsML outperformed existing ensemble models in terms of accuracy, precision, recall, F-score, and area under the curve (AUC), reaching 100% in each case on the selected datasets for multiclass classification. MDPI 2022-09-29 /pmc/articles/PMC9572777/ /pubmed/36236506 http://dx.doi.org/10.3390/s22197409 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Okey, Ogobuchi Daniel Maidin, Siti Sarah Adasme, Pablo Lopes Rosa, Renata Saadi, Muhammad Carrillo Melgarejo, Dick Zegarra Rodríguez, Demóstenes BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning |
title | BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning |
title_full | BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning |
title_fullStr | BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning |
title_full_unstemmed | BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning |
title_short | BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning |
title_sort | boostedenml: efficient technique for detecting cyberattacks in iot systems using boosted ensemble machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572777/ https://www.ncbi.nlm.nih.gov/pubmed/36236506 http://dx.doi.org/10.3390/s22197409 |
work_keys_str_mv | AT okeyogobuchidaniel boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning AT maidinsitisarah boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning AT adasmepablo boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning AT lopesrosarenata boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning AT saadimuhammad boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning AT carrillomelgarejodick boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning AT zegarrarodriguezdemostenes boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning |