Cargando…

BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning

Following the recent advances in wireless communication leading to increased Internet of Things (IoT) systems, many security threats are currently ravaging IoT systems, causing harm to information. Considering the vast application areas of IoT systems, ensuring that cyberattacks are holistically det...

Descripción completa

Detalles Bibliográficos
Autores principales: Okey, Ogobuchi Daniel, Maidin, Siti Sarah, Adasme, Pablo, Lopes Rosa, Renata, Saadi, Muhammad, Carrillo Melgarejo, Dick, Zegarra Rodríguez, Demóstenes
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572777/
https://www.ncbi.nlm.nih.gov/pubmed/36236506
http://dx.doi.org/10.3390/s22197409
_version_ 1784810700780077056
author Okey, Ogobuchi Daniel
Maidin, Siti Sarah
Adasme, Pablo
Lopes Rosa, Renata
Saadi, Muhammad
Carrillo Melgarejo, Dick
Zegarra Rodríguez, Demóstenes
author_facet Okey, Ogobuchi Daniel
Maidin, Siti Sarah
Adasme, Pablo
Lopes Rosa, Renata
Saadi, Muhammad
Carrillo Melgarejo, Dick
Zegarra Rodríguez, Demóstenes
author_sort Okey, Ogobuchi Daniel
collection PubMed
description Following the recent advances in wireless communication leading to increased Internet of Things (IoT) systems, many security threats are currently ravaging IoT systems, causing harm to information. Considering the vast application areas of IoT systems, ensuring that cyberattacks are holistically detected to avoid harm is paramount. Machine learning (ML) algorithms have demonstrated high capacity in helping to mitigate attacks on IoT devices and other edge systems with reasonable accuracy. However, the dynamics of operation of intruders in IoT networks require more improved IDS models capable of detecting multiple attacks with a higher detection rate and lower computational resource requirement, which is one of the challenges of IoT systems. Many ensemble methods have been used with different ML classifiers, including decision trees and random forests, to propose IDS models for IoT environments. The boosting method is one of the approaches used to design an ensemble classifier. This paper proposes an efficient method for detecting cyberattacks and network intrusions based on boosted ML classifiers. Our proposed model is named BoostedEnML. First, we train six different ML classifiers (DT, RF, ET, LGBM, AD, and XGB) and obtain an ensemble using the stacking method and another with a majority voting approach. Two different datasets containing high-profile attacks, including distributed denial of service (DDoS), denial of service (DoS), botnets, infiltration, web attacks, heartbleed, portscan, and botnets, were used to train, evaluate, and test the IDS model. To ensure that we obtained a holistic and efficient model, we performed data balancing with synthetic minority oversampling technique (SMOTE) and adaptive synthetic (ADASYN) techniques; after that, we used stratified K-fold to split the data into training, validation, and testing sets. Based on the best two models, we construct our proposed BoostedEnsML model using LightGBM and XGBoost, as the combination of the two classifiers gives a lightweight yet efficient model, which is part of the target of this research. Experimental results show that BoostedEnsML outperformed existing ensemble models in terms of accuracy, precision, recall, F-score, and area under the curve (AUC), reaching 100% in each case on the selected datasets for multiclass classification.
format Online
Article
Text
id pubmed-9572777
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95727772022-10-17 BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning Okey, Ogobuchi Daniel Maidin, Siti Sarah Adasme, Pablo Lopes Rosa, Renata Saadi, Muhammad Carrillo Melgarejo, Dick Zegarra Rodríguez, Demóstenes Sensors (Basel) Article Following the recent advances in wireless communication leading to increased Internet of Things (IoT) systems, many security threats are currently ravaging IoT systems, causing harm to information. Considering the vast application areas of IoT systems, ensuring that cyberattacks are holistically detected to avoid harm is paramount. Machine learning (ML) algorithms have demonstrated high capacity in helping to mitigate attacks on IoT devices and other edge systems with reasonable accuracy. However, the dynamics of operation of intruders in IoT networks require more improved IDS models capable of detecting multiple attacks with a higher detection rate and lower computational resource requirement, which is one of the challenges of IoT systems. Many ensemble methods have been used with different ML classifiers, including decision trees and random forests, to propose IDS models for IoT environments. The boosting method is one of the approaches used to design an ensemble classifier. This paper proposes an efficient method for detecting cyberattacks and network intrusions based on boosted ML classifiers. Our proposed model is named BoostedEnML. First, we train six different ML classifiers (DT, RF, ET, LGBM, AD, and XGB) and obtain an ensemble using the stacking method and another with a majority voting approach. Two different datasets containing high-profile attacks, including distributed denial of service (DDoS), denial of service (DoS), botnets, infiltration, web attacks, heartbleed, portscan, and botnets, were used to train, evaluate, and test the IDS model. To ensure that we obtained a holistic and efficient model, we performed data balancing with synthetic minority oversampling technique (SMOTE) and adaptive synthetic (ADASYN) techniques; after that, we used stratified K-fold to split the data into training, validation, and testing sets. Based on the best two models, we construct our proposed BoostedEnsML model using LightGBM and XGBoost, as the combination of the two classifiers gives a lightweight yet efficient model, which is part of the target of this research. Experimental results show that BoostedEnsML outperformed existing ensemble models in terms of accuracy, precision, recall, F-score, and area under the curve (AUC), reaching 100% in each case on the selected datasets for multiclass classification. MDPI 2022-09-29 /pmc/articles/PMC9572777/ /pubmed/36236506 http://dx.doi.org/10.3390/s22197409 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Okey, Ogobuchi Daniel
Maidin, Siti Sarah
Adasme, Pablo
Lopes Rosa, Renata
Saadi, Muhammad
Carrillo Melgarejo, Dick
Zegarra Rodríguez, Demóstenes
BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning
title BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning
title_full BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning
title_fullStr BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning
title_full_unstemmed BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning
title_short BoostedEnML: Efficient Technique for Detecting Cyberattacks in IoT Systems Using Boosted Ensemble Machine Learning
title_sort boostedenml: efficient technique for detecting cyberattacks in iot systems using boosted ensemble machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572777/
https://www.ncbi.nlm.nih.gov/pubmed/36236506
http://dx.doi.org/10.3390/s22197409
work_keys_str_mv AT okeyogobuchidaniel boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning
AT maidinsitisarah boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning
AT adasmepablo boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning
AT lopesrosarenata boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning
AT saadimuhammad boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning
AT carrillomelgarejodick boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning
AT zegarrarodriguezdemostenes boostedenmlefficienttechniquefordetectingcyberattacksiniotsystemsusingboostedensemblemachinelearning