Cargando…

Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models

Corona Virus Disease 2019 (COVID-19) pandemic has increased the importance of Virtual Learning Environments (VLEs) instigating students to study from their homes. Every day a tremendous amount of data is generated when students interact with VLEs to perform different activities and access learning m...

Descripción completa

Detalles Bibliográficos
Autores principales: Adnan, Muhammad, Alarood, Alaa Abdul Salam, Uddin, M. Irfan, ur Rehman, Izaz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044349/
https://www.ncbi.nlm.nih.gov/pubmed/35494796
http://dx.doi.org/10.7717/peerj-cs.803
_version_ 1784695086512078848
author Adnan, Muhammad
Alarood, Alaa Abdul Salam
Uddin, M. Irfan
ur Rehman, Izaz
author_facet Adnan, Muhammad
Alarood, Alaa Abdul Salam
Uddin, M. Irfan
ur Rehman, Izaz
author_sort Adnan, Muhammad
collection PubMed
description Corona Virus Disease 2019 (COVID-19) pandemic has increased the importance of Virtual Learning Environments (VLEs) instigating students to study from their homes. Every day a tremendous amount of data is generated when students interact with VLEs to perform different activities and access learning material. To make the generated data useful, it must be processed and managed by the proper machine learning (ML) algorithm. ML algorithms’ applications are many folds with Education Data Mining (EDM) and Learning Analytics (LA) as their major fields. ML algorithms are commonly used to process raw data to discover hidden patterns and construct a model to make future predictions, such as predicting students’ performance, dropouts, engagement, etc. However, in VLE, it is important to select the right and most applicable ML algorithm to give the best performance results. In this study, we aim to improve those ML and DL algorithms’ performance that give an inferior performance in terms of performance, accuracy, precision, recall, and F1 score. Several ML algorithms were applied on Open University Learning Analytics (OULA) dataset to reveal which one offers the best results in terms of performance, accuracy, precision, recall, and F1 score. Two popular ML algorithms called Decision Tree (DT) and Feed-Forward Neural Network (FFNN) provided unsatisfactory results. They were selected and experimented with various techniques such as grid search cross-validation, adaptive boosting, extreme gradient boosting, early stopping, feature engineering, and dropping inactive neurons to improve their performance scores. Moreover, we also determined the feature weights/importance in predicting the students’ study performance, leading to the design and development of the adaptive learning system. The ML techniques and the methods used in this research study can be used by instructors/administrators to optimize learning content and provide informed guidance to students, thus improving their learning experience and making it exciting and adaptive.
format Online
Article
Text
id pubmed-9044349
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-90443492022-04-28 Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models Adnan, Muhammad Alarood, Alaa Abdul Salam Uddin, M. Irfan ur Rehman, Izaz PeerJ Comput Sci Human-Computer Interaction Corona Virus Disease 2019 (COVID-19) pandemic has increased the importance of Virtual Learning Environments (VLEs) instigating students to study from their homes. Every day a tremendous amount of data is generated when students interact with VLEs to perform different activities and access learning material. To make the generated data useful, it must be processed and managed by the proper machine learning (ML) algorithm. ML algorithms’ applications are many folds with Education Data Mining (EDM) and Learning Analytics (LA) as their major fields. ML algorithms are commonly used to process raw data to discover hidden patterns and construct a model to make future predictions, such as predicting students’ performance, dropouts, engagement, etc. However, in VLE, it is important to select the right and most applicable ML algorithm to give the best performance results. In this study, we aim to improve those ML and DL algorithms’ performance that give an inferior performance in terms of performance, accuracy, precision, recall, and F1 score. Several ML algorithms were applied on Open University Learning Analytics (OULA) dataset to reveal which one offers the best results in terms of performance, accuracy, precision, recall, and F1 score. Two popular ML algorithms called Decision Tree (DT) and Feed-Forward Neural Network (FFNN) provided unsatisfactory results. They were selected and experimented with various techniques such as grid search cross-validation, adaptive boosting, extreme gradient boosting, early stopping, feature engineering, and dropping inactive neurons to improve their performance scores. Moreover, we also determined the feature weights/importance in predicting the students’ study performance, leading to the design and development of the adaptive learning system. The ML techniques and the methods used in this research study can be used by instructors/administrators to optimize learning content and provide informed guidance to students, thus improving their learning experience and making it exciting and adaptive. PeerJ Inc. 2022-02-21 /pmc/articles/PMC9044349/ /pubmed/35494796 http://dx.doi.org/10.7717/peerj-cs.803 Text en © 2022 Adnan et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Human-Computer Interaction
Adnan, Muhammad
Alarood, Alaa Abdul Salam
Uddin, M. Irfan
ur Rehman, Izaz
Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models
title Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models
title_full Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models
title_fullStr Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models
title_full_unstemmed Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models
title_short Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models
title_sort utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models
topic Human-Computer Interaction
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044349/
https://www.ncbi.nlm.nih.gov/pubmed/35494796
http://dx.doi.org/10.7717/peerj-cs.803
work_keys_str_mv AT adnanmuhammad utilizinggridsearchcrossvalidationwithadaptiveboostingforaugmentingperformanceofmachinelearningmodels
AT alaroodalaaabdulsalam utilizinggridsearchcrossvalidationwithadaptiveboostingforaugmentingperformanceofmachinelearningmodels
AT uddinmirfan utilizinggridsearchcrossvalidationwithadaptiveboostingforaugmentingperformanceofmachinelearningmodels
AT urrehmanizaz utilizinggridsearchcrossvalidationwithadaptiveboostingforaugmentingperformanceofmachinelearningmodels