Cargando…

Predicting Student Performance from Online Engagement Activities Using Novel Statistical Features

Predicting students’ performance during their years of academic study has been investigated tremendously. It offers important insights that can help and guide institutions to make timely decisions and changes leading to better student outcome achievements. In the post-COVID-19 pandemic era, the adop...

Descripción completa

Detalles Bibliográficos
Autor principal: Brahim, Ghassen Ben
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8762194/
https://www.ncbi.nlm.nih.gov/pubmed/35070634
http://dx.doi.org/10.1007/s13369-021-06548-w
_version_ 1784633705522790400
author Brahim, Ghassen Ben
author_facet Brahim, Ghassen Ben
author_sort Brahim, Ghassen Ben
collection PubMed
description Predicting students’ performance during their years of academic study has been investigated tremendously. It offers important insights that can help and guide institutions to make timely decisions and changes leading to better student outcome achievements. In the post-COVID-19 pandemic era, the adoption of e-learning has gained momentum and has increased the availability of online related learning data. This has encouraged researchers to develop machine learning (ML)-based models to predict students’ performance during online classes. The study presented in this paper, focuses on predicting student performance during a series of online interactive sessions by considering a dataset collected using digital electronics education and design suite. The dataset tracks the interaction of students during online lab work in terms of text editing, a number of keystrokes, time spent in each activity, etc., along with the exam score achieved per session. Our proposed prediction model consists of extracting a total of 86 novel statistical features, which were semantically categorized in three broad categories based on different criteria: (1) activity type, (2) timing statistics, and (3) peripheral activity count. This set of features were further reduced during the feature selection phase and only influential features were retained for training purposes. Our proposed ML model aims to predict whether a student’s performance will be low or high. Five popular classifiers were used in our study, namely: random forest (RF), support vector machine, Naïve Bayes, logistic regression, and multilayer perceptron. We evaluated our model under three different scenarios: (1) 80:20 random data split for training and testing, (2) fivefold cross-validation, and (3) train the model on all sessions but one which will be used for testing. Results showed that our model achieved the best classification accuracy performance of 97.4% with the RF classifier. We demonstrated that, under similar experimental setup, our model outperformed other existing studies.
format Online
Article
Text
id pubmed-8762194
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-87621942022-01-18 Predicting Student Performance from Online Engagement Activities Using Novel Statistical Features Brahim, Ghassen Ben Arab J Sci Eng Research Article-Computer Engineering and Computer Science Predicting students’ performance during their years of academic study has been investigated tremendously. It offers important insights that can help and guide institutions to make timely decisions and changes leading to better student outcome achievements. In the post-COVID-19 pandemic era, the adoption of e-learning has gained momentum and has increased the availability of online related learning data. This has encouraged researchers to develop machine learning (ML)-based models to predict students’ performance during online classes. The study presented in this paper, focuses on predicting student performance during a series of online interactive sessions by considering a dataset collected using digital electronics education and design suite. The dataset tracks the interaction of students during online lab work in terms of text editing, a number of keystrokes, time spent in each activity, etc., along with the exam score achieved per session. Our proposed prediction model consists of extracting a total of 86 novel statistical features, which were semantically categorized in three broad categories based on different criteria: (1) activity type, (2) timing statistics, and (3) peripheral activity count. This set of features were further reduced during the feature selection phase and only influential features were retained for training purposes. Our proposed ML model aims to predict whether a student’s performance will be low or high. Five popular classifiers were used in our study, namely: random forest (RF), support vector machine, Naïve Bayes, logistic regression, and multilayer perceptron. We evaluated our model under three different scenarios: (1) 80:20 random data split for training and testing, (2) fivefold cross-validation, and (3) train the model on all sessions but one which will be used for testing. Results showed that our model achieved the best classification accuracy performance of 97.4% with the RF classifier. We demonstrated that, under similar experimental setup, our model outperformed other existing studies. Springer Berlin Heidelberg 2022-01-17 2022 /pmc/articles/PMC8762194/ /pubmed/35070634 http://dx.doi.org/10.1007/s13369-021-06548-w Text en © King Fahd University of Petroleum & Minerals 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Research Article-Computer Engineering and Computer Science
Brahim, Ghassen Ben
Predicting Student Performance from Online Engagement Activities Using Novel Statistical Features
title Predicting Student Performance from Online Engagement Activities Using Novel Statistical Features
title_full Predicting Student Performance from Online Engagement Activities Using Novel Statistical Features
title_fullStr Predicting Student Performance from Online Engagement Activities Using Novel Statistical Features
title_full_unstemmed Predicting Student Performance from Online Engagement Activities Using Novel Statistical Features
title_short Predicting Student Performance from Online Engagement Activities Using Novel Statistical Features
title_sort predicting student performance from online engagement activities using novel statistical features
topic Research Article-Computer Engineering and Computer Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8762194/
https://www.ncbi.nlm.nih.gov/pubmed/35070634
http://dx.doi.org/10.1007/s13369-021-06548-w
work_keys_str_mv AT brahimghassenben predictingstudentperformancefromonlineengagementactivitiesusingnovelstatisticalfeatures