Cargando…

Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics

Student attrition poses a major challenge to academic institutions, funding bodies and students. With the rise of Big Data and predictive analytics, a growing body of work in higher education research has demonstrated the feasibility of predicting student dropout from readily available macro-level (...

Descripción completa

Detalles Bibliográficos
Autores principales: Matz, Sandra C., Bukow, Christina S., Peters, Heinrich, Deacons, Christine, Dinu, Alice, Stachl, Clemens
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082180/
https://www.ncbi.nlm.nih.gov/pubmed/37029155
http://dx.doi.org/10.1038/s41598-023-32484-w
_version_ 1785021265207099392
author Matz, Sandra C.
Bukow, Christina S.
Peters, Heinrich
Deacons, Christine
Dinu, Alice
Stachl, Clemens
author_facet Matz, Sandra C.
Bukow, Christina S.
Peters, Heinrich
Deacons, Christine
Dinu, Alice
Stachl, Clemens
author_sort Matz, Sandra C.
collection PubMed
description Student attrition poses a major challenge to academic institutions, funding bodies and students. With the rise of Big Data and predictive analytics, a growing body of work in higher education research has demonstrated the feasibility of predicting student dropout from readily available macro-level (e.g., socio-demographics or early performance metrics) and micro-level data (e.g., logins to learning management systems). Yet, the existing work has largely overlooked a critical meso-level element of student success known to drive retention: students’ experience at university and their social embeddedness within their cohort. In partnership with a mobile application that facilitates communication between students and universities, we collected both (1) institutional macro-level data and (2) behavioral micro and meso-level engagement data (e.g., the quantity and quality of interactions with university services and events as well as with other students) to predict dropout after the first semester. Analyzing the records of 50,095 students from four US universities and community colleges, we demonstrate that the combined macro and meso-level data can predict dropout with high levels of predictive performance (average AUC across linear and non-linear models = 78%; max AUC = 88%). Behavioral engagement variables representing students’ experience at university (e.g., network centrality, app engagement, event ratings) were found to add incremental predictive power beyond institutional variables (e.g., GPA or ethnicity). Finally, we highlight the generalizability of our results by showing that models trained on one university can predict retention at another university with reasonably high levels of predictive performance.
format Online
Article
Text
id pubmed-10082180
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-100821802023-04-09 Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics Matz, Sandra C. Bukow, Christina S. Peters, Heinrich Deacons, Christine Dinu, Alice Stachl, Clemens Sci Rep Article Student attrition poses a major challenge to academic institutions, funding bodies and students. With the rise of Big Data and predictive analytics, a growing body of work in higher education research has demonstrated the feasibility of predicting student dropout from readily available macro-level (e.g., socio-demographics or early performance metrics) and micro-level data (e.g., logins to learning management systems). Yet, the existing work has largely overlooked a critical meso-level element of student success known to drive retention: students’ experience at university and their social embeddedness within their cohort. In partnership with a mobile application that facilitates communication between students and universities, we collected both (1) institutional macro-level data and (2) behavioral micro and meso-level engagement data (e.g., the quantity and quality of interactions with university services and events as well as with other students) to predict dropout after the first semester. Analyzing the records of 50,095 students from four US universities and community colleges, we demonstrate that the combined macro and meso-level data can predict dropout with high levels of predictive performance (average AUC across linear and non-linear models = 78%; max AUC = 88%). Behavioral engagement variables representing students’ experience at university (e.g., network centrality, app engagement, event ratings) were found to add incremental predictive power beyond institutional variables (e.g., GPA or ethnicity). Finally, we highlight the generalizability of our results by showing that models trained on one university can predict retention at another university with reasonably high levels of predictive performance. Nature Publishing Group UK 2023-04-07 /pmc/articles/PMC10082180/ /pubmed/37029155 http://dx.doi.org/10.1038/s41598-023-32484-w Text en © The Author(s) 2023, corrected publication 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Matz, Sandra C.
Bukow, Christina S.
Peters, Heinrich
Deacons, Christine
Dinu, Alice
Stachl, Clemens
Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
title Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
title_full Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
title_fullStr Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
title_full_unstemmed Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
title_short Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
title_sort using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082180/
https://www.ncbi.nlm.nih.gov/pubmed/37029155
http://dx.doi.org/10.1038/s41598-023-32484-w
work_keys_str_mv AT matzsandrac usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics
AT bukowchristinas usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics
AT petersheinrich usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics
AT deaconschristine usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics
AT dinualice usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics
AT stachlclemens usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics