Cargando…
Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
Student attrition poses a major challenge to academic institutions, funding bodies and students. With the rise of Big Data and predictive analytics, a growing body of work in higher education research has demonstrated the feasibility of predicting student dropout from readily available macro-level (...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082180/ https://www.ncbi.nlm.nih.gov/pubmed/37029155 http://dx.doi.org/10.1038/s41598-023-32484-w |
_version_ | 1785021265207099392 |
---|---|
author | Matz, Sandra C. Bukow, Christina S. Peters, Heinrich Deacons, Christine Dinu, Alice Stachl, Clemens |
author_facet | Matz, Sandra C. Bukow, Christina S. Peters, Heinrich Deacons, Christine Dinu, Alice Stachl, Clemens |
author_sort | Matz, Sandra C. |
collection | PubMed |
description | Student attrition poses a major challenge to academic institutions, funding bodies and students. With the rise of Big Data and predictive analytics, a growing body of work in higher education research has demonstrated the feasibility of predicting student dropout from readily available macro-level (e.g., socio-demographics or early performance metrics) and micro-level data (e.g., logins to learning management systems). Yet, the existing work has largely overlooked a critical meso-level element of student success known to drive retention: students’ experience at university and their social embeddedness within their cohort. In partnership with a mobile application that facilitates communication between students and universities, we collected both (1) institutional macro-level data and (2) behavioral micro and meso-level engagement data (e.g., the quantity and quality of interactions with university services and events as well as with other students) to predict dropout after the first semester. Analyzing the records of 50,095 students from four US universities and community colleges, we demonstrate that the combined macro and meso-level data can predict dropout with high levels of predictive performance (average AUC across linear and non-linear models = 78%; max AUC = 88%). Behavioral engagement variables representing students’ experience at university (e.g., network centrality, app engagement, event ratings) were found to add incremental predictive power beyond institutional variables (e.g., GPA or ethnicity). Finally, we highlight the generalizability of our results by showing that models trained on one university can predict retention at another university with reasonably high levels of predictive performance. |
format | Online Article Text |
id | pubmed-10082180 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-100821802023-04-09 Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics Matz, Sandra C. Bukow, Christina S. Peters, Heinrich Deacons, Christine Dinu, Alice Stachl, Clemens Sci Rep Article Student attrition poses a major challenge to academic institutions, funding bodies and students. With the rise of Big Data and predictive analytics, a growing body of work in higher education research has demonstrated the feasibility of predicting student dropout from readily available macro-level (e.g., socio-demographics or early performance metrics) and micro-level data (e.g., logins to learning management systems). Yet, the existing work has largely overlooked a critical meso-level element of student success known to drive retention: students’ experience at university and their social embeddedness within their cohort. In partnership with a mobile application that facilitates communication between students and universities, we collected both (1) institutional macro-level data and (2) behavioral micro and meso-level engagement data (e.g., the quantity and quality of interactions with university services and events as well as with other students) to predict dropout after the first semester. Analyzing the records of 50,095 students from four US universities and community colleges, we demonstrate that the combined macro and meso-level data can predict dropout with high levels of predictive performance (average AUC across linear and non-linear models = 78%; max AUC = 88%). Behavioral engagement variables representing students’ experience at university (e.g., network centrality, app engagement, event ratings) were found to add incremental predictive power beyond institutional variables (e.g., GPA or ethnicity). Finally, we highlight the generalizability of our results by showing that models trained on one university can predict retention at another university with reasonably high levels of predictive performance. Nature Publishing Group UK 2023-04-07 /pmc/articles/PMC10082180/ /pubmed/37029155 http://dx.doi.org/10.1038/s41598-023-32484-w Text en © The Author(s) 2023, corrected publication 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Matz, Sandra C. Bukow, Christina S. Peters, Heinrich Deacons, Christine Dinu, Alice Stachl, Clemens Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics |
title | Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics |
title_full | Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics |
title_fullStr | Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics |
title_full_unstemmed | Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics |
title_short | Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics |
title_sort | using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082180/ https://www.ncbi.nlm.nih.gov/pubmed/37029155 http://dx.doi.org/10.1038/s41598-023-32484-w |
work_keys_str_mv | AT matzsandrac usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics AT bukowchristinas usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics AT petersheinrich usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics AT deaconschristine usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics AT dinualice usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics AT stachlclemens usingmachinelearningtopredictstudentretentionfromsociodemographiccharacteristicsandappbasedengagementmetrics |