Cargando…
An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques
International large-scale assessments, such as PISA, provide structured and static data. However, due to its extensive databases, several researchers place it as a reference in Big Data in Education. With the goal of exploring which factors at country, school and student level have a higher relevanc...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728733/ https://www.ncbi.nlm.nih.gov/pubmed/33329221 http://dx.doi.org/10.3389/fpsyg.2020.575167 |
_version_ | 1783621333216657408 |
---|---|
author | Gamazo, Adriana Martínez-Abad, Fernando |
author_facet | Gamazo, Adriana Martínez-Abad, Fernando |
author_sort | Gamazo, Adriana |
collection | PubMed |
description | International large-scale assessments, such as PISA, provide structured and static data. However, due to its extensive databases, several researchers place it as a reference in Big Data in Education. With the goal of exploring which factors at country, school and student level have a higher relevance in predicting student performance, this paper proposes an Educational Data Mining approach to detect and analyze factors linked to academic performance. To this end, we conducted a secondary data analysis and built decision trees (C4.5 algorithm) to obtain a predictive model of school performance. Specifically, we selected as predictor variables a set of socioeconomic, process and outcome variables from PISA 2018 and other sources (World Bank, 2020). Since the unit of analysis were schools from all the countries included in PISA 2018 (n = 21,903), student and teacher predictor variables were imputed to the school database. Based on the available student performance scores in Reading, Math, and Science, we applied k-means clustering to obtain a categorized (three categories) target variable of global school performance. Results show the existence of two main branches in the decision tree, split according to the schools’ mean socioeconomic status (SES). While performance in high-SES schools is influenced by educational factors such as metacognitive strategies or achievement motivation, performance in low-SES schools is affected in greater measure by country-level socioeconomic indicators such as GDP, and individual educational indicators are relegated to a secondary level. Since these evidences are in line and delve into previous research, this work concludes by analyzing its potential contribution to support the decision making processes regarding educational policies. |
format | Online Article Text |
id | pubmed-7728733 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-77287332020-12-15 An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques Gamazo, Adriana Martínez-Abad, Fernando Front Psychol Psychology International large-scale assessments, such as PISA, provide structured and static data. However, due to its extensive databases, several researchers place it as a reference in Big Data in Education. With the goal of exploring which factors at country, school and student level have a higher relevance in predicting student performance, this paper proposes an Educational Data Mining approach to detect and analyze factors linked to academic performance. To this end, we conducted a secondary data analysis and built decision trees (C4.5 algorithm) to obtain a predictive model of school performance. Specifically, we selected as predictor variables a set of socioeconomic, process and outcome variables from PISA 2018 and other sources (World Bank, 2020). Since the unit of analysis were schools from all the countries included in PISA 2018 (n = 21,903), student and teacher predictor variables were imputed to the school database. Based on the available student performance scores in Reading, Math, and Science, we applied k-means clustering to obtain a categorized (three categories) target variable of global school performance. Results show the existence of two main branches in the decision tree, split according to the schools’ mean socioeconomic status (SES). While performance in high-SES schools is influenced by educational factors such as metacognitive strategies or achievement motivation, performance in low-SES schools is affected in greater measure by country-level socioeconomic indicators such as GDP, and individual educational indicators are relegated to a secondary level. Since these evidences are in line and delve into previous research, this work concludes by analyzing its potential contribution to support the decision making processes regarding educational policies. Frontiers Media S.A. 2020-11-27 /pmc/articles/PMC7728733/ /pubmed/33329221 http://dx.doi.org/10.3389/fpsyg.2020.575167 Text en Copyright © 2020 Gamazo and Martínez-Abad. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Psychology Gamazo, Adriana Martínez-Abad, Fernando An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques |
title | An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques |
title_full | An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques |
title_fullStr | An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques |
title_full_unstemmed | An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques |
title_short | An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques |
title_sort | exploration of factors linked to academic performance in pisa 2018 through data mining techniques |
topic | Psychology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728733/ https://www.ncbi.nlm.nih.gov/pubmed/33329221 http://dx.doi.org/10.3389/fpsyg.2020.575167 |
work_keys_str_mv | AT gamazoadriana anexplorationoffactorslinkedtoacademicperformanceinpisa2018throughdataminingtechniques AT martinezabadfernando anexplorationoffactorslinkedtoacademicperformanceinpisa2018throughdataminingtechniques AT gamazoadriana explorationoffactorslinkedtoacademicperformanceinpisa2018throughdataminingtechniques AT martinezabadfernando explorationoffactorslinkedtoacademicperformanceinpisa2018throughdataminingtechniques |