Cargando…

An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques

International large-scale assessments, such as PISA, provide structured and static data. However, due to its extensive databases, several researchers place it as a reference in Big Data in Education. With the goal of exploring which factors at country, school and student level have a higher relevanc...

Descripción completa

Detalles Bibliográficos
Autores principales: Gamazo, Adriana, Martínez-Abad, Fernando
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728733/
https://www.ncbi.nlm.nih.gov/pubmed/33329221
http://dx.doi.org/10.3389/fpsyg.2020.575167
_version_ 1783621333216657408
author Gamazo, Adriana
Martínez-Abad, Fernando
author_facet Gamazo, Adriana
Martínez-Abad, Fernando
author_sort Gamazo, Adriana
collection PubMed
description International large-scale assessments, such as PISA, provide structured and static data. However, due to its extensive databases, several researchers place it as a reference in Big Data in Education. With the goal of exploring which factors at country, school and student level have a higher relevance in predicting student performance, this paper proposes an Educational Data Mining approach to detect and analyze factors linked to academic performance. To this end, we conducted a secondary data analysis and built decision trees (C4.5 algorithm) to obtain a predictive model of school performance. Specifically, we selected as predictor variables a set of socioeconomic, process and outcome variables from PISA 2018 and other sources (World Bank, 2020). Since the unit of analysis were schools from all the countries included in PISA 2018 (n = 21,903), student and teacher predictor variables were imputed to the school database. Based on the available student performance scores in Reading, Math, and Science, we applied k-means clustering to obtain a categorized (three categories) target variable of global school performance. Results show the existence of two main branches in the decision tree, split according to the schools’ mean socioeconomic status (SES). While performance in high-SES schools is influenced by educational factors such as metacognitive strategies or achievement motivation, performance in low-SES schools is affected in greater measure by country-level socioeconomic indicators such as GDP, and individual educational indicators are relegated to a secondary level. Since these evidences are in line and delve into previous research, this work concludes by analyzing its potential contribution to support the decision making processes regarding educational policies.
format Online
Article
Text
id pubmed-7728733
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-77287332020-12-15 An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques Gamazo, Adriana Martínez-Abad, Fernando Front Psychol Psychology International large-scale assessments, such as PISA, provide structured and static data. However, due to its extensive databases, several researchers place it as a reference in Big Data in Education. With the goal of exploring which factors at country, school and student level have a higher relevance in predicting student performance, this paper proposes an Educational Data Mining approach to detect and analyze factors linked to academic performance. To this end, we conducted a secondary data analysis and built decision trees (C4.5 algorithm) to obtain a predictive model of school performance. Specifically, we selected as predictor variables a set of socioeconomic, process and outcome variables from PISA 2018 and other sources (World Bank, 2020). Since the unit of analysis were schools from all the countries included in PISA 2018 (n = 21,903), student and teacher predictor variables were imputed to the school database. Based on the available student performance scores in Reading, Math, and Science, we applied k-means clustering to obtain a categorized (three categories) target variable of global school performance. Results show the existence of two main branches in the decision tree, split according to the schools’ mean socioeconomic status (SES). While performance in high-SES schools is influenced by educational factors such as metacognitive strategies or achievement motivation, performance in low-SES schools is affected in greater measure by country-level socioeconomic indicators such as GDP, and individual educational indicators are relegated to a secondary level. Since these evidences are in line and delve into previous research, this work concludes by analyzing its potential contribution to support the decision making processes regarding educational policies. Frontiers Media S.A. 2020-11-27 /pmc/articles/PMC7728733/ /pubmed/33329221 http://dx.doi.org/10.3389/fpsyg.2020.575167 Text en Copyright © 2020 Gamazo and Martínez-Abad. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Gamazo, Adriana
Martínez-Abad, Fernando
An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques
title An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques
title_full An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques
title_fullStr An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques
title_full_unstemmed An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques
title_short An Exploration of Factors Linked to Academic Performance in PISA 2018 Through Data Mining Techniques
title_sort exploration of factors linked to academic performance in pisa 2018 through data mining techniques
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728733/
https://www.ncbi.nlm.nih.gov/pubmed/33329221
http://dx.doi.org/10.3389/fpsyg.2020.575167
work_keys_str_mv AT gamazoadriana anexplorationoffactorslinkedtoacademicperformanceinpisa2018throughdataminingtechniques
AT martinezabadfernando anexplorationoffactorslinkedtoacademicperformanceinpisa2018throughdataminingtechniques
AT gamazoadriana explorationoffactorslinkedtoacademicperformanceinpisa2018throughdataminingtechniques
AT martinezabadfernando explorationoffactorslinkedtoacademicperformanceinpisa2018throughdataminingtechniques