Cargando…

Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests

The Programme for International Student Assessment (PISA) introduced the measurement of problem-solving skills in the 2012 cycle. The items in this new domain employ scenario-based environments in terms of students interacting with computers. Process data collected from log files are a record of stu...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Zhuangzhuang, He, Qiwei, von Davier, Matthias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882413/
https://www.ncbi.nlm.nih.gov/pubmed/31824363
http://dx.doi.org/10.3389/fpsyg.2019.02461
_version_ 1783474157160235008
author Han, Zhuangzhuang
He, Qiwei
von Davier, Matthias
author_facet Han, Zhuangzhuang
He, Qiwei
von Davier, Matthias
author_sort Han, Zhuangzhuang
collection PubMed
description The Programme for International Student Assessment (PISA) introduced the measurement of problem-solving skills in the 2012 cycle. The items in this new domain employ scenario-based environments in terms of students interacting with computers. Process data collected from log files are a record of students’ interactions with the testing platform. This study suggests a two-stage approach for generating features from process data and selecting the features that predict students’ responses using a released problem-solving item—the Climate Control Task. The primary objectives of the study are (1) introducing an approach for generating features from the process data and using them to predict the response to this item, and (2) finding out which features have the most predictive value. To achieve these goals, a tree-based ensemble method, the random forest algorithm, is used to explore the association between response data and predictive features. Also, features can be ranked by importance in terms of predictive performance. This study can be considered as providing an alternative way to analyze process data having a pedagogical purpose.
format Online
Article
Text
id pubmed-6882413
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-68824132019-12-10 Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests Han, Zhuangzhuang He, Qiwei von Davier, Matthias Front Psychol Psychology The Programme for International Student Assessment (PISA) introduced the measurement of problem-solving skills in the 2012 cycle. The items in this new domain employ scenario-based environments in terms of students interacting with computers. Process data collected from log files are a record of students’ interactions with the testing platform. This study suggests a two-stage approach for generating features from process data and selecting the features that predict students’ responses using a released problem-solving item—the Climate Control Task. The primary objectives of the study are (1) introducing an approach for generating features from the process data and using them to predict the response to this item, and (2) finding out which features have the most predictive value. To achieve these goals, a tree-based ensemble method, the random forest algorithm, is used to explore the association between response data and predictive features. Also, features can be ranked by importance in terms of predictive performance. This study can be considered as providing an alternative way to analyze process data having a pedagogical purpose. Frontiers Media S.A. 2019-11-21 /pmc/articles/PMC6882413/ /pubmed/31824363 http://dx.doi.org/10.3389/fpsyg.2019.02461 Text en Copyright © 2019 Han, He and von Davier. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Han, Zhuangzhuang
He, Qiwei
von Davier, Matthias
Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests
title Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests
title_full Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests
title_fullStr Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests
title_full_unstemmed Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests
title_short Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests
title_sort predictive feature generation and selection using process data from pisa interactive problem-solving items: an application of random forests
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882413/
https://www.ncbi.nlm.nih.gov/pubmed/31824363
http://dx.doi.org/10.3389/fpsyg.2019.02461
work_keys_str_mv AT hanzhuangzhuang predictivefeaturegenerationandselectionusingprocessdatafrompisainteractiveproblemsolvingitemsanapplicationofrandomforests
AT heqiwei predictivefeaturegenerationandselectionusingprocessdatafrompisainteractiveproblemsolvingitemsanapplicationofrandomforests
AT vondaviermatthias predictivefeaturegenerationandselectionusingprocessdatafrompisainteractiveproblemsolvingitemsanapplicationofrandomforests