Cargando…
Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests
The Programme for International Student Assessment (PISA) introduced the measurement of problem-solving skills in the 2012 cycle. The items in this new domain employ scenario-based environments in terms of students interacting with computers. Process data collected from log files are a record of stu...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882413/ https://www.ncbi.nlm.nih.gov/pubmed/31824363 http://dx.doi.org/10.3389/fpsyg.2019.02461 |
_version_ | 1783474157160235008 |
---|---|
author | Han, Zhuangzhuang He, Qiwei von Davier, Matthias |
author_facet | Han, Zhuangzhuang He, Qiwei von Davier, Matthias |
author_sort | Han, Zhuangzhuang |
collection | PubMed |
description | The Programme for International Student Assessment (PISA) introduced the measurement of problem-solving skills in the 2012 cycle. The items in this new domain employ scenario-based environments in terms of students interacting with computers. Process data collected from log files are a record of students’ interactions with the testing platform. This study suggests a two-stage approach for generating features from process data and selecting the features that predict students’ responses using a released problem-solving item—the Climate Control Task. The primary objectives of the study are (1) introducing an approach for generating features from the process data and using them to predict the response to this item, and (2) finding out which features have the most predictive value. To achieve these goals, a tree-based ensemble method, the random forest algorithm, is used to explore the association between response data and predictive features. Also, features can be ranked by importance in terms of predictive performance. This study can be considered as providing an alternative way to analyze process data having a pedagogical purpose. |
format | Online Article Text |
id | pubmed-6882413 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-68824132019-12-10 Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests Han, Zhuangzhuang He, Qiwei von Davier, Matthias Front Psychol Psychology The Programme for International Student Assessment (PISA) introduced the measurement of problem-solving skills in the 2012 cycle. The items in this new domain employ scenario-based environments in terms of students interacting with computers. Process data collected from log files are a record of students’ interactions with the testing platform. This study suggests a two-stage approach for generating features from process data and selecting the features that predict students’ responses using a released problem-solving item—the Climate Control Task. The primary objectives of the study are (1) introducing an approach for generating features from the process data and using them to predict the response to this item, and (2) finding out which features have the most predictive value. To achieve these goals, a tree-based ensemble method, the random forest algorithm, is used to explore the association between response data and predictive features. Also, features can be ranked by importance in terms of predictive performance. This study can be considered as providing an alternative way to analyze process data having a pedagogical purpose. Frontiers Media S.A. 2019-11-21 /pmc/articles/PMC6882413/ /pubmed/31824363 http://dx.doi.org/10.3389/fpsyg.2019.02461 Text en Copyright © 2019 Han, He and von Davier. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Psychology Han, Zhuangzhuang He, Qiwei von Davier, Matthias Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests |
title | Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests |
title_full | Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests |
title_fullStr | Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests |
title_full_unstemmed | Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests |
title_short | Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests |
title_sort | predictive feature generation and selection using process data from pisa interactive problem-solving items: an application of random forests |
topic | Psychology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882413/ https://www.ncbi.nlm.nih.gov/pubmed/31824363 http://dx.doi.org/10.3389/fpsyg.2019.02461 |
work_keys_str_mv | AT hanzhuangzhuang predictivefeaturegenerationandselectionusingprocessdatafrompisainteractiveproblemsolvingitemsanapplicationofrandomforests AT heqiwei predictivefeaturegenerationandselectionusingprocessdatafrompisainteractiveproblemsolvingitemsanapplicationofrandomforests AT vondaviermatthias predictivefeaturegenerationandselectionusingprocessdatafrompisainteractiveproblemsolvingitemsanapplicationofrandomforests |