Cargando…

Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing

The increasing digitalization in the field of psychological and educational testing opens up new opportunities to innovate assessments in many respects (e.g., new item formats, flexible test assembly, efficient data handling). In particular, computerized adaptive testing provides the opportunity to...

Descripción completa

Detalles Bibliográficos
Autores principales:	Born, Sebastian, Fink, Aron, Spoden, Christian, Frey, Andreas
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2019
Materias:	Psychology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6563622/ https://www.ncbi.nlm.nih.gov/pubmed/31244717 http://dx.doi.org/10.3389/fpsyg.2019.01277

_version_	1783426573581418496
author	Born, Sebastian Fink, Aron Spoden, Christian Frey, Andreas
author_facet	Born, Sebastian Fink, Aron Spoden, Christian Frey, Andreas
author_sort	Born, Sebastian
collection	PubMed
description	The increasing digitalization in the field of psychological and educational testing opens up new opportunities to innovate assessments in many respects (e.g., new item formats, flexible test assembly, efficient data handling). In particular, computerized adaptive testing provides the opportunity to make tests more individualized and more efficient. The newly developed continuous calibration strategy (CCS) from Fink et al. (2018) makes it possible to construct computerized adaptive tests in application areas where separate calibration studies are not feasible. Due to the goal of reporting on a common metric across test cycles, the equating is crucial for the CCS. The quality of the equating depends on the common items selected and the scale transformation method applied. Given the novelty of the CCS, the aim of the study was to evaluate different equating setups in the CCS and to derive practical recommendations. The impact of different equating setups on the precision of item parameter estimates and on the quality of the equating was examined in a Monte Carlo simulation, based on a fully crossed design with the factors common item difficulty distribution (bimodal, normal, uniform), scale transformation method (mean/mean, mean/sigma, Haebara, Stocking-Lord), and sample size per test cycle (50, 100, 300). The quality of the equating was operationalized by three criteria (proportion of feasible equatings, proportion of drifted items, and error of transformation constants). The precision of the item parameter estimates increased with increasing sample size per test cycle, but no substantial difference was found with respect to the common item difficulty distribution and the scale transformation method. With regard to the feasibility of the equatings, no differences were found for the different scale transformation methods. However, when using the moment methods (mean/mean, mean/sigma), quite extreme levels of error for the transformation constants A and B occurred. Among the characteristic curve method the performance of the Stocking-Lord method was slightly better than for the Haebara method. Thus, while no clear recommendation can be made with regard to the common item difficulty distribution, the characteristic curve methods turned out to be the most favorable scale transformation methods within the CCS.
format	Online Article Text
id	pubmed-6563622
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-65636222019-06-26 Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing Born, Sebastian Fink, Aron Spoden, Christian Frey, Andreas Front Psychol Psychology The increasing digitalization in the field of psychological and educational testing opens up new opportunities to innovate assessments in many respects (e.g., new item formats, flexible test assembly, efficient data handling). In particular, computerized adaptive testing provides the opportunity to make tests more individualized and more efficient. The newly developed continuous calibration strategy (CCS) from Fink et al. (2018) makes it possible to construct computerized adaptive tests in application areas where separate calibration studies are not feasible. Due to the goal of reporting on a common metric across test cycles, the equating is crucial for the CCS. The quality of the equating depends on the common items selected and the scale transformation method applied. Given the novelty of the CCS, the aim of the study was to evaluate different equating setups in the CCS and to derive practical recommendations. The impact of different equating setups on the precision of item parameter estimates and on the quality of the equating was examined in a Monte Carlo simulation, based on a fully crossed design with the factors common item difficulty distribution (bimodal, normal, uniform), scale transformation method (mean/mean, mean/sigma, Haebara, Stocking-Lord), and sample size per test cycle (50, 100, 300). The quality of the equating was operationalized by three criteria (proportion of feasible equatings, proportion of drifted items, and error of transformation constants). The precision of the item parameter estimates increased with increasing sample size per test cycle, but no substantial difference was found with respect to the common item difficulty distribution and the scale transformation method. With regard to the feasibility of the equatings, no differences were found for the different scale transformation methods. However, when using the moment methods (mean/mean, mean/sigma), quite extreme levels of error for the transformation constants A and B occurred. Among the characteristic curve method the performance of the Stocking-Lord method was slightly better than for the Haebara method. Thus, while no clear recommendation can be made with regard to the common item difficulty distribution, the characteristic curve methods turned out to be the most favorable scale transformation methods within the CCS. Frontiers Media S.A. 2019-06-06 /pmc/articles/PMC6563622/ /pubmed/31244717 http://dx.doi.org/10.3389/fpsyg.2019.01277 Text en Copyright © 2019 Born, Fink, Spoden and Frey. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Psychology Born, Sebastian Fink, Aron Spoden, Christian Frey, Andreas Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing
title	Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing
title_full	Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing
title_fullStr	Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing
title_full_unstemmed	Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing
title_short	Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing
title_sort	evaluating different equating setups in the continuous item pool calibration for computerized adaptive testing
topic	Psychology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6563622/ https://www.ncbi.nlm.nih.gov/pubmed/31244717 http://dx.doi.org/10.3389/fpsyg.2019.01277
work_keys_str_mv	AT bornsebastian evaluatingdifferentequatingsetupsinthecontinuousitempoolcalibrationforcomputerizedadaptivetesting AT finkaron evaluatingdifferentequatingsetupsinthecontinuousitempoolcalibrationforcomputerizedadaptivetesting AT spodenchristian evaluatingdifferentequatingsetupsinthecontinuousitempoolcalibrationforcomputerizedadaptivetesting AT freyandreas evaluatingdifferentequatingsetupsinthecontinuousitempoolcalibrationforcomputerizedadaptivetesting

Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing

Ejemplares similares