Cargando…

Linking of Rasch-Scaled Tests: Consequences of Limited Item Pools and Model Misfit

In the context of item response theory (IRT), linking the scales of two measurement points is a prerequisite to examine a change in competence over time. In educational large-scale assessments, non-identical test forms sharing a number of anchor-items are frequently scaled and linked using two− or t...

Descripción completa

Detalles Bibliográficos
Autores principales: Fischer, Luise, Rohm, Theresa, Carstensen, Claus H., Gnambs, Timo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8289883/
https://www.ncbi.nlm.nih.gov/pubmed/34295279
http://dx.doi.org/10.3389/fpsyg.2021.633896
_version_ 1783724386348433408
author Fischer, Luise
Rohm, Theresa
Carstensen, Claus H.
Gnambs, Timo
author_facet Fischer, Luise
Rohm, Theresa
Carstensen, Claus H.
Gnambs, Timo
author_sort Fischer, Luise
collection PubMed
description In the context of item response theory (IRT), linking the scales of two measurement points is a prerequisite to examine a change in competence over time. In educational large-scale assessments, non-identical test forms sharing a number of anchor-items are frequently scaled and linked using two− or three-parametric item response models. However, if item pools are limited and/or sample sizes are small to medium, the sparser Rasch model is a suitable alternative regarding the precision of parameter estimation. As the Rasch model implies stricter assumptions about the response process, a violation of these assumptions may manifest as model misfit in form of item discrimination parameters empirically deviating from their fixed value of one. The present simulation study investigated the performance of four IRT linking methods—fixed parameter calibration, mean/mean linking, weighted mean/mean linking, and concurrent calibration—applied to Rasch-scaled data with a small item pool. Moreover, the number of anchor items required in the absence/presence of moderate model misfit was investigated in small to medium sample sizes. Effects on the link outcome were operationalized as bias, relative bias, and root mean square error of the estimated sample mean and variance of the latent variable. In the light of this limited context, concurrent calibration had substantial convergence issues, while the other methods resulted in an overall satisfying and similar parameter recovery—even in the presence of moderate model misfit. Our findings suggest that in case of model misfit, the share of anchor items should exceed 20% as is currently proposed in the literature. Future studies should further investigate the effects of anchor item composition regarding unbalanced model misfit.
format Online
Article
Text
id pubmed-8289883
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82898832021-07-21 Linking of Rasch-Scaled Tests: Consequences of Limited Item Pools and Model Misfit Fischer, Luise Rohm, Theresa Carstensen, Claus H. Gnambs, Timo Front Psychol Psychology In the context of item response theory (IRT), linking the scales of two measurement points is a prerequisite to examine a change in competence over time. In educational large-scale assessments, non-identical test forms sharing a number of anchor-items are frequently scaled and linked using two− or three-parametric item response models. However, if item pools are limited and/or sample sizes are small to medium, the sparser Rasch model is a suitable alternative regarding the precision of parameter estimation. As the Rasch model implies stricter assumptions about the response process, a violation of these assumptions may manifest as model misfit in form of item discrimination parameters empirically deviating from their fixed value of one. The present simulation study investigated the performance of four IRT linking methods—fixed parameter calibration, mean/mean linking, weighted mean/mean linking, and concurrent calibration—applied to Rasch-scaled data with a small item pool. Moreover, the number of anchor items required in the absence/presence of moderate model misfit was investigated in small to medium sample sizes. Effects on the link outcome were operationalized as bias, relative bias, and root mean square error of the estimated sample mean and variance of the latent variable. In the light of this limited context, concurrent calibration had substantial convergence issues, while the other methods resulted in an overall satisfying and similar parameter recovery—even in the presence of moderate model misfit. Our findings suggest that in case of model misfit, the share of anchor items should exceed 20% as is currently proposed in the literature. Future studies should further investigate the effects of anchor item composition regarding unbalanced model misfit. Frontiers Media S.A. 2021-07-06 /pmc/articles/PMC8289883/ /pubmed/34295279 http://dx.doi.org/10.3389/fpsyg.2021.633896 Text en Copyright © 2021 Fischer, Rohm, Carstensen and Gnambs. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Fischer, Luise
Rohm, Theresa
Carstensen, Claus H.
Gnambs, Timo
Linking of Rasch-Scaled Tests: Consequences of Limited Item Pools and Model Misfit
title Linking of Rasch-Scaled Tests: Consequences of Limited Item Pools and Model Misfit
title_full Linking of Rasch-Scaled Tests: Consequences of Limited Item Pools and Model Misfit
title_fullStr Linking of Rasch-Scaled Tests: Consequences of Limited Item Pools and Model Misfit
title_full_unstemmed Linking of Rasch-Scaled Tests: Consequences of Limited Item Pools and Model Misfit
title_short Linking of Rasch-Scaled Tests: Consequences of Limited Item Pools and Model Misfit
title_sort linking of rasch-scaled tests: consequences of limited item pools and model misfit
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8289883/
https://www.ncbi.nlm.nih.gov/pubmed/34295279
http://dx.doi.org/10.3389/fpsyg.2021.633896
work_keys_str_mv AT fischerluise linkingofraschscaledtestsconsequencesoflimiteditempoolsandmodelmisfit
AT rohmtheresa linkingofraschscaledtestsconsequencesoflimiteditempoolsandmodelmisfit
AT carstensenclaush linkingofraschscaledtestsconsequencesoflimiteditempoolsandmodelmisfit
AT gnambstimo linkingofraschscaledtestsconsequencesoflimiteditempoolsandmodelmisfit