Methods for Estimating Item-Score Reliability

Reliability is usually estimated for a test score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the item’s contribution to the test score’s reliability, for identifying unreliable scores in aberrant item-score patterns in person-fit analysis, and for s...

Bibliographic Details
Main Authors: Zijlmans, Eva A. O., van der Ark, L. Andries, Tijmstra, Jesper, Sijtsma, Klaas
Format: Online Article Text
Language: English
Published: SAGE Publications 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6140096/
https://www.ncbi.nlm.nih.gov/pubmed/30237646
http://dx.doi.org/10.1177/0146621618758290
author Zijlmans, Eva A. O.
van der Ark, L. Andries
Tijmstra, Jesper
Sijtsma, Klaas
collection PubMed
description Reliability is usually estimated for a test score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the item’s contribution to the test score’s reliability, for identifying unreliable scores in aberrant item-score patterns in person-fit analysis, and for selecting the most reliable item from a test to use as a single-item measure. Four methods were discussed for estimating item-score reliability: the Molenaar–Sijtsma method (method MS), Guttman’s method [Formula: see text], the latent class reliability coefficient (method LCRC), and the correction for attenuation (method CA). A simulation study was used to compare the methods with respect to median bias, variability (interquartile range [IQR]), and percentage of outliers. The simulation study consisted of six conditions: standard, polytomous items, unequal [Formula: see text] parameters, two-dimensional data, long test, and small sample size. Methods MS and CA were the most accurate. Method LCRC showed almost unbiased results, but large variability. Method [Formula: see text] consistently underestimated item-score reliability, but showed a smaller IQR than the other methods.
format Online
Article
Text
id pubmed-6140096
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-6140096 2018-09-25 Methods for Estimating Item-Score Reliability Zijlmans, Eva A. O. van der Ark, L. Andries Tijmstra, Jesper Sijtsma, Klaas Appl Psychol Meas Articles SAGE Publications 2018-04-09 2018-10 /pmc/articles/PMC6140096/ /pubmed/30237646 http://dx.doi.org/10.1177/0146621618758290 Text en © The Author(s) 2018 http://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution 4.0 License (http://www.creativecommons.org/licenses/by/4.0/), which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
title Methods for Estimating Item-Score Reliability
topic Articles