Cargando…

On the Statistical and Practical Limitations of Thurstonian IRT Models

Forced-choice questionnaires have been proposed to avoid common response biases typically associated with rating scale questionnaires. To overcome ipsativity issues of trait scores obtained from classical scoring approaches of forced-choice items, advanced methods from item response theory (IRT) suc...

Descripción completa

Detalles Bibliográficos
Autores principales: Bürkner, Paul-Christian, Schulte, Niklas, Holling, Heinz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6713979/
https://www.ncbi.nlm.nih.gov/pubmed/31488915
http://dx.doi.org/10.1177/0013164419832063
_version_ 1783446972714188800
author Bürkner, Paul-Christian
Schulte, Niklas
Holling, Heinz
author_facet Bürkner, Paul-Christian
Schulte, Niklas
Holling, Heinz
author_sort Bürkner, Paul-Christian
collection PubMed
description Forced-choice questionnaires have been proposed to avoid common response biases typically associated with rating scale questionnaires. To overcome ipsativity issues of trait scores obtained from classical scoring approaches of forced-choice items, advanced methods from item response theory (IRT) such as the Thurstonian IRT model have been proposed. For convenient model specification, we introduce the thurstonianIRT R package, which uses Mplus, lavaan, and Stan for model estimation. Based on practical considerations, we establish that items within one block need to be equally keyed to achieve similar social desirability, which is essential for creating forced-choice questionnaires that have the potential to resist faking intentions. According to extensive simulations, measuring up to five traits using blocks of only equally keyed items does not yield sufficiently accurate trait scores and inter-trait correlation estimates, neither for frequentist nor for Bayesian estimation methods. As a result, persons’ trait scores remain partially ipsative and, thus, do not allow for valid comparisons between persons. However, we demonstrate that trait scores based on only equally keyed blocks can be improved substantially by measuring a sizable number of traits. More specifically, in our simulations of 30 traits, scores based on only equally keyed blocks were non-ipsative and highly accurate. We conclude that in high-stakes situations where persons are motivated to give fake answers, Thurstonian IRT models should only be applied to tests measuring a sizable number of traits.
format Online
Article
Text
id pubmed-6713979
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-67139792020-10-01 On the Statistical and Practical Limitations of Thurstonian IRT Models Bürkner, Paul-Christian Schulte, Niklas Holling, Heinz Educ Psychol Meas Article Forced-choice questionnaires have been proposed to avoid common response biases typically associated with rating scale questionnaires. To overcome ipsativity issues of trait scores obtained from classical scoring approaches of forced-choice items, advanced methods from item response theory (IRT) such as the Thurstonian IRT model have been proposed. For convenient model specification, we introduce the thurstonianIRT R package, which uses Mplus, lavaan, and Stan for model estimation. Based on practical considerations, we establish that items within one block need to be equally keyed to achieve similar social desirability, which is essential for creating forced-choice questionnaires that have the potential to resist faking intentions. According to extensive simulations, measuring up to five traits using blocks of only equally keyed items does not yield sufficiently accurate trait scores and inter-trait correlation estimates, neither for frequentist nor for Bayesian estimation methods. As a result, persons’ trait scores remain partially ipsative and, thus, do not allow for valid comparisons between persons. However, we demonstrate that trait scores based on only equally keyed blocks can be improved substantially by measuring a sizable number of traits. More specifically, in our simulations of 30 traits, scores based on only equally keyed blocks were non-ipsative and highly accurate. We conclude that in high-stakes situations where persons are motivated to give fake answers, Thurstonian IRT models should only be applied to tests measuring a sizable number of traits. SAGE Publications 2019-02-22 2019-10 /pmc/articles/PMC6713979/ /pubmed/31488915 http://dx.doi.org/10.1177/0013164419832063 Text en © The Author(s) 2019 http://www.creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Article
Bürkner, Paul-Christian
Schulte, Niklas
Holling, Heinz
On the Statistical and Practical Limitations of Thurstonian IRT Models
title On the Statistical and Practical Limitations of Thurstonian IRT Models
title_full On the Statistical and Practical Limitations of Thurstonian IRT Models
title_fullStr On the Statistical and Practical Limitations of Thurstonian IRT Models
title_full_unstemmed On the Statistical and Practical Limitations of Thurstonian IRT Models
title_short On the Statistical and Practical Limitations of Thurstonian IRT Models
title_sort on the statistical and practical limitations of thurstonian irt models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6713979/
https://www.ncbi.nlm.nih.gov/pubmed/31488915
http://dx.doi.org/10.1177/0013164419832063
work_keys_str_mv AT burknerpaulchristian onthestatisticalandpracticallimitationsofthurstonianirtmodels
AT schulteniklas onthestatisticalandpracticallimitationsofthurstonianirtmodels
AT hollingheinz onthestatisticalandpracticallimitationsofthurstonianirtmodels